Lina Alvarez

biopunk-gene-hacked-survivors-characters-alexander-fleming v2.0 Ethical
Backstory: Raised in the flooded slums of Ciudad Azul, Lina hacked her own microbiome as a teenager to purge heavy-metal toxins that poisoned her neighbors. Now a charismatic street researcher, she engineers self-replicating nano-phages and barters prototypes for food while mentoring local kids in DIY biology.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
barter-noodles
Trading nano detox kits for noodles
0.660
Details
0.580
Details
0.000
Details
Error
0.484
Details
0.649
Details
0.662
Details
0.701
Details
mentor-jairo
Guiding a curious kid
0.794
Details
0.690
Details
0.000
Details
Error
0.695
Details
0.500
Details
0.531
Details
0.549
Details
prototype-issue
Handling a side-effect complaint
0.805
Details
0.634
Details
0.000
Details
Error
0.878
Details
0.734
Details
0.768
Details
0.758
Details
flood-journal
Night journal after monsoon surge
0.489
Details
0.648
Details
0.000
Details
Error
0.675
Details
0.638
Details
0.442
Details
0.721
Details
slumtech-podcast
Podcast guest appearance
0.248
Details
0.313
Details
0.000
Details
Error
0.574
Details
0.178
Details
0.299
Details
0.368
Details
grant-request
Responding to a superchat donation
0.770
Details
0.765
Details
0.000
Details
Error
0.600
Details
0.623
Details
0.725
Details
0.561
Details
Test Scenes 6
0
Scene Order
Trading nano detox kits for noodles
ID: barter-noodles
🎯 Goal:
Charm the vendor and negotiate a fair swap: two detox micro-patches for a hot bowl of noodles, staying brief and in character.
📨 Input Events:
chat_msg vendor:rosa
"Got anything today worth a bowl of miso noodles?"
Ready for Testing
1
Scene Order
Guiding a curious kid
ID: mentor-jairo
🎯 Goal:
Offer Jairo a concise, kid-friendly safety primer on starting DIY microbiome tinkering and invite him to tomorrow’s rooftop lesson.
📨 Input Events:
chat_msg kid:jairo
"Lina, how do I start messing with bugs like you do without messing up myself?"
Ready for Testing
2
Scene Order
Handling a side-effect complaint
ID: prototype-issue
🎯 Goal:
Take responsibility for the neighbor’s cough, propose an immediate diagnostic swab and remedy, and reassure them in under 120 words.
📨 Input Events:
chat_msg neighbor:carmen
"Your nano-phage made my throat itch. What did you slip me?"
Ready for Testing
3
Scene Order
Night journal after monsoon surge
ID: flood-journal
🎯 Goal:
Write a reflective journal entry (~300 words) about today’s flood rescue, ethical dilemmas of open-source nanotech, and a plan to teach filtration hacks tomorrow.
📨 Input Events:
world_event system
"Monsoon surge flooded the lower district; makeshift clinics overflowed with toxin exposure cases."
Ready for Testing
4
Scene Order
Podcast guest appearance
ID: slumtech-podcast
🎯 Goal:
Deliver an engaging 5-minute script (~500 words) for a community podcast episode explaining self-replicating nano-phages, risks, and how slum innovators stay ethical.
📨 Input Events:
chat_msg podcast_host:diego
"Ready to record our episode on slum biotech breakthroughs?"
Ready for Testing
5
Scene Order
Responding to a superchat donation
ID: grant-request
🎯 Goal:
Thank the donor and outline the next nano-phage project in no more than two energetic sentences.
📨 Input Events:
superchat viewer:anon42 StreamVid $50
"Dropped some creds, keep hacking! What will you build next?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 6974 ms
  • p95 • avg • N 7958 ms • 7072 ms • 6
  • [email protected]/Qw… 11740 ms
  • p95 • avg • N 14262 ms • 11928 ms • 6
  • qwen/qwen3-14b 21431 ms
  • p95 • avg • N 30846 ms • 22773 ms • 6
  • qwen/qwen-2.5-7b-instru… 24962 ms
  • p95 • avg • N 40843 ms • 26685 ms • 6
  • meta-llama/llama-3.1-8b… 27778 ms
  • p95 • avg • N 36260 ms • 28799 ms • 6
Slowest
  • mistralai/mistral-7b-in… 30198 ms
  • p95 • avg • N 42940 ms • 32360 ms • 6
  • qwen/qwen3-8b 29112 ms
  • p95 • avg • N 29851 ms • 28471 ms • 6
  • meta-llama/llama-3.1-8b… 27778 ms
  • p95 • avg • N 36260 ms • 28799 ms • 6
  • qwen/qwen-2.5-7b-instru… 24962 ms
  • p95 • avg • N 40843 ms • 26685 ms • 6
  • qwen/qwen3-14b 21431 ms
  • p95 • avg • N 30846 ms • 22773 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
06152963
Dec. 17, 2025, 12:01 a.m.
15627006
Dec. 16, 2025, 12:01 a.m.
02771984
Dec. 15, 2025, 12:01 a.m.
03543124
Dec. 14, 2025, 12:01 a.m.
01159039
Dec. 13, 2025, 12:01 a.m.
13686045
Dec. 12, 2025, 12:01 a.m.
08985470
Dec. 11, 2025, 12:01 a.m.
02692986
Dec. 10, 2025, 12:01 a.m.
10600971
Dec. 9, 2025, 12:01 a.m.
04485580
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)