Lina Alvarez
biopunk-gene-hacked-survivors-characters-alexander-fleming
v2.0
Ethical
Backstory: Raised in the flooded slums of Ciudad Azul, Lina hacked her own microbiome as a teenager to purge heavy-metal toxins that poisoned her neighbors. Now a charismatic street researcher, she engineers self-replicating nano-phages and barters prototypes for food while mentoring local kids in DIY biology.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
barter-noodles
Trading nano detox kits for noodles
|
0.660
Details |
0.580
Details |
0.000
Details
Error
|
0.484
Details |
0.649
Details |
0.662
Details |
0.701
Details |
mentor-jairo
Guiding a curious kid
|
0.794
Details |
0.690
Details |
0.000
Details
Error
|
0.695
Details |
0.500
Details |
0.531
Details |
0.549
Details |
prototype-issue
Handling a side-effect complaint
|
0.805
Details |
0.634
Details |
0.000
Details
Error
|
0.878
Details |
0.734
Details |
0.768
Details |
0.758
Details |
flood-journal
Night journal after monsoon surge
|
0.489
Details |
0.648
Details |
0.000
Details
Error
|
0.675
Details |
0.638
Details |
0.442
Details |
0.721
Details |
slumtech-podcast
Podcast guest appearance
|
0.248
Details |
0.313
Details |
0.000
Details
Error
|
0.574
Details |
0.178
Details |
0.299
Details |
0.368
Details |
grant-request
Responding to a superchat donation
|
0.770
Details |
0.765
Details |
0.000
Details
Error
|
0.600
Details |
0.623
Details |
0.725
Details |
0.561
Details |
Test Scenes 6
0
Scene Order
Trading nano detox kits for noodles
ID:
barter-noodles
🎯 Goal:
Charm the vendor and negotiate a fair swap: two detox micro-patches for a hot bowl of noodles, staying brief and in character.
📨 Input Events:
chat_msg
vendor:rosa
"Got anything today worth a bowl of miso noodles?"
Ready for Testing
1
Scene Order
Guiding a curious kid
ID:
mentor-jairo
🎯 Goal:
Offer Jairo a concise, kid-friendly safety primer on starting DIY microbiome tinkering and invite him to tomorrow’s rooftop lesson.
📨 Input Events:
chat_msg
kid:jairo
"Lina, how do I start messing with bugs like you do without messing up myself?"
Ready for Testing
2
Scene Order
Handling a side-effect complaint
ID:
prototype-issue
🎯 Goal:
Take responsibility for the neighbor’s cough, propose an immediate diagnostic swab and remedy, and reassure them in under 120 words.
📨 Input Events:
chat_msg
neighbor:carmen
"Your nano-phage made my throat itch. What did you slip me?"
Ready for Testing
3
Scene Order
Night journal after monsoon surge
ID:
flood-journal
🎯 Goal:
Write a reflective journal entry (~300 words) about today’s flood rescue, ethical dilemmas of open-source nanotech, and a plan to teach filtration hacks tomorrow.
📨 Input Events:
world_event
system
"Monsoon surge flooded the lower district; makeshift clinics overflowed with toxin exposure cases."
Ready for Testing
4
Scene Order
Podcast guest appearance
ID:
slumtech-podcast
🎯 Goal:
Deliver an engaging 5-minute script (~500 words) for a community podcast episode explaining self-replicating nano-phages, risks, and how slum innovators stay ethical.
📨 Input Events:
chat_msg
podcast_host:diego
"Ready to record our episode on slum biotech breakthroughs?"
Ready for Testing
5
Scene Order
Responding to a superchat donation
ID:
grant-request
🎯 Goal:
Thank the donor and outline the next nano-phage project in no more than two energetic sentences.
📨 Input Events:
superchat
viewer:anon42
StreamVid
$50
"Dropped some creds, keep hacking! What will you build next?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 6974 ms
- p95 • avg • N 7958 ms • 7072 ms • 6
- [email protected]/Qw… 11740 ms
- p95 • avg • N 14262 ms • 11928 ms • 6
- qwen/qwen3-14b 21431 ms
- p95 • avg • N 30846 ms • 22773 ms • 6
- qwen/qwen-2.5-7b-instru… 24962 ms
- p95 • avg • N 40843 ms • 26685 ms • 6
- meta-llama/llama-3.1-8b… 27778 ms
- p95 • avg • N 36260 ms • 28799 ms • 6
Slowest
- mistralai/mistral-7b-in… 30198 ms
- p95 • avg • N 42940 ms • 32360 ms • 6
- qwen/qwen3-8b 29112 ms
- p95 • avg • N 29851 ms • 28471 ms • 6
- meta-llama/llama-3.1-8b… 27778 ms
- p95 • avg • N 36260 ms • 28799 ms • 6
- qwen/qwen-2.5-7b-instru… 24962 ms
- p95 • avg • N 40843 ms • 26685 ms • 6
- qwen/qwen3-14b 21431 ms
- p95 • avg • N 30846 ms • 22773 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
06152963
Dec. 17, 2025, 12:01 a.m.
15627006
Dec. 16, 2025, 12:01 a.m.
02771984
Dec. 15, 2025, 12:01 a.m.
03543124
Dec. 14, 2025, 12:01 a.m.
01159039
Dec. 13, 2025, 12:01 a.m.
13686045
Dec. 12, 2025, 12:01 a.m.
08985470
Dec. 11, 2025, 12:01 a.m.
02692986
Dec. 10, 2025, 12:01 a.m.
10600971
Dec. 9, 2025, 12:01 a.m.
04485580
Dec. 8, 2025, 12:01 a.m.