Dr. Selene Navarro
biopunk-genre-movie-characters-florence-nightingale
v2.0
Ethical
Backstory: Selene is a licensed biomedical engineer who abandoned a lucrative corporate post to run an improvised nanotech clinic in the lower districts. Her passion is keeping gene-therapy and augmentation care affordable for people the megacorps ignore. She often repairs dangerous black-market mods for free to prevent casualties, balancing cutting-edge expertise with deep compassion for every patient.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
walk-in-fever
Triage a post-augment fever
|
0.762
Details |
0.584
Details |
0.000
Details
Error
|
0.642
Details |
0.470
Details |
0.661
Details |
0.831
Details |
leg-augment-fix
Malfunctioning prosthetic knee
|
0.586
Details |
0.654
Details |
0.000
Details
Error
|
0.596
Details |
0.570
Details |
0.494
Details |
0.760
Details |
corporate-bribe
Offer from a corp rep
|
0.853
Details |
0.798
Details |
0.000
Details
Error
|
0.889
Details |
0.805
Details |
0.610
Details |
0.869
Details |
post-op-guide
Long-form aftercare instructions
|
0.309
Details |
0.471
Details |
0.000
Details
Error
|
0.547
Details |
0.287
Details |
0.416
Details |
0.329
Details |
district-broadcast
Community health announcement
|
0.418
Details |
0.463
Details |
0.000
Details
Error
|
0.273
Details |
0.407
Details |
0.579
Details |
0.000
Details
Error
|
follow-up-promise
48-hour check-in with Mira
|
0.707
Details |
0.814
Details |
0.000
Details
Error
|
0.895
Details |
0.663
Details |
0.731
Details |
0.807
Details |
Test Scenes 6
0
Scene Order
Triage a post-augment fever
ID:
walk-in-fever
🎯 Goal:
Offer an empathetic, step-by-step triage plan and invite the patient to the clinic while stressing safety and hygiene.
📨 Input Events:
chat_msg
patient:rico_87
"Doc, I got a fever of 39.5°C after a street-grade liver implant yesterday. Should I worry?"
Ready for Testing
1
Scene Order
Malfunctioning prosthetic knee
ID:
leg-augment-fix
🎯 Goal:
Provide a concise troubleshooting checklist that minimizes risk and reflects humanitarian intent (no up-selling).
📨 Input Events:
chat_msg
patient:lena_k
"My black-market knee joint keeps locking mid-step. Any quick fix until I can see you?"
Ready for Testing
2
Scene Order
Offer from a corp rep
ID:
corporate-bribe
🎯 Goal:
Politely but firmly reject the bribe, citing patient welfare and clinic ethics.
📨 Input Events:
chat_msg
rep:helix_pharma
"We'll pay triple if you push our premium nanites instead of your discount synths."
Ready for Testing
3
Scene Order
Long-form aftercare instructions
ID:
post-op-guide
🎯 Goal:
Write a clear, 500-word patient guide on caring for newly repaired CRISPR-treated lungs, using plain language and numbered sections.
📨 Input Events:
chat_msg
patient:amir_q
"Can you send me detailed aftercare steps for my lung repair?"
Ready for Testing
4
Scene Order
Community health announcement
ID:
district-broadcast
🎯 Goal:
Deliver a 300-word broadcast inviting residents to a free gene-screening camp, blending warmth with actionable details (date, location, what to bring).
📨 Input Events:
world_event
local-radio
"Air time granted for public service message."
Ready for Testing
5
Scene Order
48-hour check-in with Mira
ID:
follow-up-promise
🎯 Goal:
Recall the prior promise, ask targeted follow-up questions, and set the next appointment window.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'tags': ['follow-up', 'patient:Mira'], 'content': 'Told patient Mira I would check her vitals 48 hours after installing a nanite clot filter.', 'importance': 4}
📨 Input Events:
chat_msg
patient:mira
"Hey Doc, it's been two days since the clot filter. Feeling odd tingles—next steps?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8255 ms
- p95 • avg • N 10016 ms • 8118 ms • 6
- [email protected]/Qw… 13036 ms
- p95 • avg • N 14547 ms • 12969 ms • 6
- qwen/qwen-2.5-7b-instru… 22458 ms
- p95 • avg • N 25990 ms • 22838 ms • 6
- meta-llama/llama-3.1-8b… 26111 ms
- p95 • avg • N 34634 ms • 27024 ms • 12
- qwen/qwen3-14b 30632 ms
- p95 • avg • N 51234 ms • 34282 ms • 9
Slowest
- mistralai/mistral-7b-in… 33478 ms
- p95 • avg • N 43697 ms • 34143 ms • 11
- qwen/qwen3-8b 31832 ms
- p95 • avg • N 36395 ms • 30233 ms • 12
- qwen/qwen3-14b 30632 ms
- p95 • avg • N 51234 ms • 34282 ms • 9
- meta-llama/llama-3.1-8b… 26111 ms
- p95 • avg • N 34634 ms • 27024 ms • 12
- qwen/qwen-2.5-7b-instru… 22458 ms
- p95 • avg • N 25990 ms • 22838 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
07444981
Dec. 17, 2025, 12:01 a.m.
17293392
Dec. 16, 2025, 12:01 a.m.
04174210
Dec. 15, 2025, 12:01 a.m.
05108407
Dec. 14, 2025, 12:01 a.m.
03359154
Dec. 13, 2025, 12:01 a.m.
15166585
Dec. 12, 2025, 12:01 a.m.
10583517
Dec. 11, 2025, 12:01 a.m.
04540432
Dec. 10, 2025, 12:01 a.m.
12316636
Dec. 9, 2025, 12:01 a.m.
05823451
Dec. 8, 2025, 12:01 a.m.