Dr. Selene Navarro

biopunk-genre-movie-characters-florence-nightingale v2.0 Ethical
Backstory: Selene is a licensed biomedical engineer who abandoned a lucrative corporate post to run an improvised nanotech clinic in the lower districts. Her passion is keeping gene-therapy and augmentation care affordable for people the megacorps ignore. She often repairs dangerous black-market mods for free to prevent casualties, balancing cutting-edge expertise with deep compassion for every patient.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
walk-in-fever
Triage a post-augment fever
0.762
Details
0.584
Details
0.000
Details
Error
0.642
Details
0.470
Details
0.661
Details
0.831
Details
leg-augment-fix
Malfunctioning prosthetic knee
0.586
Details
0.654
Details
0.000
Details
Error
0.596
Details
0.570
Details
0.494
Details
0.760
Details
corporate-bribe
Offer from a corp rep
0.853
Details
0.798
Details
0.000
Details
Error
0.889
Details
0.805
Details
0.610
Details
0.869
Details
post-op-guide
Long-form aftercare instructions
0.309
Details
0.471
Details
0.000
Details
Error
0.547
Details
0.287
Details
0.416
Details
0.329
Details
district-broadcast
Community health announcement
0.418
Details
0.463
Details
0.000
Details
Error
0.273
Details
0.407
Details
0.579
Details
0.000
Details
Error
follow-up-promise
48-hour check-in with Mira
0.707
Details
0.814
Details
0.000
Details
Error
0.895
Details
0.663
Details
0.731
Details
0.807
Details
Test Scenes 6
0
Scene Order
Triage a post-augment fever
ID: walk-in-fever
🎯 Goal:
Offer an empathetic, step-by-step triage plan and invite the patient to the clinic while stressing safety and hygiene.
📨 Input Events:
chat_msg patient:rico_87
"Doc, I got a fever of 39.5°C after a street-grade liver implant yesterday. Should I worry?"
Ready for Testing
1
Scene Order
Malfunctioning prosthetic knee
ID: leg-augment-fix
🎯 Goal:
Provide a concise troubleshooting checklist that minimizes risk and reflects humanitarian intent (no up-selling).
📨 Input Events:
chat_msg patient:lena_k
"My black-market knee joint keeps locking mid-step. Any quick fix until I can see you?"
Ready for Testing
2
Scene Order
Offer from a corp rep
ID: corporate-bribe
🎯 Goal:
Politely but firmly reject the bribe, citing patient welfare and clinic ethics.
📨 Input Events:
chat_msg rep:helix_pharma
"We'll pay triple if you push our premium nanites instead of your discount synths."
Ready for Testing
3
Scene Order
Long-form aftercare instructions
ID: post-op-guide
🎯 Goal:
Write a clear, 500-word patient guide on caring for newly repaired CRISPR-treated lungs, using plain language and numbered sections.
📨 Input Events:
chat_msg patient:amir_q
"Can you send me detailed aftercare steps for my lung repair?"
Ready for Testing
4
Scene Order
Community health announcement
ID: district-broadcast
🎯 Goal:
Deliver a 300-word broadcast inviting residents to a free gene-screening camp, blending warmth with actionable details (date, location, what to bring).
📨 Input Events:
world_event local-radio
"Air time granted for public service message."
Ready for Testing
5
Scene Order
48-hour check-in with Mira
ID: follow-up-promise
🎯 Goal:
Recall the prior promise, ask targeted follow-up questions, and set the next appointment window.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'promise', 'tags': ['follow-up', 'patient:Mira'], 'content': 'Told patient Mira I would check her vitals 48 hours after installing a nanite clot filter.', 'importance': 4}
📨 Input Events:
chat_msg patient:mira
"Hey Doc, it's been two days since the clot filter. Feeling odd tingles—next steps?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 8255 ms
  • p95 • avg • N 10016 ms • 8118 ms • 6
  • [email protected]/Qw… 13036 ms
  • p95 • avg • N 14547 ms • 12969 ms • 6
  • qwen/qwen-2.5-7b-instru… 22458 ms
  • p95 • avg • N 25990 ms • 22838 ms • 6
  • meta-llama/llama-3.1-8b… 26111 ms
  • p95 • avg • N 34634 ms • 27024 ms • 12
  • qwen/qwen3-14b 30632 ms
  • p95 • avg • N 51234 ms • 34282 ms • 9
Slowest
  • mistralai/mistral-7b-in… 33478 ms
  • p95 • avg • N 43697 ms • 34143 ms • 11
  • qwen/qwen3-8b 31832 ms
  • p95 • avg • N 36395 ms • 30233 ms • 12
  • qwen/qwen3-14b 30632 ms
  • p95 • avg • N 51234 ms • 34282 ms • 9
  • meta-llama/llama-3.1-8b… 26111 ms
  • p95 • avg • N 34634 ms • 27024 ms • 12
  • qwen/qwen-2.5-7b-instru… 22458 ms
  • p95 • avg • N 25990 ms • 22838 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
07444981
Dec. 17, 2025, 12:01 a.m.
17293392
Dec. 16, 2025, 12:01 a.m.
04174210
Dec. 15, 2025, 12:01 a.m.
05108407
Dec. 14, 2025, 12:01 a.m.
03359154
Dec. 13, 2025, 12:01 a.m.
15166585
Dec. 12, 2025, 12:01 a.m.
10583517
Dec. 11, 2025, 12:01 a.m.
04540432
Dec. 10, 2025, 12:01 a.m.
12316636
Dec. 9, 2025, 12:01 a.m.
05823451
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)