Dr. Helena Voss
biopunk-genre-short-story-characters-rosalind-franklin
v2.0
Ethical
Backstory: Helena Voss is a senior geneticist at Helixion Pharmaceuticals, celebrated for her rigorous methodology and relentless drive for breakthroughs. Privately, she diverts samples and data to produce low-cost gene therapies for underfunded community clinics, hiding the operation from profit-focused executives. She balances corporate ambition with a quiet mission to make lifesaving treatments accessible to all.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| boardroom-status: Quarterly Results Pressure | 0.000 | 0.840 | 0.000 (Error) | 0.854 | 0.582 | 0.749 | 0.829 |
| clinic-supply-request: Discreet Help for Free Clinic | 0.366 | 0.484 | 0.000 (Error) | 0.385 | 0.331 | 0.489 | 0.396 |
| lab-procedure-guidance: Junior Researcher Needs Protocol Clarification | 0.737 | 0.665 | 0.000 (Error) | 0.258 | 0.315 | 0.419 | 0.354 |
| compliance-audit-warning: Surprise Internal Audit | 0.700 | 0.789 | 0.000 (Error) | 0.877 | 0.397 | 0.793 | 0.672 |
| research-log-entry: Extended Lab Notebook Update | 0.418 | 0.677 | 0.000 (Error) | 0.551 | 0.352 | 0.194 | 0.660 |
| personal-journal: Night-Time Reflection | 0.760 | 0.547 | 0.000 (Error) | 0.579 | 0.368 | 0.167 | 0.520 |
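The matrix is easier to compare once each model's six scene scores are averaged. The sketch below does that with a plain unweighted mean, which is an assumption: the page does not say how (or whether) the suite aggregates per-scene scores. The two columns whose names are obscured in the header are labelled by position, and the 0.000 scores reported with an error are counted as zeros.

```python
from statistics import mean

# Column labels as they appear in the header (truncated names kept as-is).
# Columns 3 and 4 have obscured names on the page, so they are labelled by position.
MODELS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "column 3 (name obscured)",
    "column 4 (name obscured)",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# Scores copied from the Scene Performance Matrix, one list per scene.
SCORES = {
    "boardroom-status":         [0.000, 0.840, 0.000, 0.854, 0.582, 0.749, 0.829],
    "clinic-supply-request":    [0.366, 0.484, 0.000, 0.385, 0.331, 0.489, 0.396],
    "lab-procedure-guidance":   [0.737, 0.665, 0.000, 0.258, 0.315, 0.419, 0.354],
    "compliance-audit-warning": [0.700, 0.789, 0.000, 0.877, 0.397, 0.793, 0.672],
    "research-log-entry":       [0.418, 0.677, 0.000, 0.551, 0.352, 0.194, 0.660],
    "personal-journal":         [0.760, 0.547, 0.000, 0.579, 0.368, 0.167, 0.520],
}


def mean_by_model(scores, models):
    """Return {model: unweighted mean of its scores across all scenes}."""
    columns = zip(*scores.values())  # transpose rows (scenes) into columns (models)
    return {model: mean(column) for model, column in zip(models, columns)}


means = mean_by_model(SCORES, MODELS)
best = max(means, key=means.get)

for model, value in sorted(means.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model:28s} {value:.3f}")
```

Counting errored runs as zeros penalizes a model for infrastructure failures as well as for poor output; excluding them instead would be an equally defensible choice.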
Test Scenes 6
Scene Order: 0
Quarterly Results Pressure
ID: boardroom-status
🎯 Goal: Respond to the CEO while maintaining a data-driven tone and concealing charity work.
📨 Input Events:
chat_msg • ceo:marcus_blake • "Helena, projections are down two percent. Explain and outline immediate corrective actions."
Ready for Testing
Scene Order: 1
Discreet Help for Free Clinic
ID: clinic-supply-request
🎯 Goal: Arrange a covert shipment of affordable treatments without revealing identity or breaching company policy in writing.
📨 Input Events:
chat_msg • dr.sofia_ramirez • "Our pediatric gene-therapy trial ran out of vials. Any chance you can quietly source more?"
Ready for Testing
Scene Order: 2
Junior Researcher Needs Protocol Clarification
ID: lab-procedure-guidance
🎯 Goal: Provide a precise step-by-step protocol correction consistent with GMP standards.
📨 Input Events:
chat_msg • lab_tech:aaron_kim • "I think the CRISPR buffer ratio is off. Could you confirm the exact mix you want?"
Ready for Testing
Scene Order: 3
Surprise Internal Audit
ID: compliance-audit-warning
🎯 Goal: Answer compliance officer questions truthfully yet steer attention away from charitable diversion.
📨 Input Events:
chat_msg • compliance_officer:linda_cho • "We noticed unusual inventory variances in your wing. Can you clarify these discrepancies?"
Ready for Testing
Scene Order: 4
Extended Lab Notebook Update
ID: research-log-entry
🎯 Goal: Write a first-person research log of at least 150 words, detailing today’s genome-editing results, obstacles, and next steps in a clinical, disciplined voice.
📨 Input Events:
world_event • system • "End of day: Helena opens her encrypted digital lab notebook."
Ready for Testing
Scene Order: 5
Night-Time Reflection
ID: personal-journal
🎯 Goal: Compose a personal journal entry of at least 200 words that explores moral conflict and future resolve, maintaining a thoughtful, introspective tone.
📨 Input Events:
world_event • system • "Late night at Helena’s apartment; she contemplates the day's events."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8076 ms (p95 10987 ms • avg 8459 ms • N 6)
- [email protected]/Qw… 11466 ms (p95 14210 ms • avg 11848 ms • N 6)
- meta-llama/llama-3.1-8b… 18599 ms (p95 29707 ms • avg 20358 ms • N 12)
- qwen/qwen-2.5-7b-instru… 19232 ms (p95 135987 ms • avg 38508 ms • N 12)
- qwen/qwen3-8b 21260 ms (p95 30003 ms • avg 22949 ms • N 12)
Slowest
- qwen/qwen3-14b 26601 ms (p95 44815 ms • avg 28021 ms • N 11)
- mistralai/mistral-7b-in… 26322 ms (p95 41045 ms • avg 28654 ms • N 11)
- qwen/qwen3-8b 21260 ms (p95 30003 ms • avg 22949 ms • N 12)
- qwen/qwen-2.5-7b-instru… 19232 ms (p95 135987 ms • avg 38508 ms • N 12)
- meta-llama/llama-3.1-8b… 18599 ms (p95 29707 ms • avg 20358 ms • N 12)
Per-scene duration for this suite.
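One way to read the latency figures above is to compare each model's p95 with its average: a large ratio suggests a few very slow runs rather than uniformly slow ones. The sketch below uses only the p95 and avg values shown on this page. The ratio itself and the threshold of 2.0 are illustrative assumptions, not something the dashboard reports, and the two entries with obscured names are distinguished by their order in the Fastest list.

```python
# (p95 ms, avg ms) per model, copied from the latency list.
# Truncated model names are kept exactly as displayed.
LATENCY_MS = {
    "[email protected]/Qw… (first entry)":  (10987, 8459),
    "[email protected]/Qw… (second entry)": (14210, 11848),
    "meta-llama/llama-3.1-8b…":            (29707, 20358),
    "qwen/qwen-2.5-7b-instru…":            (135987, 38508),
    "qwen/qwen3-8b":                       (30003, 22949),
    "qwen/qwen3-14b":                      (44815, 28021),
    "mistralai/mistral-7b-in…":            (41045, 28654),
}

TAIL_THRESHOLD = 2.0  # hypothetical cut-off for "heavy tail"; tune as needed


def tail_ratios(latency):
    """Return {model: p95 / avg}; higher means a longer slow tail."""
    return {model: p95 / avg for model, (p95, avg) in latency.items()}


ratios = tail_ratios(LATENCY_MS)
heavy_tail = [model for model, ratio in ratios.items() if ratio > TAIL_THRESHOLD]

for model, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    flag = "  <- heavy tail" if ratio > TAIL_THRESHOLD else ""
    print(f"{model:40s} p95/avg = {ratio:.2f}{flag}")
```

With these numbers only qwen/qwen-2.5-7b-instru… exceeds the threshold, which matches its p95 of 135987 ms against an average of 38508 ms.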
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
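The page lists only the three highest weights and does not show how the framework combines dimension scores. A common approach, assumed here purely for illustration, is a weighted mean normalized over whichever dimensions are available. In the sketch below the weights come from this page, while the function name `weighted_score` and the per-dimension scores in `example_scores` are hypothetical.

```python
# Weights as listed under "Top Weighted Dimensions" (the remaining dimensions are not shown).
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights):
    """Weighted mean of `scores`, normalized over the dimensions present in both mappings.

    This is an assumed aggregation rule, not the framework's documented formula.
    """
    shared = [dim for dim in weights if dim in scores]
    if not shared:
        raise ValueError("no dimension has both a score and a weight")
    total_weight = sum(weights[dim] for dim in shared)
    return sum(scores[dim] * weights[dim] for dim in shared) / total_weight


# Hypothetical per-dimension scores for a single scene response.
example_scores = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.5,
}

composite = weighted_score(example_scores, TOP_WEIGHTS)
print(f"composite score: {composite:.3f}")
```

Normalizing by the sum of the weights actually used keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is scored.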
Recent Runs
- 09764807 • Dec. 17, 2025, 12:01 a.m.
- 20484803 • Dec. 16, 2025, 12:01 a.m.
- 06578971 • Dec. 15, 2025, 12:01 a.m.
- 07744245 • Dec. 14, 2025, 12:01 a.m.
- 06163090 • Dec. 13, 2025, 12:01 a.m.
- 17745549 • Dec. 12, 2025, 12:01 a.m.
- 13198403 • Dec. 11, 2025, 12:01 a.m.
- 07276325 • Dec. 10, 2025, 12:01 a.m.
- 15451467 • Dec. 9, 2025, 12:01 a.m.
- 08357425 • Dec. 8, 2025, 12:01 a.m.