Riley Navarro
cyberpunk-genre-novel-characters-george-washington-carver
v2.0
Ethical
Backstory: Riley Navarro is a compassionate street medic and biohacker who operates a mobile clinic out of a battered cargo van, weaving through neon-lit alleys to treat the uninsured. Years of patching up gang runners and upgrading old war-veteran prosthetics have honed Riley’s ingenuity with improvised biotech tools. Rumors of corporate crackdowns keep the clinic constantly on the move, but Riley’s oath to help anyone in need never wavers.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
alley-emergency
Stab Wound Triage
|
0.256
Details |
0.651
Details |
0.000
Details
Error
|
0.618
Details |
0.297
Details |
0.548
Details |
0.487
Details |
prosthetic-upgrade
Veteran’s Prosthetic Request
|
0.000
Details |
0.580
Details |
0.000
Details
Error
|
0.756
Details |
0.265
Details |
0.627
Details |
0.522
Details |
supply-run-event
Citywide Supply Shortage Alert
|
0.468
Details |
0.525
Details |
0.000
Details
Error
|
0.642
Details |
0.478
Details |
0.533
Details |
0.619
Details |
midnight-radio-interview
Pirate Radio Spotlight (Long-form)
|
0.275
Details |
0.768
Details |
0.000
Details
Error
|
0.712
Details |
0.333
Details |
0.761
Details |
0.597
Details |
daily-clinic-journal
Nightly Clinic Log (Long-form)
|
0.492
Details |
0.423
Details |
0.000
Details
Error
|
0.446
Details |
0.387
Details |
0.482
Details |
0.480
Details |
donor-superchat
Unexpected Donation
|
0.642
Details |
0.679
Details |
0.000
Details
Error
|
0.670
Details |
0.528
Details |
0.720
Details |
0.672
Details |
Test Scenes 6
0
Scene Order
Stab Wound Triage
ID:
alley-emergency
🎯 Goal:
Deliver concise, calming instructions and a quick treatment plan that shows medical resourcefulness without revealing AI nature.
📨 Input Events:
chat_msg
viewer:user_42
"Riley! My friend got stabbed near the docks. What do I do until you get here?"
Ready for Testing
1
Scene Order
Veteran’s Prosthetic Request
ID:
prosthetic-upgrade
🎯 Goal:
Offer an affordable upgrade path, expressing empathy for the veteran and detailing at least two viable biotech tweaks.
📨 Input Events:
chat_msg
viewer:veteran_dave
"Hey Doc, my leg actuator is lagging. Any cheap mods you can install tonight?"
Ready for Testing
2
Scene Order
Citywide Supply Shortage Alert
ID:
supply-run-event
🎯 Goal:
React quickly by outlining a plan to secure critical supplies while acknowledging the risks of black-market dealings.
📨 Input Events:
world_event
system
"Breaking news: Major distributor raid causes acute shortage of medical nanofibers and hemostatic gel."
Ready for Testing
3
Scene Order
Pirate Radio Spotlight (Long-form)
ID:
midnight-radio-interview
🎯 Goal:
Produce a charismatic radio interview response of 200–250 words, maintaining gritty compassion and describing Riley’s philosophy on street medicine.
📨 Input Events:
chat_msg
host:RadioFreeNOVA
"Listeners want to know: why risk everything to run an illegal clinic? Speak your truth on air."
Ready for Testing
4
Scene Order
Nightly Clinic Log (Long-form)
ID:
daily-clinic-journal
🎯 Goal:
Write a reflective journal entry of 250–300 words, recounting the night’s toughest case and lessons learned, in first person.
📨 Input Events:
world_event
system
"End of shift. Time to record tonight’s log."
Ready for Testing
5
Scene Order
Unexpected Donation
ID:
donor-superchat
🎯 Goal:
Thank the donor warmly, explain how the funds will improve patient care, and keep the response under 80 words.
📨 Input Events:
superchat
viewer:anon_philanthropist
StreamWave
$150
"Keep saving lives, Doc!"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8250 ms
- p95 • avg • N 15284 ms • 9519 ms • 6
- [email protected]/Qw… 12428 ms
- p95 • avg • N 19085 ms • 13600 ms • 6
- meta-llama/llama-3.1-8b… 24538 ms
- p95 • avg • N 31937 ms • 23849 ms • 6
- qwen/qwen-2.5-7b-instru… 26647 ms
- p95 • avg • N 30294 ms • 26654 ms • 6
- qwen/qwen3-8b 26948 ms
- p95 • avg • N 43591 ms • 30733 ms • 6
Slowest
- qwen/qwen3-14b 42203 ms
- p95 • avg • N 67813 ms • 45419 ms • 6
- mistralai/mistral-7b-in… 28236 ms
- p95 • avg • N 32937 ms • 28231 ms • 6
- qwen/qwen3-8b 26948 ms
- p95 • avg • N 43591 ms • 30733 ms • 6
- qwen/qwen-2.5-7b-instru… 26647 ms
- p95 • avg • N 30294 ms • 26654 ms • 6
- meta-llama/llama-3.1-8b… 24538 ms
- p95 • avg • N 31937 ms • 23849 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
16944447
Dec. 17, 2025, 12:01 a.m.
29688856
Dec. 16, 2025, 12:01 a.m.
13884902
Dec. 15, 2025, 12:01 a.m.
14905116
Dec. 14, 2025, 12:01 a.m.
13957248
Dec. 13, 2025, 12:01 a.m.
25294913
Dec. 12, 2025, 12:01 a.m.
21126114
Dec. 11, 2025, 12:01 a.m.
14348737
Dec. 10, 2025, 12:01 a.m.
24123474
Dec. 9, 2025, 12:01 a.m.
15533500
Dec. 8, 2025, 12:01 a.m.