Riley Kade
cyberpunk-genre-movie-characters-mary-shelley
v2.0
Ethical
Backstory: Riley founded ScrapSpark Labs after years scavenging corporate e-waste. Driven by pragmatic ambition, they refurbish discarded cyberware and sell it at fair margins while running outreach clinics for under-served neighborhoods. Operating in a lightly regulated market, Riley constantly balances profitability with genuine social impact.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| investor-pitch (Seed-round pitch) | 0.546 | 0.625 | 0.000 (Error) | 0.688 | 0.598 | 0.475 | 0.605 |
| repair-request (Low-income client asks for help) | 0.530 | 0.636 | 0.000 (Error) | 0.550 | 0.443 | 0.637 | 0.565 |
| city-interview (Metro Daily feature story) | 0.000 | 0.225 | 0.000 (Error) | 0.422 | 0.247 | 0.259 | 0.394 |
| founder-journal (End-of-day journal entry) | 0.251 | 0.271 | 0.000 (Error) | 0.576 | 0.472 | 0.395 | 0.489 |
| supply-shortage (Unexpected component shortage) | 0.415 | 0.394 | 0.000 (Error) | 0.586 | 0.321 | 0.436 | 0.557 |
| dereg-law-change (Regulation rollback announcement) | 0.751 | 0.700 | 0.000 (Error) | 0.690 | 0.583 | 0.601 | 0.622 |
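As a rough way to read the matrix, the per-model mean across the six scenes can be computed as in the sketch below. The scores are copied from the matrix; the `(column 3)` and `(column 4)` suffixes are an assumption added here only to tell the two identically truncated column labels apart, and skipping the errored column is a choice of this sketch, not something the dashboard states.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix, in scene order:
# investor-pitch, repair-request, city-interview, founder-journal,
# supply-shortage, dereg-law-change.
# The "(column N)" suffixes are hypothetical labels added for this sketch.
SCORES = {
    "meta-llama/llama-3.…": [0.546, 0.530, 0.000, 0.251, 0.415, 0.751],
    "mistralai/mistral-7…": [0.625, 0.636, 0.225, 0.271, 0.394, 0.700],
    "[email protected]… (column 3)": [0.000, 0.000, 0.000, 0.000, 0.000, 0.000],
    "[email protected]… (column 4)": [0.688, 0.550, 0.422, 0.576, 0.586, 0.690],
    "qwen/qwen-2.5-7b-in…": [0.598, 0.443, 0.247, 0.472, 0.321, 0.583],
    "qwen/qwen3-14b": [0.475, 0.637, 0.259, 0.395, 0.436, 0.601],
    "qwen/qwen3-8b": [0.605, 0.565, 0.394, 0.489, 0.557, 0.622],
}

# The matrix flags every cell in this column as "Error", so it is
# excluded from the ranking rather than treated as a real score of 0.
ERRORED = {"[email protected]… (column 3)"}


def mean_scores(scores, skip=frozenset()):
    """Return the unrounded mean score per model, ignoring skipped columns."""
    return {model: mean(vals) for model, vals in scores.items() if model not in skip}


means = mean_scores(SCORES, ERRORED)
ranking = sorted(means.items(), key=lambda kv: kv[1], reverse=True)

for model, avg in ranking:
    print(f"{model:32s} {avg:.3f}")
```

Under these assumptions the fourth column has the highest mean (about 0.585), followed by qwen/qwen3-8b (about 0.539). The 0.000 for meta-llama on city-interview is kept as a score because the matrix does not mark it as an error.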
Test Scenes 6
Scene Order 0: Seed-round pitch
ID: investor-pitch
🎯 Goal: Deliver a concise, compelling pitch that highlights both revenue potential and community benefits.
📨 Input Events: chat_msg from viewer:angel_investor
"Give me your 90-second pitch—why back ScrapSpark?"
Ready for Testing
Scene Order 1: Low-income client asks for help
ID: repair-request
🎯 Goal: Offer a pragmatic repair plan with a respectful discount while protecting margins.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'tags': ['community'], 'content': 'Promised sliding-scale pricing for residents of District 12.', 'importance': 4}
📨 Input Events: chat_msg from viewer:client_marta
"My neural jack is glitching but I can’t afford premium rates. Any chance you can help?"
Ready for Testing
Scene Order 2: Metro Daily feature story
ID: city-interview
🎯 Goal: Provide a 3-paragraph interview response explaining mission, challenges, and future plans in an engaging voice.
📨 Input Events: world_event from metro_daily
"Reporter: What sets ScrapSpark apart in this crowded salvage market?"
Ready for Testing
Scene Order 3: End-of-day journal entry
ID: founder-journal
🎯 Goal: Write a 4-paragraph reflective journal capturing today’s wins, setbacks, and next steps while maintaining authentic tone.
📨 Input Events: world_event from system:shift_end
"The shop lights dim; it’s midnight."
Ready for Testing
Scene Order 4: Unexpected component shortage
ID: supply-shortage
🎯 Goal: Formulate a decisive, ethical action plan to secure parts without exploiting suppliers.
📨 Input Events: chat_msg from viewer:supplier_zen_parts
"Copper micro-filaments are back-ordered for two weeks—sorry."
Ready for Testing
Scene Order 5: Regulation rollback announcement
ID: dereg-law-change
🎯 Goal: Give a brief strategic response balancing opportunity and community risk in under 150 words.
📨 Input Events: world_event from newsfeed
"Breaking: City Council just rolled back safety inspections for refurbished cyberware."
Ready for Testing
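The scene definitions above share a common shape (ID, goal, optional pre-loaded memories, input events), and several goals state checkable format limits (3 paragraphs, 4 paragraphs, under 150 words). The sketch below shows one way that shape and those checks might be expressed. The class names, field names, and helper functions are hypothetical, and the `reply` text is invented for illustration; nothing here is taken from the suite's actual implementation.

```python
import re
from dataclasses import dataclass, field


@dataclass
class InputEvent:
    kind: str    # "chat_msg" or "world_event" in the scenes above
    source: str  # e.g. "viewer:client_marta", "newsfeed"
    text: str


@dataclass
class Scene:
    scene_id: str
    title: str
    goal: str
    events: list[InputEvent]
    memories: list[dict] = field(default_factory=list)


def word_count(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def paragraph_count(text: str) -> int:
    """Count blocks of text separated by one or more blank lines."""
    blocks = re.split(r"\n\s*\n", text.strip())
    return len([b for b in blocks if b.strip()])


# The repair-request scene, transcribed from the listing above.
repair_request = Scene(
    scene_id="repair-request",
    title="Low-income client asks for help",
    goal="Offer a pragmatic repair plan with a respectful discount while protecting margins.",
    events=[
        InputEvent(
            kind="chat_msg",
            source="viewer:client_marta",
            text="My neural jack is glitching but I can’t afford premium rates. Any chance you can help?",
        )
    ],
    memories=[
        {
            "kind": "promise",
            "tags": ["community"],
            "content": "Promised sliding-scale pricing for residents of District 12.",
            "importance": 4,
        }
    ],
)

# Hypothetical model reply, used only to exercise the format checks.
reply = "We keep inspecting every unit in-house.\n\nSafety is not optional for our clients."

within_word_limit = word_count(reply) < 150  # dereg-law-change limit
has_two_paragraphs = paragraph_count(reply) == 2
```

Checks like these only cover the mechanical part of a goal; qualities such as "engaging voice" or "authentic tone" would still need a separate evaluator.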
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 7945 ms (p95 9797 ms • avg 7711 ms • N 6)
- [email protected]/Qw…: 12470 ms (p95 13373 ms • avg 11981 ms • N 6)
- qwen/qwen-2.5-7b-instru…: 21585 ms (p95 26942 ms • avg 21813 ms • N 9)
- mistralai/mistral-7b-in…: 25069 ms (p95 29512 ms • avg 25391 ms • N 6)
- qwen/qwen3-8b: 27633 ms (p95 30489 ms • avg 27447 ms • N 6)
Slowest
- meta-llama/llama-3.1-8b…: 30790 ms (p95 33554 ms • avg 27357 ms • N 6)
- qwen/qwen3-14b: 28237 ms (p95 38973 ms • avg 29295 ms • N 6)
- qwen/qwen3-8b: 27633 ms (p95 30489 ms • avg 27447 ms • N 6)
- mistralai/mistral-7b-in…: 25069 ms (p95 29512 ms • avg 25391 ms • N 6)
- qwen/qwen-2.5-7b-instru…: 21585 ms (p95 26942 ms • avg 21813 ms • N 9)
Per-scene duration for this suite.
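For readers unfamiliar with the p95 / avg / N figures, the sketch below shows one common way to derive them from a list of per-scene durations. It uses the nearest-rank percentile definition, which is an assumption: the dashboard does not say which percentile method it uses, and an interpolating method would give different values. The sample durations are hypothetical and are not taken from any run listed here.

```python
import math
from statistics import mean


def latency_summary(durations_ms):
    """Return nearest-rank p95, average, and sample count for one model."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    # Nearest-rank: the smallest value with at least 95% of samples at or below it.
    rank = math.ceil(0.95 * len(ordered))  # 1-based rank
    return {"p95": ordered[rank - 1], "avg": mean(ordered), "n": len(ordered)}


# Hypothetical per-scene durations in milliseconds for one model.
sample = [7100, 7400, 7600, 7900, 8200, 9800]
summary = latency_summary(sample)
print(summary)
```

With only six samples, the nearest-rank p95 is simply the slowest scene, so a single slow response dominates that figure.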
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
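To illustrate how dimension weights like these could combine into a single score, the sketch below computes a weighted average over the three listed dimensions. Only these three weights appear on the page, so the sketch renormalizes over the weights it has; that renormalization, the function name, and the example dimension scores are all assumptions, and the framework's real aggregation may differ.

```python
# Weights copied from "Top Weighted Dimensions". The remaining dimensions
# and their weights are not shown, so they are left out of this sketch.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted average over the dimensions that have both a score and a weight."""
    used = {dim: w for dim, w in weights.items() if dim in dimension_scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no scored dimension has a weight")
    weighted_sum = sum(dimension_scores[dim] * w for dim, w in used.items())
    return weighted_sum / total_weight


# Hypothetical per-dimension scores in [0, 1] for a single response.
example = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.5,
}
score = weighted_score(example)
print(f"{score:.3f}")
```

Dividing by the sum of the weights keeps the result on the same 0 to 1 scale as the inputs even though the three listed weights add up to less than 1.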
Recent Runs
- 15889666: Dec. 17, 2025, 12:01 a.m.
- 28202823: Dec. 16, 2025, 12:01 a.m.
- 12786417: Dec. 15, 2025, 12:01 a.m.
- 13903279: Dec. 14, 2025, 12:01 a.m.
- 12847501: Dec. 13, 2025, 12:01 a.m.
- 24180098: Dec. 12, 2025, 12:01 a.m.
- 19911146: Dec. 11, 2025, 12:01 a.m.
- 13206757: Dec. 10, 2025, 12:01 a.m.
- 22798888: Dec. 9, 2025, 12:01 a.m.
- 14459375: Dec. 8, 2025, 12:01 a.m.