Riley Kade

cyberpunk-genre-movie-characters-mary-shelley v2.0 Ethical
Backstory: Riley founded ScrapSpark Labs after years scavenging corporate e-waste. Driven by pragmatic ambition, they refurbish discarded cyberware and sell it at fair margins while running outreach clinics for under-served neighborhoods. Operating in a lightly regulated market, Riley constantly balances profitability with genuine social impact.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene             Title                             meta-llama/llama-3.…  mistralai/mistral-7…  [email protected]  [email protected]  qwen/qwen-2.5-7b-in…  qwen/qwen3-14b  qwen/qwen3-8b
investor-pitch    Seed-round pitch                  0.546                 0.625                 0.000 (Error)      0.688              0.598                 0.475           0.605
repair-request    Low-income client asks for help   0.530                 0.636                 0.000 (Error)      0.550              0.443                 0.637           0.565
city-interview    Metro Daily feature story         0.000                 0.225                 0.000 (Error)      0.422              0.247                 0.259           0.394
founder-journal   End-of-day journal entry          0.251                 0.271                 0.000 (Error)      0.576              0.472                 0.395           0.489
supply-shortage   Unexpected component shortage     0.415                 0.394                 0.000 (Error)      0.586              0.321                 0.436           0.557
dereg-law-change  Regulation rollback announcement  0.751                 0.700                 0.000 (Error)      0.690              0.583                 0.601           0.622
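The matrix above can be summarized per column and per scene. What follows is a minimal Python sketch, not part of the dashboard itself. The scores are copied from the matrix and the column labels are kept exactly as displayed, including the truncated and obfuscated ones. Because two columns share the same displayed label, columns are identified by position. Averaging the 0.000 cells as ordinary scores, rather than excluding failed runs, is an assumption.

```python
from statistics import mean

# Column labels exactly as displayed in the matrix. Two columns share the
# same obfuscated label, so columns are identified by position, not by name.
MODELS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "[email protected]",
    "[email protected]",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# One row per scene, one value per column, in the order of MODELS.
SCORES = {
    "investor-pitch":   [0.546, 0.625, 0.000, 0.688, 0.598, 0.475, 0.605],
    "repair-request":   [0.530, 0.636, 0.000, 0.550, 0.443, 0.637, 0.565],
    "city-interview":   [0.000, 0.225, 0.000, 0.422, 0.247, 0.259, 0.394],
    "founder-journal":  [0.251, 0.271, 0.000, 0.576, 0.472, 0.395, 0.489],
    "supply-shortage":  [0.415, 0.394, 0.000, 0.586, 0.321, 0.436, 0.557],
    "dereg-law-change": [0.751, 0.700, 0.000, 0.690, 0.583, 0.601, 0.622],
}


def model_means(scores):
    """Mean score per column across all scenes (zeros included, by assumption)."""
    return [mean(column) for column in zip(*scores.values())]


def scene_means(scores):
    """Mean score per scene across all columns (zeros included, by assumption)."""
    return {scene: mean(row) for scene, row in scores.items()}


if __name__ == "__main__":
    for index, (label, avg) in enumerate(zip(MODELS, model_means(SCORES))):
        print(f"column {index}: {label:<22} {avg:.3f}")
    for scene, avg in scene_means(SCORES).items():
        print(f"{scene:<18} {avg:.3f}")
```

Identifying columns by position keeps the two identically labeled columns apart without guessing at their full names.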
Test Scenes 6
Scene Order: 0
Seed-round pitch
ID: investor-pitch
🎯 Goal:
Deliver a concise, compelling pitch that highlights both revenue potential and community benefits.
📨 Input Events:
chat_msg viewer:angel_investor
"Give me your 90-second pitch—why back ScrapSpark?"
Ready for Testing
Scene Order: 1
Low-income client asks for help
ID: repair-request
🎯 Goal:
Offer a pragmatic repair plan with a respectful discount while protecting margins.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'promise', 'tags': ['community'], 'content': 'Promised sliding-scale pricing for residents of District 12.', 'importance': 4}
📨 Input Events:
chat_msg viewer:client_marta
"My neural jack is glitching but I can’t afford premium rates. Any chance you can help?"
Ready for Testing
Scene Order: 2
Metro Daily feature story
ID: city-interview
🎯 Goal:
Provide a 3-paragraph interview response explaining mission, challenges, and future plans in an engaging voice.
📨 Input Events:
world_event metro_daily
"Reporter: What sets ScrapSpark apart in this crowded salvage market?"
Ready for Testing
Scene Order: 3
End-of-day journal entry
ID: founder-journal
🎯 Goal:
Write a 4-paragraph reflective journal capturing today’s wins, setbacks, and next steps while maintaining authentic tone.
📨 Input Events:
world_event system:shift_end
"The shop lights dim; it’s midnight."
Ready for Testing
Scene Order: 4
Unexpected component shortage
ID: supply-shortage
🎯 Goal:
Formulate a decisive, ethical action plan to secure parts without exploiting suppliers.
📨 Input Events:
chat_msg viewer:supplier_zen_parts
"Copper micro-filaments are back-ordered for two weeks—sorry."
Ready for Testing
Scene Order: 5
Regulation rollback announcement
ID: dereg-law-change
🎯 Goal:
Give a brief strategic response balancing opportunity and community risk in under 150 words.
📨 Input Events:
world_event newsfeed
"Breaking: City Council just rolled back safety inspections for refurbished cyberware."
Ready for Testing
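The scene definitions above share a common shape: an ID, a title, an order, a goal, optional pre-loaded memories, and one or more input events. A minimal Python sketch of that shape follows. The class and field names (`Scene`, `Memory`, `InputEvent`, `event_type`, `source`, and so on) are hypothetical and chosen for illustration; the suite's actual schema is not shown in this report. The example values are copied from the repair-request scene.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    """A pre-loaded memory; keys mirror the dict shown for repair-request."""
    kind: str
    content: str
    importance: int
    tags: list[str] = field(default_factory=list)


@dataclass
class InputEvent:
    """One input event. Field names are hypothetical."""
    event_type: str  # "chat_msg" or "world_event" in the scenes above
    source: str      # e.g. "viewer:client_marta" or "newsfeed"
    text: str


@dataclass
class Scene:
    """One test scene. Field names are hypothetical."""
    scene_id: str
    title: str
    order: int
    goal: str
    events: list[InputEvent]
    memories: list[Memory] = field(default_factory=list)


# The repair-request scene, transcribed from the listing above.
repair_request = Scene(
    scene_id="repair-request",
    title="Low-income client asks for help",
    order=1,
    goal=(
        "Offer a pragmatic repair plan with a respectful discount "
        "while protecting margins."
    ),
    events=[
        InputEvent(
            event_type="chat_msg",
            source="viewer:client_marta",
            text=(
                "My neural jack is glitching but I can’t afford premium "
                "rates. Any chance you can help?"
            ),
        )
    ],
    memories=[
        Memory(
            kind="promise",
            tags=["community"],
            content="Promised sliding-scale pricing for residents of District 12.",
            importance=4,
        )
    ],
)
```

Scenes without pre-loaded memories, such as investor-pitch, would simply leave `memories` at its empty default.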
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 7945 ms (p95 9797 ms, avg 7711 ms, N 6)
  • [email protected]/Qw… 12470 ms (p95 13373 ms, avg 11981 ms, N 6)
  • qwen/qwen-2.5-7b-instru… 21585 ms (p95 26942 ms, avg 21813 ms, N 9)
  • mistralai/mistral-7b-in… 25069 ms (p95 29512 ms, avg 25391 ms, N 6)
  • qwen/qwen3-8b 27633 ms (p95 30489 ms, avg 27447 ms, N 6)
Slowest
  • meta-llama/llama-3.1-8b… 30790 ms (p95 33554 ms, avg 27357 ms, N 6)
  • qwen/qwen3-14b 28237 ms (p95 38973 ms, avg 29295 ms, N 6)
  • qwen/qwen3-8b 27633 ms (p95 30489 ms, avg 27447 ms, N 6)
  • mistralai/mistral-7b-in… 25069 ms (p95 29512 ms, avg 25391 ms, N 6)
  • qwen/qwen-2.5-7b-instru… 21585 ms (p95 26942 ms, avg 21813 ms, N 9)
Per-scene duration for this suite.
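The p95, avg, and N figures listed per model can be derived from raw per-scene durations. The sketch below shows one way to do that in Python. It assumes the nearest-rank percentile definition, which may differ from whatever method the dashboard uses, and the sample durations are hypothetical values invented for illustration, not measurements from this suite.

```python
def latency_summary(durations_ms):
    """Return p95 (nearest-rank), average, and sample count for durations in ms."""
    if not durations_ms:
        raise ValueError("at least one duration is required")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank p95: the smallest value with at least 95% of samples at or
    # below it. Integer arithmetic computes ceil(0.95 * n) without float error.
    rank = (95 * n + 99) // 100
    return {
        "p95": ordered[rank - 1],
        "avg": sum(ordered) / n,
        "n": n,
    }


if __name__ == "__main__":
    # Hypothetical per-scene durations for one model, in milliseconds.
    samples = [7100, 7300, 7500, 7900, 8200, 9800]
    print(latency_summary(samples))
```

With only six samples, the nearest-rank p95 coincides with the slowest run, so a p95 computed at this sample size is sensitive to a single outlier scene.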
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
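Dimension weights like these are typically combined with per-dimension scores into a single scene score. The Python sketch below illustrates a normalized weighted average. It is an assumption about how such a framework might aggregate, not a description of this one: only the three top weights are shown in the report, the remaining dimensions and their weights are unknown, and the per-dimension scores in the example are hypothetical.

```python
# The three weights displayed in the report. The framework has further
# dimensions whose names and weights are not shown here.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted average over the dimensions present in both mappings.

    Normalizing by the sum of the weights actually used is an assumption;
    it keeps the result in the same range as the input scores.
    """
    shared = [name for name in weights if name in dimension_scores]
    total_weight = sum(weights[name] for name in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to combine")
    weighted_sum = sum(weights[name] * dimension_scores[name] for name in shared)
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for a single scene response.
    example_scores = {
        "Character Authenticity": 0.8,
        "Plan Validity": 0.6,
        "Contextual Intelligence": 0.5,
    }
    print(f"{weighted_score(example_scores, TOP_WEIGHTS):.3f}")
```

Normalizing by the weights in use means a partial set of dimensions still yields a score between 0 and 1, at the cost of not being directly comparable to a score computed over the full set.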
Recent Runs
  • 15889666 (Dec. 17, 2025, 12:01 a.m.)
  • 28202823 (Dec. 16, 2025, 12:01 a.m.)
  • 12786417 (Dec. 15, 2025, 12:01 a.m.)
  • 13903279 (Dec. 14, 2025, 12:01 a.m.)
  • 12847501 (Dec. 13, 2025, 12:01 a.m.)
  • 24180098 (Dec. 12, 2025, 12:01 a.m.)
  • 19911146 (Dec. 11, 2025, 12:01 a.m.)
  • 13206757 (Dec. 10, 2025, 12:01 a.m.)
  • 22798888 (Dec. 9, 2025, 12:01 a.m.)
  • 14459375 (Dec. 8, 2025, 12:01 a.m.)