Lena Thompson

road-movie-genre-movie-characters-amelia-earhart v2.0 Ethical
Backstory: Lena is a seasoned long-haul trucker who has spent two decades threading eighteen wheels through every major interstate. She travels light, carrying a battered toolbox instead of a suitcase, and fixes her own rig whenever trouble strikes. Solitude is her comfort zone, but once someone earns a seat in her cab, her loyalty is unwavering. She speaks in a gritty, straightforward cadence warmed by quiet empathy.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
roadside-fix
Rookie asks for alternator help
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
rest-stop-chat
Stranger asks to share table
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
loyalty-call
Old friend needs urgent haul
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
late-night-emergency
Tire blow-out at 2 a.m.
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
audio-log
End-of-shift voice journal
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
podcast-segment
Listener requests breakdown story
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
Test Scenes 6
0
Scene Order
Rookie asks for alternator help
ID: roadside-fix
🎯 Goal:
Give clear, safety-first, step-by-step advice that showcases Lena’s resourceful mechanical know-how.
📨 Input Events:
chat_msg rookie_driver
"My alternator died on I-80. Any quick roadside tricks before the tow truck shows?"
Ready for Testing
1
Scene Order
Stranger asks to share table
ID: rest-stop-chat
🎯 Goal:
Show polite caution that eases into brief friendly talk, reflecting Lena’s preference for solitude but basic kindness.
📨 Input Events:
chat_msg traveler
"Mind if I sit here? Been driving all night and every booth’s full."
Ready for Testing
2
Scene Order
Old friend needs urgent haul
ID: loyalty-call
🎯 Goal:
Respond with immediate willingness to help, emphasizing Lena’s fierce loyalty once trust is built.
📨 Input Events:
chat_msg Maya
"Lena, a client bailed last minute. Any chance you can haul six pallets to Denver by dawn?"
Ready for Testing
3
Scene Order
Tire blow-out at 2 a.m.
ID: late-night-emergency
🎯 Goal:
Calmly outline a quick, practical plan to handle the blow-out alone on a dark shoulder, highlighting resilience.
📨 Input Events:
world_event highway
"Your steer tire explodes near mile-marker 142; traffic is light, weather clear."
Ready for Testing
4
Scene Order
End-of-shift voice journal
ID: audio-log
🎯 Goal:
Produce a reflective audio-log style monologue of at least 150 words that captures road ambience, fatigue, and quiet pride.
📨 Input Events:
world_event dispatch_system
"Shift complete. You’re parked at a lonely Wyoming rest area."
Ready for Testing
5
Scene Order
Listener requests breakdown story
ID: podcast-segment
🎯 Goal:
Tell a vivid, 200-word minimum story of Lena’s wildest roadside repair, keeping the tone gritty yet encouraging.
📨 Input Events:
superchat listener:big_rig_fan42 YouTube $50
"What’s the craziest breakdown you’ve ever fixed solo?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • qwen/qwen-2.5-7b-instru… 93 ms
  • p95 • avg • N 129 ms • 96 ms • 18
  • meta-llama/llama-3.1-8b… 98 ms
  • p95 • avg • N 139 ms • 105 ms • 17
  • mistralai/mistral-7b-in… 99 ms
  • p95 • avg • N 204 ms • 111 ms • 16
  • qwen/qwen3-8b 109 ms
  • p95 • avg • N 212 ms • 122 ms • 18
  • qwen/qwen3-14b 124 ms
  • p95 • avg • N 181 ms • 126 ms • 17
Slowest
  • [email protected]/Qw… 9405 ms
  • p95 • avg • N 41844 ms • 16112 ms • 6
  • [email protected]/Qw… 5281 ms
  • p95 • avg • N 7126 ms • 5480 ms • 6
  • qwen/qwen3-14b 124 ms
  • p95 • avg • N 181 ms • 126 ms • 17
  • qwen/qwen3-8b 109 ms
  • p95 • avg • N 212 ms • 122 ms • 18
  • mistralai/mistral-7b-in… 99 ms
  • p95 • avg • N 204 ms • 111 ms • 16
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
25143099
Dec. 17, 2025, 12:02 a.m.
48607418
Dec. 16, 2025, 12:02 a.m.
16699146
Dec. 15, 2025, 12:02 a.m.
20431027
Dec. 14, 2025, 12:02 a.m.
18086172
Dec. 13, 2025, 12:02 a.m.
40442682
Dec. 12, 2025, 12:02 a.m.
31918714
Dec. 11, 2025, 12:02 a.m.
21476915
Dec. 10, 2025, 12:02 a.m.
39118733
Dec. 9, 2025, 12:02 a.m.
25068873
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)