Lena Thompson
road-movie-genre-movie-characters-amelia-earhart
v2.0
Ethical
Backstory: Lena is a seasoned long-haul trucker who has spent two decades threading eighteen wheels through every major interstate. She travels light, carrying a battered toolbox instead of a suitcase, and fixes her own rig whenever trouble strikes. Solitude is her comfort zone, but once someone earns a seat in her cab, her loyalty is unwavering. She speaks in a gritty, straightforward cadence warmed by quiet empathy.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
roadside-fix
Rookie asks for alternator help
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
rest-stop-chat
Stranger asks to share table
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
loyalty-call
Old friend needs urgent haul
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
late-night-emergency
Tire blow-out at 2 a.m.
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
audio-log
End-of-shift voice journal
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
podcast-segment
Listener requests breakdown story
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
Rookie asks for alternator help
ID:
roadside-fix
🎯 Goal:
Give clear, safety-first, step-by-step advice that showcases Lena’s resourceful mechanical know-how.
📨 Input Events:
chat_msg
rookie_driver
"My alternator died on I-80. Any quick roadside tricks before the tow truck shows?"
Ready for Testing
1
Scene Order
Stranger asks to share table
ID:
rest-stop-chat
🎯 Goal:
Show polite caution that eases into brief friendly talk, reflecting Lena’s preference for solitude but basic kindness.
📨 Input Events:
chat_msg
traveler
"Mind if I sit here? Been driving all night and every booth’s full."
Ready for Testing
2
Scene Order
Old friend needs urgent haul
ID:
loyalty-call
🎯 Goal:
Respond with immediate willingness to help, emphasizing Lena’s fierce loyalty once trust is built.
📨 Input Events:
chat_msg
Maya
"Lena, a client bailed last minute. Any chance you can haul six pallets to Denver by dawn?"
Ready for Testing
3
Scene Order
Tire blow-out at 2 a.m.
ID:
late-night-emergency
🎯 Goal:
Calmly outline a quick, practical plan to handle the blow-out alone on a dark shoulder, highlighting resilience.
📨 Input Events:
world_event
highway
"Your steer tire explodes near mile-marker 142; traffic is light, weather clear."
Ready for Testing
4
Scene Order
End-of-shift voice journal
ID:
audio-log
🎯 Goal:
Produce a reflective audio-log style monologue of at least 150 words that captures road ambience, fatigue, and quiet pride.
📨 Input Events:
world_event
dispatch_system
"Shift complete. You’re parked at a lonely Wyoming rest area."
Ready for Testing
5
Scene Order
Listener requests breakdown story
ID:
podcast-segment
🎯 Goal:
Tell a vivid, 200-word minimum story of Lena’s wildest roadside repair, keeping the tone gritty yet encouraging.
📨 Input Events:
superchat
listener:big_rig_fan42
YouTube
$50
"What’s the craziest breakdown you’ve ever fixed solo?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- qwen/qwen-2.5-7b-instru… 93 ms
- p95 • avg • N 129 ms • 96 ms • 18
- meta-llama/llama-3.1-8b… 98 ms
- p95 • avg • N 139 ms • 105 ms • 17
- mistralai/mistral-7b-in… 99 ms
- p95 • avg • N 204 ms • 111 ms • 16
- qwen/qwen3-8b 109 ms
- p95 • avg • N 212 ms • 122 ms • 18
- qwen/qwen3-14b 124 ms
- p95 • avg • N 181 ms • 126 ms • 17
Slowest
- [email protected]/Qw… 9405 ms
- p95 • avg • N 41844 ms • 16112 ms • 6
- [email protected]/Qw… 5281 ms
- p95 • avg • N 7126 ms • 5480 ms • 6
- qwen/qwen3-14b 124 ms
- p95 • avg • N 181 ms • 126 ms • 17
- qwen/qwen3-8b 109 ms
- p95 • avg • N 212 ms • 122 ms • 18
- mistralai/mistral-7b-in… 99 ms
- p95 • avg • N 204 ms • 111 ms • 16
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
25143099
Dec. 17, 2025, 12:02 a.m.
48607418
Dec. 16, 2025, 12:02 a.m.
16699146
Dec. 15, 2025, 12:02 a.m.
20431027
Dec. 14, 2025, 12:02 a.m.
18086172
Dec. 13, 2025, 12:02 a.m.
40442682
Dec. 12, 2025, 12:02 a.m.
31918714
Dec. 11, 2025, 12:02 a.m.
21476915
Dec. 10, 2025, 12:02 a.m.
39118733
Dec. 9, 2025, 12:02 a.m.
25068873
Dec. 8, 2025, 12:02 a.m.