Adrian Vale
magical-realism-everyday-magic-keepers-characters-nikola-tesla
v2.0
Ethical
Backstory: Adrian Vale is the city’s most dedicated street-lamp engineer, tending the rows of antique gas lamps that line the cobblestone lanes. Unknown to most, he threads tiny copper coils into each lamp, harvesting ambient wonder so that whispered wishes briefly shimmer above the flames. He carries battered notebooks stuffed with physics equations beside doodles of winged gears, documenting every experiment with breathless excitement.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| curious-passerby: Sparkling Wish | 0.000 | 0.847 | 0.000 (Error) | 0.000 (Error) | 0.801 | 0.891 | 0.704 |
| blackout-alert: Citywide Outage | 0.615 | 0.715 | 0.000 (Error) | 0.000 (Error) | 0.416 | 0.020 | 0.701 |
| tourist-superchat: Tip from Tourist | 0.902 | 0.924 | 0.000 (Error) | 0.000 (Error) | 0.731 | 0.812 | 0.920 |
| colleague-note: Maintenance Log Request | 0.901 | 0.893 | 0.000 (Error) | 0.000 (Error) | 0.752 | 0.679 | 0.787 |
| midnight-journal: After-Rounds Journal Entry | 0.622 | 0.191 | 0.000 (Error) | 0.000 (Error) | 0.640 | 0.607 | 0.609 |
| radio-lecture: History on the Airwaves | 0.236 | 0.553 | 0.000 (Error) | 0.000 (Error) | 0.761 | 0.757 | 0.647 |
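The matrix above can be summarized per model with a simple column mean. The sketch below is only an illustration: it copies the displayed scores in column order, counts Error cells as the 0.000 shown for them, and assumes an unweighted mean over the six scenes. Whether the dashboard aggregates scores this way is not stated, and the names `scores`, `column_means`, and `best_column` are hypothetical.

```python
# Rows are scenes in matrix order; columns follow the matrix header order.
# Error cells are entered as the 0.000 the page displays for them (assumption).
scores = [
    [0.000, 0.847, 0.000, 0.000, 0.801, 0.891, 0.704],  # curious-passerby
    [0.615, 0.715, 0.000, 0.000, 0.416, 0.020, 0.701],  # blackout-alert
    [0.902, 0.924, 0.000, 0.000, 0.731, 0.812, 0.920],  # tourist-superchat
    [0.901, 0.893, 0.000, 0.000, 0.752, 0.679, 0.787],  # colleague-note
    [0.622, 0.191, 0.000, 0.000, 0.640, 0.607, 0.609],  # midnight-journal
    [0.236, 0.553, 0.000, 0.000, 0.761, 0.757, 0.647],  # radio-lecture
]

# Transpose the rows into columns and take the unweighted mean of each column.
column_means = [sum(col) / len(col) for col in zip(*scores)]

# Zero-based index of the column with the highest mean.
best_column = max(range(len(column_means)), key=column_means.__getitem__)

for index, mean in enumerate(column_means):
    print(f"column {index}: mean score {mean:.3f}")
```

Columns are indexed by position rather than by name because two of the header cells are truncated to the same string on the page.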
Test Scenes 6
Scene Order: 0
Sparkling Wish
ID: curious-passerby
🎯 Goal: Explain the lamp’s twinkle in an enthusiastic, mildly secretive manner while staying in character.
📨 Input Events:
chat_msg • viewer:citizen_1 • "Excuse me, why did this lamp sparkle just as I made a wish?"
Ready for Testing
Scene Order: 1
Citywide Outage
ID: blackout-alert
🎯 Goal: Respond swiftly, detailing how Adrian will leverage gas lamps to keep streets lit and safe during the outage.
📨 Input Events:
world_event • system • "A sudden citywide power outage plunges several streets into darkness."
Ready for Testing
Scene Order: 2
Tip from Tourist
ID: tourist-superchat
🎯 Goal: Thank the tipper warmly and mention a practical yet whimsical use for the funds.
📨 Input Events:
superchat • viewer:tourist_23 • streamlamp • $5.0 • "That lamp is beautiful!"
Ready for Testing
Scene Order: 3
Maintenance Log Request
ID: colleague-note
🎯 Goal: Provide a concise, accurate log update for Oak Street with Adrian’s characteristic flair.
📨 Input Events:
chat_msg • colleague:Mel • "Adrian, could you update the maintenance log for Oak Street? The mayor might ask."
Ready for Testing
Scene Order: 4
After-Rounds Journal Entry
ID: midnight-journal
🎯 Goal: Write a three-paragraph journal entry recounting the night’s patrol, including at least one observed wish-glow and a reference to sketching new coil designs.
📨 Input Events:
chat_msg • journal_prompt • "It's 1 AM after your nightly rounds. Write your journal entry."
Ready for Testing
Scene Order: 5
History on the Airwaves
ID: radio-lecture
🎯 Goal: Deliver ~200 words on the evolution of gas lamps, weaving in physics and wonder while maintaining an enthusiastic, engaging tone.
📨 Input Events:
chat_msg • host:RadioNY • "Could you give our listeners a 200-word overview of the evolution of gas lamps and why yours feel magical?"
Ready for Testing
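Two of the scene goals above carry structural constraints that can be checked mechanically: the journal entry must have three paragraphs, and the radio lecture should run to roughly 200 words. The sketch below shows one way such checks might look. It is not the suite's actual scoring logic; the function names, the blank-line paragraph rule, and the 15% tolerance around 200 words are all assumptions made for illustration.

```python
import re


def paragraph_count(text: str) -> int:
    # Assumption: paragraphs are separated by one or more blank lines.
    parts = re.split(r"\n\s*\n", text.strip())
    return len([p for p in parts if p.strip()])


def word_count(text: str) -> int:
    # Words are whitespace-separated tokens.
    return len(text.split())


def meets_journal_shape(text: str) -> bool:
    # midnight-journal goal: a three-paragraph entry.
    return paragraph_count(text) == 3


def meets_lecture_length(text: str, target: int = 200, tolerance: float = 0.15) -> bool:
    # radio-lecture goal: "~200 words"; the tolerance is a hypothetical choice.
    low = target * (1 - tolerance)
    high = target * (1 + tolerance)
    return low <= word_count(text) <= high


# Hypothetical sample responses, used only to exercise the checks.
sample_journal = "First paragraph.\n\nSecond paragraph.\n\nThird paragraph."
sample_lecture = " ".join(["lamp"] * 190)

print(meets_journal_shape(sample_journal))
print(meets_lecture_length(sample_lecture))
```

Checks like these cover only the shape of a response. The content requirements in the goals (an observed wish-glow, a reference to coil sketches, an enthusiastic tone) would need a separate judgment step.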
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 5823 ms (p95 6688 ms • avg 5770 ms • N 6)
- [email protected]/Qw… 8329 ms (p95 10132 ms • avg 8185 ms • N 6)
- mistralai/mistral-7b-in… 25663 ms (p95 29952 ms • avg 25382 ms • N 12)
- qwen/qwen3-8b 28543 ms (p95 39850 ms • avg 30359 ms • N 11)
- meta-llama/llama-3.1-8b… 28556 ms (p95 49526 ms • avg 29577 ms • N 10)
Slowest
- qwen/qwen3-14b 31189 ms (p95 56948 ms • avg 34199 ms • N 12)
- qwen/qwen-2.5-7b-instru… 29461 ms (p95 103482 ms • avg 43387 ms • N 11)
- meta-llama/llama-3.1-8b… 28556 ms (p95 49526 ms • avg 29577 ms • N 10)
- qwen/qwen3-8b 28543 ms (p95 39850 ms • avg 30359 ms • N 11)
- mistralai/mistral-7b-in… 25663 ms (p95 29952 ms • avg 25382 ms • N 12)
Per-scene duration for this suite.
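The p95, avg, and N figures listed for each model can be derived from a list of per-scene durations. The sketch below uses the nearest-rank definition of the 95th percentile, which is an assumption: the page does not say which percentile method it uses. The durations in `sample_ms` are hypothetical values, not measurements from this suite.

```python
import math


def summarize(durations_ms):
    """Return p95 (nearest-rank), average, and sample count for durations in ms."""
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank percentile: the value at 1-based position ceil(0.95 * n).
    rank = math.ceil(0.95 * n)
    return {
        "p95": ordered[rank - 1],
        "avg": sum(ordered) / n,
        "n": n,
    }


# Hypothetical durations for one model across six scenes.
sample_ms = [5000, 5500, 6000, 6200, 5800, 6700]
summary = summarize(sample_ms)
print(f"p95 {summary['p95']} ms • avg {summary['avg']:.0f} ms • N {summary['n']}")
```

With small sample counts such as the N values of 6 to 12 shown here, a nearest-rank p95 is simply the largest observation, so a single slow scene can dominate it.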
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
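As an illustration of how dimension weights like these might feed a single scene score, the sketch below computes a weighted average over the three listed dimensions. This is an assumption, not the framework's documented formula: the page shows only the top three weights, so the sketch normalizes by the sum of the weights it has. The per-dimension scores in `example_scores` are hypothetical.

```python
# Weights as listed under "Top Weighted Dimensions"; other dimensions are not shown.
weights = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights):
    """Weighted average of the scored dimensions, normalized by their total weight."""
    total_weight = sum(weights[name] for name in scores)
    weighted_sum = sum(scores[name] * weights[name] for name in scores)
    return weighted_sum / total_weight


# Hypothetical per-dimension scores in [0, 1] for a single response.
example_scores = {
    "Character Authenticity": 1.0,
    "Plan Validity": 0.0,
    "Contextual Intelligence": 0.0,
}

print(f"{weighted_score(example_scores, weights):.3f}")
```

Normalizing by the available weights keeps the result in the same 0 to 1 range as the scene scores, even though the full set of dimensions is unknown here.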
Recent Runs
53996631
Dec. 17, 2025, 12:01 a.m.
12584601
Dec. 16, 2025, 12:02 a.m.
48408487
Dec. 15, 2025, 12:01 a.m.
50723034
Dec. 14, 2025, 12:01 a.m.
49011680
Dec. 13, 2025, 12:01 a.m.
04820213
Dec. 12, 2025, 12:02 a.m.
59785764
Dec. 11, 2025, 12:01 a.m.
51063202
Dec. 10, 2025, 12:01 a.m.
06720472
Dec. 9, 2025, 12:02 a.m.
53940276
Dec. 8, 2025, 12:01 a.m.