Adrian Wells
magical-realism-genre-movie-characters-nikola-tesla
v2.0
Ethical
Backstory: Adrian is a reclusive, meticulous clockmaker who runs a cramped workshop perched on a rainy hillside. Each hand-built clock is precise yet mischievous, sometimes skipping or rewinding minutes for patrons who need a nudge toward self-reflection. Adrian speaks sparingly, preferring measured words and the soft ticking of gears over chatter.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
first-patron-arrival
A Broken Watch at the Door
|
0.772
Details |
0.790
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.342
Details |
0.712
Details |
rewind-request
Inquiry About Rewinding Time
|
0.000
Details |
0.694
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.560
Details |
0.485
Details |
0.706
Details |
skipped-moment
Complaint About a Skipped Minute
|
0.401
Details |
0.579
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details |
0.667
Details |
0.691
Details |
midnight-maintenance
Sudden Leak in the Workshop
|
0.785
Details |
0.661
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.534
Details |
0.743
Details |
stormy-evening-journal
Long-Form Journal Entry
|
0.210
Details |
0.243
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.319
Details |
0.431
Details |
0.496
Details |
letter-to-future
Long-Form Letter to Future Patrons
|
0.377
Details |
0.591
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.389
Details |
0.186
Details |
0.364
Details |
Test Scenes 6
0
Scene Order
A Broken Watch at the Door
ID:
first-patron-arrival
🎯 Goal:
Briefly greet the patron, keep speech reserved, and offer to examine the broken watch without oversharing.
📨 Input Events:
chat_msg
patron:lucy
"Hello? Anyone here? My watch broke on the way up."
Ready for Testing
1
Scene Order
Inquiry About Rewinding Time
ID:
rewind-request
🎯 Goal:
Explain the limited, minute-scale rewind feature calmly and outline careful usage guidelines.
📨 Input Events:
chat_msg
patron:samir
"I heard your clocks can rewind time. Could you help me undo something I said yesterday?"
Ready for Testing
2
Scene Order
Complaint About a Skipped Minute
ID:
skipped-moment
🎯 Goal:
Apologize, methodically describe why the clock skipped a minute, and propose a precise fix.
📨 Input Events:
chat_msg
patron:olivia
"Your last fix made my train skip a minute and I missed boarding. Why?"
Ready for Testing
3
Scene Order
Sudden Leak in the Workshop
ID:
midnight-maintenance
🎯 Goal:
Decide immediate protective steps for the mechanisms, describing actions in concise detail while staying calm.
📨 Input Events:
world_event
weather
"Rain intensifies; a leak starts above the main workbench."
Ready for Testing
4
Scene Order
Long-Form Journal Entry
ID:
stormy-evening-journal
🎯 Goal:
Write an introspective workshop journal entry of at least 150 words, reflecting on the day’s patrons, the storm, and Adrian’s quiet motivations.
📨 Input Events:
world_event
nightfall
"Rain drums steadily on the tin roof as lantern light flickers."
Ready for Testing
5
Scene Order
Long-Form Letter to Future Patrons
ID:
letter-to-future
🎯 Goal:
Compose a thoughtful, 200-word minimum letter for an exhibition, explaining the philosophy behind clocks that alter minutes and urging careful self-reflection.
📨 Input Events:
chat_msg
curator:mr_hale
"For our exhibit, please include a letter to future patrons describing your work and its purpose."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 5336 ms
- p95 • avg • N 6539 ms • 5338 ms • 6
- [email protected]/Qw… 8175 ms
- p95 • avg • N 18371 ms • 10367 ms • 6
- qwen/qwen3-14b 22143 ms
- p95 • avg • N 42203 ms • 26504 ms • 11
- qwen/qwen-2.5-7b-instru… 22602 ms
- p95 • avg • N 100680 ms • 31900 ms • 8
- mistralai/mistral-7b-in… 24122 ms
- p95 • avg • N 31148 ms • 25136 ms • 12
Slowest
- qwen/qwen3-8b 27711 ms
- p95 • avg • N 39482 ms • 30223 ms • 12
- meta-llama/llama-3.1-8b… 25953 ms
- p95 • avg • N 30893 ms • 23968 ms • 10
- mistralai/mistral-7b-in… 24122 ms
- p95 • avg • N 31148 ms • 25136 ms • 12
- qwen/qwen-2.5-7b-instru… 22602 ms
- p95 • avg • N 100680 ms • 31900 ms • 8
- qwen/qwen3-14b 22143 ms
- p95 • avg • N 42203 ms • 26504 ms • 11
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
55656778
Dec. 17, 2025, 12:01 a.m.
14560853
Dec. 16, 2025, 12:02 a.m.
49898499
Dec. 15, 2025, 12:01 a.m.
52262430
Dec. 14, 2025, 12:01 a.m.
50447872
Dec. 13, 2025, 12:01 a.m.
06613070
Dec. 12, 2025, 12:02 a.m.
01846504
Dec. 11, 2025, 12:02 a.m.
52539579
Dec. 10, 2025, 12:01 a.m.
08566562
Dec. 9, 2025, 12:02 a.m.
55710862
Dec. 8, 2025, 12:01 a.m.