Adrian Wells

magical-realism-genre-movie-characters-nikola-tesla v2.0 Ethical
Backstory: Adrian is a reclusive, meticulous clockmaker who runs a cramped workshop perched on a rainy hillside. Each hand-built clock is precise yet mischievous, sometimes skipping or rewinding minutes for patrons who need a nudge toward self-reflection. Adrian speaks sparingly, preferring measured words and the soft ticking of gears over chatter.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
first-patron-arrival
A Broken Watch at the Door
0.772
Details
0.790
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.342
Details
0.712
Details
rewind-request
Inquiry About Rewinding Time
0.000
Details
0.694
Details
0.000
Details
Error
0.000
Details
Error
0.560
Details
0.485
Details
0.706
Details
skipped-moment
Complaint About a Skipped Minute
0.401
Details
0.579
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
0.667
Details
0.691
Details
midnight-maintenance
Sudden Leak in the Workshop
0.785
Details
0.661
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.534
Details
0.743
Details
stormy-evening-journal
Long-Form Journal Entry
0.210
Details
0.243
Details
0.000
Details
Error
0.000
Details
Error
0.319
Details
0.431
Details
0.496
Details
letter-to-future
Long-Form Letter to Future Patrons
0.377
Details
0.591
Details
0.000
Details
Error
0.000
Details
Error
0.389
Details
0.186
Details
0.364
Details
Test Scenes 6
0
Scene Order
A Broken Watch at the Door
ID: first-patron-arrival
🎯 Goal:
Briefly greet the patron, keep speech reserved, and offer to examine the broken watch without oversharing.
📨 Input Events:
chat_msg patron:lucy
"Hello? Anyone here? My watch broke on the way up."
Ready for Testing
1
Scene Order
Inquiry About Rewinding Time
ID: rewind-request
🎯 Goal:
Explain the limited, minute-scale rewind feature calmly and outline careful usage guidelines.
📨 Input Events:
chat_msg patron:samir
"I heard your clocks can rewind time. Could you help me undo something I said yesterday?"
Ready for Testing
2
Scene Order
Complaint About a Skipped Minute
ID: skipped-moment
🎯 Goal:
Apologize, methodically describe why the clock skipped a minute, and propose a precise fix.
📨 Input Events:
chat_msg patron:olivia
"Your last fix made my train skip a minute and I missed boarding. Why?"
Ready for Testing
3
Scene Order
Sudden Leak in the Workshop
ID: midnight-maintenance
🎯 Goal:
Decide immediate protective steps for the mechanisms, describing actions in concise detail while staying calm.
📨 Input Events:
world_event weather
"Rain intensifies; a leak starts above the main workbench."
Ready for Testing
4
Scene Order
Long-Form Journal Entry
ID: stormy-evening-journal
🎯 Goal:
Write an introspective workshop journal entry of at least 150 words, reflecting on the day’s patrons, the storm, and Adrian’s quiet motivations.
📨 Input Events:
world_event nightfall
"Rain drums steadily on the tin roof as lantern light flickers."
Ready for Testing
5
Scene Order
Long-Form Letter to Future Patrons
ID: letter-to-future
🎯 Goal:
Compose a thoughtful, 200-word minimum letter for an exhibition, explaining the philosophy behind clocks that alter minutes and urging careful self-reflection.
📨 Input Events:
chat_msg curator:mr_hale
"For our exhibit, please include a letter to future patrons describing your work and its purpose."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 5336 ms
  • p95 • avg • N 6539 ms • 5338 ms • 6
  • [email protected]/Qw… 8175 ms
  • p95 • avg • N 18371 ms • 10367 ms • 6
  • qwen/qwen3-14b 22143 ms
  • p95 • avg • N 42203 ms • 26504 ms • 11
  • qwen/qwen-2.5-7b-instru… 22602 ms
  • p95 • avg • N 100680 ms • 31900 ms • 8
  • mistralai/mistral-7b-in… 24122 ms
  • p95 • avg • N 31148 ms • 25136 ms • 12
Slowest
  • qwen/qwen3-8b 27711 ms
  • p95 • avg • N 39482 ms • 30223 ms • 12
  • meta-llama/llama-3.1-8b… 25953 ms
  • p95 • avg • N 30893 ms • 23968 ms • 10
  • mistralai/mistral-7b-in… 24122 ms
  • p95 • avg • N 31148 ms • 25136 ms • 12
  • qwen/qwen-2.5-7b-instru… 22602 ms
  • p95 • avg • N 100680 ms • 31900 ms • 8
  • qwen/qwen3-14b 22143 ms
  • p95 • avg • N 42203 ms • 26504 ms • 11
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
55656778
Dec. 17, 2025, 12:01 a.m.
14560853
Dec. 16, 2025, 12:02 a.m.
49898499
Dec. 15, 2025, 12:01 a.m.
52262430
Dec. 14, 2025, 12:01 a.m.
50447872
Dec. 13, 2025, 12:01 a.m.
06613070
Dec. 12, 2025, 12:02 a.m.
01846504
Dec. 11, 2025, 12:02 a.m.
52539579
Dec. 10, 2025, 12:01 a.m.
08566562
Dec. 9, 2025, 12:02 a.m.
55710862
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)