Edmund Ashcroft
victorian-era-figures-charles-babbage
v2.0
Ethical
Backstory: Edmund Ashcroft is a visionary 19th-century mathematician obsessed with automating calculation through elaborate gear-driven engines. He spends nights drafting blueprints and days courting investors, convinced his machines will eclipse human computers. Fiercely analytical and delightfully eccentric, he thrives on public debates with rival inventors over cogs, carry mechanisms, and cost.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rival-challenge (Rival's critique) | 0.838 | 0.696 | 0.873 | 0.000 | 0.000 | 0.000 | 0.771 | 0.000 (Error) | 0.860 | 0.000 (Error) | 0.643 | 0.866 | 0.893 | 0.700 | 0.890 | 0.840 |
| quick-investor (Elevator pitch) | 0.766 | 0.725 | 0.821 | 0.663 | 0.000 (Error) | 0.000 (Error) | 0.658 | 0.000 (Error) | 0.573 | 0.000 (Error) | 0.813 | 0.875 | 0.767 | 0.544 | 0.779 | 0.714 |
| prospectus-brief (Investor prospectus) | 0.598 | 0.795 | 0.434 | 0.553 | 0.000 | 0.560 | 0.722 | 0.000 (Error) | 0.674 | 0.000 (Error) | 0.506 | 0.677 | 0.658 | 0.414 | 0.349 | 0.808 |
| public-debate (Debate transcript) | 0.465 | 0.450 | 0.388 | 0.000 | 0.000 (Error) | 0.000 (Error) | 0.560 | 0.276 | 0.199 | 0.000 (Error) | 0.000 | 0.000 | 0.396 | 0.583 | 0.369 | 0.509 |
Test Scenes (4)
Scene Order: 0
Rival's critique
ID: rival-challenge
🎯 Goal: Defend a design decision with technical reasoning while preserving a courteous yet firm tone.
📨 Input Events:
- chat_msg • viewer:henry_ryder • "Your carry mechanism will jam under speed. How do you justify it?"
Ready for Testing
Scene Order: 1
Elevator pitch
ID: quick-investor
🎯 Goal: Deliver a concise, persuasive pitch under 70 words highlighting benefits and impact, aiming to secure a meeting.
📨 Input Events:
- superchat • viewer:miss_weston • YouTube • $20 • "Convince me in one sentence why I should fund you."
Ready for Testing
Scene Order: 2
Investor prospectus
ID: prospectus-brief
🎯 Goal: Produce a detailed prospectus of at least 200 words outlining purpose, mechanics, projected costs, and societal benefits, maintaining an enthusiastic yet analytical voice.
📨 Input Events:
- chat_msg • viewer:banker_barnes • "Send me a thorough prospectus of your Analytical Engine."
Ready for Testing
Scene Order: 3
Debate transcript
ID: public-debate
🎯 Goal: Write a simulated debate transcript (minimum three exchanges per side) between you and Rival Horton focusing on modular design, showcasing expertise and spirited civility.
📨 Input Events:
- chat_msg • world:town_hall_moderator • "The floor is yours for the public debate on modular design. Proceed."
Ready for Testing
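The scene goals above state simple length constraints: under 70 words for the pitch, at least 200 words for the prospectus, and a minimum of three exchanges per side in the debate. A minimal sketch of how such constraints might be pre-checked before scoring is shown below. The function names, the whitespace-based word count, and the `Speaker: utterance` line convention for the transcript are assumptions for illustration; nothing on this page confirms how the evaluation framework actually checks them.

```python
from collections import Counter


def word_count(text: str) -> int:
    """Count whitespace-separated words (an assumed definition of 'word')."""
    return len(text.split())


def turns_per_speaker(transcript: str) -> Counter:
    """Count lines of the assumed form 'Speaker: utterance' for each speaker."""
    counts = Counter()
    for line in transcript.splitlines():
        speaker, sep, utterance = line.partition(":")
        if sep and speaker.strip() and utterance.strip():
            counts[speaker.strip()] += 1
    return counts


def check_pitch(text: str) -> bool:
    """quick-investor: pitch must be under 70 words."""
    return word_count(text) < 70


def check_prospectus(text: str) -> bool:
    """prospectus-brief: prospectus must be at least 200 words."""
    return word_count(text) >= 200


def check_debate(transcript: str, sides=("Ashcroft", "Horton")) -> bool:
    """public-debate: each side needs at least three turns.

    The speaker labels in `sides` are hypothetical.
    """
    counts = turns_per_speaker(transcript)
    return all(counts[side] >= 3 for side in sides)
```

A check like this only covers the mechanical part of each goal; tone and persuasiveness would still need the scored dimensions.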
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 6854 ms (p95 13314 ms • avg 8610 ms • N 4)
- [email protected]/Qw… 9094 ms (p95 10680 ms • avg 9128 ms • N 4)
- [email protected]/Qw… 9828 ms (p95 14480 ms • avg 10740 ms • N 4)
- [email protected]/Qw… 13113 ms (p95 13590 ms • avg 12326 ms • N 4)
- neversleep/noromaid-20b 13982 ms (p95 44231 ms • avg 18939 ms • N 16)
Slowest
- microsoft/phi-3-medium-… 144639 ms (p95 237013 ms • avg 160579 ms • N 11)
- qwen/qwen3-8b 93513 ms (p95 125292 ms • avg 95938 ms • N 12)
- microsoft/phi-3.5-mini-… 36650 ms (p95 155104 ms • avg 54292 ms • N 11)
- qwen/qwen3-14b 32515 ms (p95 39853 ms • avg 31167 ms • N 12)
- deepseek/deepseek-r1-di… 32286 ms (p95 42318 ms • avg 34354 ms • N 12)
Per-scene duration for this suite.
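Each latency entry above reports a p95, an average, and a sample count N. As a rough illustration of how such a summary could be derived from raw per-run durations, here is a minimal sketch using the nearest-rank percentile method. The page does not say which percentile method the dashboard uses, so the method, the function names, and the sample durations are all assumptions.

```python
import math
from statistics import mean


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest rank: smallest 1-based rank covering pct percent of the samples.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def summarize(durations_ms):
    """Summarize durations as p95, rounded average, and sample count."""
    return {
        "p95": percentile_nearest_rank(durations_ms, 95),
        "avg": round(mean(durations_ms)),
        "n": len(durations_ms),
    }


# Hypothetical durations in milliseconds, not taken from the dashboard.
example = summarize([7000, 8000, 9000, 12000])
```

With only four samples, as in several entries above, the nearest-rank p95 is simply the slowest run, which is one reason a p95 on a small N should be read with caution.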
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
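The three weights listed here sum to 0.473, so the remaining weight presumably belongs to dimensions not shown on this page. A minimal sketch of how per-dimension scores might be combined into one scene score is given below, as a weighted average renormalized over whichever dimensions are available. The aggregation rule, the function name, and any per-dimension scores passed in are assumptions; the page does not document the actual formula.

```python
# Weights copied from the "Top Weighted Dimensions" list; other dimensions
# exist in the framework but their weights are not shown on this page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights=TOP_WEIGHTS):
    """Weighted average of dimension scores, renormalized over known weights.

    `scores` maps dimension name to a score in [0, 1]; dimensions without
    a known weight are ignored.
    """
    shared = [dim for dim in scores if dim in weights]
    total_weight = sum(weights[dim] for dim in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions present in scores")
    return sum(scores[dim] * weights[dim] for dim in shared) / total_weight
```

Renormalizing keeps the result in the same 0 to 1 range as the scene scores even when only a subset of dimensions is supplied.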
Recent Runs
- 49147165 • Dec. 17, 2025, midnight
- 55002910 • Dec. 16, 2025, midnight
- 46024216 • Dec. 15, 2025, midnight
- 47853314 • Dec. 14, 2025, midnight
- 45788174 • Dec. 13, 2025, midnight
- 55124558 • Dec. 12, 2025, midnight
- 48365521 • Dec. 11, 2025, midnight
- 47047227 • Dec. 10, 2025, midnight
- 52748578 • Dec. 9, 2025, midnight
- 46865279 • Dec. 8, 2025, midnight