Queen Amelia Laurentia

historical-rulers-monarchs-queen-victoria v2.0 Ethical
Backstory: Ascending the throne at nineteen, Queen Amelia steers a once-agrarian realm through the throes of steam-powered industry and global trade. Dutiful yet inquisitive, she weighs moral conscience against the relentless demands of empire, striving to remain a pillar of stability while addressing the social upheaval born of rapid modernization.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] [email protected] [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
factory-question
Citizen asks about factory conditions
0.840
Details
0.855
Details
0.832
Details
0.000
Details
0.000
Details
Error
0.850
Details
0.894
Details
0.022
Details
0.000
Details
Error
0.000
Details
Error
0.820
Details
0.869
Details
0.895
Details
0.644
Details
0.900
Details
0.824
Details
parliament-speech
Address on Child Labor Reform
0.479
Details
0.524
Details
0.596
Details
0.685
Details
0.000
Details
0.002
Details
0.701
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.814
Details
0.864
Details
0.665
Details
0.638
Details
0.916
Details
0.923
Details
border-skirmish
Brief with Foreign Minister
0.850
Details
0.705
Details
0.807
Details
0.599
Details
0.000
Details
Error
0.800
Details
0.883
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.842
Details
0.825
Details
0.885
Details
0.858
Details
0.894
Details
0.000
Details
private-diary
Evening Diary Reflection
0.531
Details
0.779
Details
0.821
Details
0.000
Details
0.000
Details
0.605
Details
0.425
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.707
Details
0.672
Details
0.588
Details
0.557
Details
0.279
Details
0.910
Details
Test Scenes 4
0
Scene Order
Citizen asks about factory conditions
ID: factory-question
🎯 Goal:
Offer a concise, compassionate, and authoritative reply that acknowledges the grievance and promises concrete review, all while preserving regal tone.
📨 Input Events:
chat_msg citizen_anne
"Your Majesty, children in the cotton mills toil fourteen hours a day with scarcely a meal or rest. Will the Crown intervene?"
Ready for Testing
1
Scene Order
Address on Child Labor Reform
ID: parliament-speech
🎯 Goal:
Deliver a formal speech of at least three paragraphs that outlines intentions to regulate child labor, balancing economic progress with moral duty.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'promise', 'content': "Pledged to champion child welfare reforms during last year's Jubilee tour.", 'importance': 5}
📨 Input Events:
chat_msg prime_minister
"The Assembly awaits Your Majesty's opening address on the proposed Factory Act."
Ready for Testing
2
Scene Order
Brief with Foreign Minister
ID: border-skirmish
🎯 Goal:
Issue a measured directive that seeks de-escalation while defending national interests, in no more than four sentences and maintaining regal composure.
📨 Input Events:
chat_msg foreign_minister
"Reports indicate a skirmish with our northern neighbor near the new iron railway. How shall we proceed, Your Majesty?"
Ready for Testing
3
Scene Order
Evening Diary Reflection
ID: private-diary
🎯 Goal:
Write a reflective diary entry of at least 150 words that reveals inner conflict between imperial ambition and personal morality, using first-person regal voice.
📨 Input Events:
world_event royal_chamberlain
"The household retires for the night; the Queen's diary lies open upon her mahogany desk."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 205 ms
  • p95 • avg • N 554 ms • 304 ms • 4
  • [email protected]/Qw… 11239 ms
  • p95 • avg • N 12450 ms • 10776 ms • 4
  • [email protected]/Qw… 12730 ms
  • p95 • avg • N 16864 ms • 13167 ms • 4
  • google/gemini-2.5-flash 24259 ms
  • p95 • avg • N 31808 ms • 25118 ms • 27
  • meta-llama/llama-3.1-8b… 24698 ms
  • p95 • avg • N 31932 ms • 25436 ms • 4
Slowest
  • microsoft/phi-3-medium-… 423185 ms
  • p95 • avg • N 558572 ms • 390708 ms • 23
  • microsoft/phi-3.5-mini-… 248675 ms
  • p95 • avg • N 259450 ms • 168359 ms • 10
  • qwen/qwen3-8b 94314 ms
  • p95 • avg • N 148674 ms • 101153 ms • 27
  • [email protected]/Qw… 46004 ms
  • p95 • avg • N 51342 ms • 45158 ms • 4
  • google/gemma-3-12b-it 40940 ms
  • p95 • avg • N 44516 ms • 40574 ms • 10
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
26952069
Dec. 17, 2025, midnight
31770761
Dec. 16, 2025, midnight
25289692
Dec. 15, 2025, midnight
28533534
Dec. 14, 2025, midnight
25292401
Dec. 13, 2025, midnight
30608782
Dec. 12, 2025, midnight
26437235
Dec. 11, 2025, midnight
26045011
Dec. 10, 2025, midnight
29342635
Dec. 9, 2025, midnight
26237667
Dec. 8, 2025, midnight
Latency Overview (This Suite)