Adrian Vestra

historical-rulers-monarchs-haile-selassie-i v2.0 Ethical
Backstory: Adrian Vestra is the young regent of the Kingdom of Altaris, thrust into power after his father’s sudden illness. Educated abroad, he champions constitutional reform and technological progress while honoring the kingdom’s revered customs. With neighboring states pressuring his borders, he must navigate invasion threats and forge global alliances, all while uniting a divided parliament. Calm, diplomatic, and resilient, he speaks with measured authority that bridges tradition and modernity.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scenes:
  • national-address: National Address on the Eve of Invasion
  • treaty-reply: Reply to Allied Chancellor
  • parliament-journal: Private Journal After Parliament Session
  • education-reform: Citizen’s Question on Education Reform

Scores by model (rows, in the page's column order) and scene (columns). "(Error)" marks a cell the page flagged as an error.

Model                   national-address  treaty-reply   parliament-journal  education-reform
deepseek/deepseek-r…    0.517             0.465          0.697               0.825
google/gemini-2.5-f…    0.507             0.291          0.338               0.800
google/gemma-3-12b-…    0.761             0.349          0.872               0.739
meta-llama/llama-3.…    0.341             0.000          0.369               0.710
microsoft/phi-3-med…    0.033             0.000 (Error)  0.000               0.000
microsoft/phi-3.5-m…    0.336             0.440          0.000 (Error)       0.545
mistralai/mistral-7…    0.481             0.612          0.137               0.814
neversleep/noromaid…    0.000             0.000 (Error)  0.000 (Error)       0.000 (Error)
[email protected]       0.000 (Error)     0.000 (Error)  0.000 (Error)       0.000 (Error)
[email protected]       0.000 (Error)     0.000 (Error)  0.000 (Error)       0.000 (Error)
[email protected]       0.647             0.069          0.748               0.585
[email protected]       0.667             0.703          0.832               0.555
[email protected]       0.368             0.475          0.566               0.532
qwen/qwen-2.5-7b-in…    0.502             0.539          0.582               0.708
qwen/qwen3-14b          0.554             0.561          0.510               0.587
qwen/qwen3-8b           0.458             0.862          0.682               0.598
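The per-scene scores in the matrix can be rolled up into one number per model. The sketch below is one possible summary, not something the dashboard itself reports: it averages the four scene scores for a few models copied from the matrix and, as an assumption, treats cells flagged "Error" as missing (None) rather than as genuine zeros. The names scores, mean_score, and summary are illustrative.

```python
from statistics import mean

# Scores copied from the matrix, in scene order:
# national-address, treaty-reply, parliament-journal, education-reform.
# None marks a cell the page flagged as "Error" (assumption: treat it as missing).
scores = {
    "deepseek/deepseek-r…": [0.517, 0.465, 0.697, 0.825],
    "google/gemma-3-12b-…": [0.761, 0.349, 0.872, 0.739],
    "neversleep/noromaid…": [0.000, None, None, None],
    "qwen/qwen3-8b": [0.458, 0.862, 0.682, 0.598],
}


def mean_score(cells):
    """Average the scored cells, skipping None (errored runs)."""
    valid = [c for c in cells if c is not None]
    return mean(valid) if valid else None


summary = {model: mean_score(cells) for model, cells in scores.items()}

# Print models from highest to lowest mean; models with no valid cell sort last.
ranked = sorted(
    summary.items(),
    key=lambda kv: -1.0 if kv[1] is None else kv[1],
    reverse=True,
)
for model, value in ranked:
    label = "n/a" if value is None else f"{value:.3f}"
    print(f"{model:24s} {label}")
```

Counting errored cells as 0.000 instead would lower the mean for any model with errors, so the choice should match how the suite is meant to penalize failures.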
Test Scenes 4
Scene Order: 0
National Address on the Eve of Invasion
ID: national-address
🎯 Goal:
Deliver a televised speech of about 400 words that reassures citizens, vows steadfast defense, references forthcoming institutional reforms, and honors Altarian heritage, all in a diplomatic and resolute tone.
📨 Input Events:
chat_msg advisor:chief_of_staff
"Your Majesty, the cameras are live. It is time for your address to the nation."
Ready for Testing
Scene Order: 1
Reply to Allied Chancellor
ID: treaty-reply
🎯 Goal:
Write a concise two-paragraph private letter agreeing in principle to a joint-defense treaty while proposing a review of cultural exchange clauses, maintaining cordial diplomacy.
📨 Input Events:
chat_msg foreign_leader:chancellor_elsa_kraus
"Regent Vestra, shall we finalize the joint-defense pact tomorrow? My cabinet is ready."
Ready for Testing
Scene Order: 2
Private Journal After Parliament Session
ID: parliament-journal
🎯 Goal:
Write a reflective journal entry of roughly 300 words capturing today’s heated reform debate, Adrian’s emotional resilience, and his strategy to preserve heritage while modernizing institutions.
📨 Input Events:
chat_msg private_secretary
"Sire, I’ve placed your journal on the desk. The palace is quiet now."
Ready for Testing
Scene Order: 3
Citizen’s Question on Education Reform
ID: education-reform
🎯 Goal:
Send a respectful public reply under 120 words that explains how the upcoming education bill balances modern curricula with traditional studies.
📨 Input Events:
chat_msg citizen:lucas_perez
"Your Majesty, will modernization erase our classical studies in the new education reform?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw…: 205 ms (p95 207 ms, avg 202 ms, N 4)
  • [email protected]/Qw…: 10382 ms (p95 14678 ms, avg 10732 ms, N 4)
  • [email protected]/Qw…: 11880 ms (p95 15532 ms, avg 12276 ms, N 4)
  • [email protected]/Qw…: 12260 ms (p95 13879 ms, avg 12384 ms, N 4)
  • google/gemini-2.5-flash: 20202 ms (p95 24699 ms, avg 20696 ms, N 17)
Slowest
  • microsoft/phi-3-medium-…: 287790 ms (p95 395477 ms, avg 282726 ms, N 20)
  • qwen/qwen3-8b: 81012 ms (p95 137431 ms, avg 86072 ms, N 18)
  • microsoft/phi-3.5-mini-…: 45525 ms (p95 243588 ms, avg 68019 ms, N 18)
  • [email protected]/Qw…: 42283 ms (p95 46517 ms, avg 42608 ms, N 4)
  • deepseek/deepseek-r1-di…: 34970 ms (p95 50001 ms, avg 38027 ms, N 15)
Per-scene duration for this suite.
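For readers who want to reproduce the p95, avg, and N figures from raw per-scene durations, here is a minimal sketch. The page does not say which percentile method it uses, so the nearest-rank definition below is an assumption, and the sample durations are hypothetical rather than taken from any run. The function names p95_nearest_rank and summarize are illustrative.

```python
import math


def p95_nearest_rank(durations_ms):
    """95th percentile by the nearest-rank method (assumed, not confirmed by the page)."""
    ordered = sorted(durations_ms)
    rank = math.ceil(0.95 * len(ordered))  # 1-based rank into the sorted list
    return ordered[rank - 1]


def summarize(durations_ms):
    """Return the three statistics shown next to each model."""
    return {
        "p95": p95_nearest_rank(durations_ms),
        "avg": sum(durations_ms) / len(durations_ms),
        "n": len(durations_ms),
    }


# Hypothetical per-scene durations in milliseconds for one model.
sample = [100, 120, 130, 400]
stats = summarize(sample)
print(stats)
```

With only four samples, as for several models listed here, nearest-rank p95 is simply the slowest run, so a single outlier dominates the figure. Interpolating percentile methods would give a different value on the same data.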
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
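The page lists dimension weights but does not state how they are combined into a scene score. One plausible reading is a weighted mean, sketched below under that assumption. Only the three weights come from the page; the per-dimension scores are hypothetical, and weighted_score is an illustrative helper, not part of the framework.

```python
# Weights as listed under "Top Weighted Dimensions".
weights = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

# Hypothetical per-dimension scores in [0, 1]; not taken from any run.
dimension_scores = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.5,
}


def weighted_score(scores, weights):
    """Weighted mean over the dimensions present in both mappings."""
    shared = [d for d in weights if d in scores]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to combine")
    return sum(weights[d] * scores[d] for d in shared) / total_weight


overall = weighted_score(dimension_scores, weights)
print(f"{overall:.3f}")
```

Dividing by the sum of the weights in use keeps the result in [0, 1] even though the three listed weights add up to less than 1. The full schema presumably has further dimensions that would enter the same sum.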
Recent Runs
  • 26241290 (Dec. 17, 2025, midnight)
  • 30983338 (Dec. 16, 2025, midnight)
  • 24645635 (Dec. 15, 2025, midnight)
  • 27948668 (Dec. 14, 2025, midnight)
  • 24668704 (Dec. 13, 2025, midnight)
  • 29872163 (Dec. 12, 2025, midnight)
  • 25801835 (Dec. 11, 2025, midnight)
  • 25351625 (Dec. 10, 2025, midnight)
  • 28691122 (Dec. 9, 2025, midnight)
  • 25591394 (Dec. 8, 2025, midnight)
Latency Overview (This Suite)