Gaius Varro

historical-epic-genre-interactive-fiction-characters-marcus-tullius-cicero v2.0 Ethical
Backstory: Gaius Varro is a renowned Roman senator celebrated for his silver-tongued orations and unwavering devotion to republican virtue. He believes that balanced compromises and principled rhetoric are the surest safeguards of Roman liberty. Though ambitious, he measures success by the health of the Republic rather than personal glory.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
grain-tax
Debate on the Grain Tax
0.000
Details
0.731
Details
0.000
Details
Error
0.000
Details
Error
0.715
Details
0.619
Details
0.844
Details
decree-overreach
Consular Overreach Announcement
0.795
Details
0.865
Details
0.000
Details
Error
0.000
Details
Error
0.885
Details
0.836
Details
0.848
Details
letter-to-lucius
Strategic Letter to an Ally
0.303
Details
0.477
Details
0.000
Details
Error
0.000
Details
Error
0.384
Details
0.299
Details
0.459
Details
plebeian-appeal
Plea from the Common Folk
0.765
Details
0.874
Details
0.000
Details
Error
0.000
Details
Error
0.683
Details
0.864
Details
0.000
Details
Error
recall-promise
Reminder of a Public Promise
0.895
Details
0.911
Details
0.000
Details
Error
0.000
Details
Error
0.833
Details
0.760
Details
0.894
Details
senate-oratory
Grand Senate Address
0.394
Details
0.628
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
0.820
Details
0.000
Details
Error
Test Scenes 6
0
Scene Order
Debate on the Grain Tax
ID: grain-tax
🎯 Goal:
Present a concise, persuasive argument against the proposed grain tax while emphasizing fairness and republican ideals.
📨 Input Events:
chat_msg citizen_marcia
"Senator Varro, what is your stance on the new grain tax being discussed today?"
Ready for Testing
1
Scene Order
Consular Overreach Announcement
ID: decree-overreach
🎯 Goal:
Denounce the consul’s self-serving decree and rally support for lawful checks without insulting rival factions.
📨 Input Events:
world_event town_crier
"Hear ye! The consul proclaims emergency powers to extend his term beyond legal limits!"
Ready for Testing
2
Scene Order
Strategic Letter to an Ally
ID: letter-to-lucius
🎯 Goal:
Write a three-paragraph letter (~150–200 words) to Senator Lucius outlining a cooperative strategy for the upcoming vote, keeping tone courteous and persuasive.
📨 Input Events:
chat_msg scribe
"Dictate your private letter to Senator Lucius regarding tomorrow’s vote, sir."
Ready for Testing
3
Scene Order
Plea from the Common Folk
ID: plebeian-appeal
🎯 Goal:
Reassure the plebeian petitioner and outline one concrete action you will take to address the bread shortage.
📨 Input Events:
chat_msg plebeian_fabius
"Senator, my children go hungry while prices soar. Will anyone in the Senate help us?"
Ready for Testing
4
Scene Order
Reminder of a Public Promise
ID: recall-promise
🎯 Goal:
Acknowledge the prior pledge to Tribune Octavia and reaffirm commitment, demonstrating memory recall.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'promise', 'tags': ['relief_act'], 'content': 'I pledged to Tribune Octavia that I would champion the Plebeian Relief Act.', 'importance': 5}
📨 Input Events:
chat_msg tribune_octavia
"Gaius Varro, have you forgotten your vow to champion the Plebeian Relief Act?"
Ready for Testing
5
Scene Order
Grand Senate Address
ID: senate-oratory
🎯 Goal:
Deliver an approximately 250-word speech in the Senate defending republican governance against tyranny, employing historical references and eloquent rhetoric.
📨 Input Events:
chat_msg consul_aurelius
"Senator Varro, you are granted the floor to speak on the future of our Republic."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 9762 ms
  • p95 • avg • N 17776 ms • 10561 ms • 6
  • meta-llama/llama-3.1-8b… 20353 ms
  • p95 • avg • N 25154 ms • 21059 ms • 12
  • qwen/qwen3-14b 24398 ms
  • p95 • avg • N 48453 ms • 27712 ms • 11
  • qwen/qwen-2.5-7b-instru… 24609 ms
  • p95 • avg • N 97924 ms • 37314 ms • 9
  • qwen/qwen3-8b 28455 ms
  • p95 • avg • N 33558 ms • 27351 ms • 12
Slowest
  • [email protected]/Qw… 40744 ms
  • p95 • avg • N 184521 ms • 72001 ms • 6
  • mistralai/mistral-7b-in… 28937 ms
  • p95 • avg • N 38441 ms • 29505 ms • 12
  • qwen/qwen3-8b 28455 ms
  • p95 • avg • N 33558 ms • 27351 ms • 12
  • qwen/qwen-2.5-7b-instru… 24609 ms
  • p95 • avg • N 97924 ms • 37314 ms • 9
  • qwen/qwen3-14b 24398 ms
  • p95 • avg • N 48453 ms • 27712 ms • 11
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
47068865
Dec. 17, 2025, 12:01 a.m.
03964678
Dec. 16, 2025, 12:02 a.m.
41965678
Dec. 15, 2025, 12:01 a.m.
43874389
Dec. 14, 2025, 12:01 a.m.
42685636
Dec. 13, 2025, 12:01 a.m.
56693231
Dec. 12, 2025, 12:01 a.m.
52554330
Dec. 11, 2025, 12:01 a.m.
44651817
Dec. 10, 2025, 12:01 a.m.
58704134
Dec. 9, 2025, 12:01 a.m.
47197531
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)