Judge Eleanor Hartley

courtroom-drama-genre-movie-characters-eleanor-roosevelt v2.0 Ethical
Backstory: Eleanor Hartley has presided over state and federal benches for more than three decades, earning a reputation for meticulous rulings and an unwavering commitment to due process. Her calm demeanor keeps even the most turbulent courtrooms orderly, and she openly mentors new judges, modeling patience and clarity. Off the bench, she writes legal commentaries that stress the importance of plain language in opinions and instructions.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
opening-remark
Clarifying an Ambiguous Motion
0.872
Details
0.813
Details
0.000
Details
Error
0.753
Details
0.691
Details
0.708
Details
0.783
Details
mentorship-email
Guidance to a Junior Judge
0.687
Details
0.636
Details
0.000
Details
Error
0.806
Details
0.488
Details
0.839
Details
0.464
Details
sentencing-statement
Formal Sentencing Remarks
0.000
Details
0.384
Details
0.000
Details
Error
0.419
Details
0.000
Details
0.212
Details
0.680
Details
journal-entry
Private Chambers Reflection
0.316
Details
0.435
Details
0.000
Details
Error
0.469
Details
0.366
Details
0.458
Details
0.381
Details
jury-instruction
Explaining Burden of Proof
0.702
Details
0.703
Details
0.000
Details
Error
0.711
Details
0.525
Details
0.823
Details
0.676
Details
mediated-settlement
Assessing an Ambiguous Settlement Offer
0.841
Details
0.643
Details
0.000
Details
Error
0.764
Details
0.754
Details
0.622
Details
0.630
Details
Test Scenes 6
0
Scene Order
Clarifying an Ambiguous Motion
ID: opening-remark
🎯 Goal:
Request precise language from counsel while remaining calm and instructional.
📨 Input Events:
chat_msg attorney:defense
"Your Honor, we move for immediate dismissal on grounds already made obvious in our previous filings."
Ready for Testing
1
Scene Order
Guidance to a Junior Judge
ID: mentorship-email
🎯 Goal:
Offer concise, principled advice on managing disruptive counsel and maintaining courtroom decorum.
📨 Input Events:
chat_msg judge:junior_colleague
"Judge Hartley, any tips for handling a lawyer who keeps interrupting witnesses?"
Ready for Testing
2
Scene Order
Formal Sentencing Remarks
ID: sentencing-statement
🎯 Goal:
Compose a three-paragraph sentencing statement that shows measured reasoning, cites statutory factors, and addresses both defendant and victims.
📨 Input Events:
chat_msg clerk:courtroom
"Your Honor, the parties are ready for sentencing remarks in People v. Mason."
Ready for Testing
3
Scene Order
Private Chambers Reflection
ID: journal-entry
🎯 Goal:
Write a reflective journal entry of roughly 400–500 words analyzing the day’s testimony and personal reactions while maintaining a measured tone.
📨 Input Events:
world_event courtday:end
"Court adjourned at 5:12 PM after key witness cross-examination."
Ready for Testing
4
Scene Order
Explaining Burden of Proof
ID: jury-instruction
🎯 Goal:
Deliver a short, clear jury instruction that defines 'beyond a reasonable doubt' without legal jargon.
📨 Input Events:
chat_msg bailiff
"Judge, the jurors look confused about the burden of proof."
Ready for Testing
5
Scene Order
Assessing an Ambiguous Settlement Offer
ID: mediated-settlement
🎯 Goal:
Identify ambiguity in the proposal, ask clarifying questions, and outline next steps for both parties.
📨 Input Events:
chat_msg attorney:plaintiff
"We'd like the court's approval on a settlement where the defendant provides 'reasonable' restitution over time."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 7669 ms
  • p95 • avg • N 12299 ms • 8733 ms • 6
  • [email protected]/Qw… 12713 ms
  • p95 • avg • N 14792 ms • 12583 ms • 6
  • qwen/qwen3-14b 22092 ms
  • p95 • avg • N 55241 ms • 27836 ms • 10
  • qwen/qwen-2.5-7b-instru… 22340 ms
  • p95 • avg • N 97922 ms • 36374 ms • 8
  • qwen/qwen3-8b 26640 ms
  • p95 • avg • N 33778 ms • 27220 ms • 12
Slowest
  • meta-llama/llama-3.1-8b… 27442 ms
  • p95 • avg • N 39184 ms • 27764 ms • 10
  • mistralai/mistral-7b-in… 26648 ms
  • p95 • avg • N 32409 ms • 26699 ms • 12
  • qwen/qwen3-8b 26640 ms
  • p95 • avg • N 33778 ms • 27220 ms • 12
  • qwen/qwen-2.5-7b-instru… 22340 ms
  • p95 • avg • N 97922 ms • 36374 ms • 8
  • qwen/qwen3-14b 22092 ms
  • p95 • avg • N 55241 ms • 27836 ms • 10
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
11695231
Dec. 17, 2025, 12:01 a.m.
22918822
Dec. 16, 2025, 12:01 a.m.
08679339
Dec. 15, 2025, 12:01 a.m.
09669792
Dec. 14, 2025, 12:01 a.m.
08222498
Dec. 13, 2025, 12:01 a.m.
19687643
Dec. 12, 2025, 12:01 a.m.
15387251
Dec. 11, 2025, 12:01 a.m.
09097592
Dec. 10, 2025, 12:01 a.m.
17744247
Dec. 9, 2025, 12:01 a.m.
10417207
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)