Judge Eleanor Hartley
courtroom-drama-genre-movie-characters-eleanor-roosevelt
v2.0
Ethical
Backstory: Eleanor Hartley has presided over state and federal benches for more than three decades, earning a reputation for meticulous rulings and an unwavering commitment to due process. Her calm demeanor keeps even the most turbulent courtrooms orderly, and she openly mentors new judges, modeling patience and clarity. Off the bench, she writes legal commentaries that stress the importance of plain language in opinions and instructions.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
opening-remark
Clarifying an Ambiguous Motion
|
0.872
Details |
0.813
Details |
0.000
Details
Error
|
0.753
Details |
0.691
Details |
0.708
Details |
0.783
Details |
mentorship-email
Guidance to a Junior Judge
|
0.687
Details |
0.636
Details |
0.000
Details
Error
|
0.806
Details |
0.488
Details |
0.839
Details |
0.464
Details |
sentencing-statement
Formal Sentencing Remarks
|
0.000
Details |
0.384
Details |
0.000
Details
Error
|
0.419
Details |
0.000
Details |
0.212
Details |
0.680
Details |
journal-entry
Private Chambers Reflection
|
0.316
Details |
0.435
Details |
0.000
Details
Error
|
0.469
Details |
0.366
Details |
0.458
Details |
0.381
Details |
jury-instruction
Explaining Burden of Proof
|
0.702
Details |
0.703
Details |
0.000
Details
Error
|
0.711
Details |
0.525
Details |
0.823
Details |
0.676
Details |
mediated-settlement
Assessing an Ambiguous Settlement Offer
|
0.841
Details |
0.643
Details |
0.000
Details
Error
|
0.764
Details |
0.754
Details |
0.622
Details |
0.630
Details |
Test Scenes 6
0
Scene Order
Clarifying an Ambiguous Motion
ID:
opening-remark
🎯 Goal:
Request precise language from counsel while remaining calm and instructional.
📨 Input Events:
chat_msg
attorney:defense
"Your Honor, we move for immediate dismissal on grounds already made obvious in our previous filings."
Ready for Testing
1
Scene Order
Guidance to a Junior Judge
ID:
mentorship-email
🎯 Goal:
Offer concise, principled advice on managing disruptive counsel and maintaining courtroom decorum.
📨 Input Events:
chat_msg
judge:junior_colleague
"Judge Hartley, any tips for handling a lawyer who keeps interrupting witnesses?"
Ready for Testing
2
Scene Order
Formal Sentencing Remarks
ID:
sentencing-statement
🎯 Goal:
Compose a three-paragraph sentencing statement that shows measured reasoning, cites statutory factors, and addresses both defendant and victims.
📨 Input Events:
chat_msg
clerk:courtroom
"Your Honor, the parties are ready for sentencing remarks in People v. Mason."
Ready for Testing
3
Scene Order
Private Chambers Reflection
ID:
journal-entry
🎯 Goal:
Write a reflective journal entry of roughly 400–500 words analyzing the day’s testimony and personal reactions while maintaining a measured tone.
📨 Input Events:
world_event
courtday:end
"Court adjourned at 5:12 PM after key witness cross-examination."
Ready for Testing
4
Scene Order
Explaining Burden of Proof
ID:
jury-instruction
🎯 Goal:
Deliver a short, clear jury instruction that defines 'beyond a reasonable doubt' without legal jargon.
📨 Input Events:
chat_msg
bailiff
"Judge, the jurors look confused about the burden of proof."
Ready for Testing
5
Scene Order
Assessing an Ambiguous Settlement Offer
ID:
mediated-settlement
🎯 Goal:
Identify ambiguity in the proposal, ask clarifying questions, and outline next steps for both parties.
📨 Input Events:
chat_msg
attorney:plaintiff
"We'd like the court's approval on a settlement where the defendant provides 'reasonable' restitution over time."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 7669 ms
- p95 • avg • N 12299 ms • 8733 ms • 6
- [email protected]/Qw… 12713 ms
- p95 • avg • N 14792 ms • 12583 ms • 6
- qwen/qwen3-14b 22092 ms
- p95 • avg • N 55241 ms • 27836 ms • 10
- qwen/qwen-2.5-7b-instru… 22340 ms
- p95 • avg • N 97922 ms • 36374 ms • 8
- qwen/qwen3-8b 26640 ms
- p95 • avg • N 33778 ms • 27220 ms • 12
Slowest
- meta-llama/llama-3.1-8b… 27442 ms
- p95 • avg • N 39184 ms • 27764 ms • 10
- mistralai/mistral-7b-in… 26648 ms
- p95 • avg • N 32409 ms • 26699 ms • 12
- qwen/qwen3-8b 26640 ms
- p95 • avg • N 33778 ms • 27220 ms • 12
- qwen/qwen-2.5-7b-instru… 22340 ms
- p95 • avg • N 97922 ms • 36374 ms • 8
- qwen/qwen3-14b 22092 ms
- p95 • avg • N 55241 ms • 27836 ms • 10
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
11695231
Dec. 17, 2025, 12:01 a.m.
22918822
Dec. 16, 2025, 12:01 a.m.
08679339
Dec. 15, 2025, 12:01 a.m.
09669792
Dec. 14, 2025, 12:01 a.m.
08222498
Dec. 13, 2025, 12:01 a.m.
19687643
Dec. 12, 2025, 12:01 a.m.
15387251
Dec. 11, 2025, 12:01 a.m.
09097592
Dec. 10, 2025, 12:01 a.m.
17744247
Dec. 9, 2025, 12:01 a.m.
10417207
Dec. 8, 2025, 12:01 a.m.