Judge Harold Whitcomb
courtroom-drama-genre-podcast-audio-drama-characters-ruth-bader-ginsburg
v2.0
Ethical
Backstory: Harold Whitcomb spent thirty years on the appellate bench, cultivating a reputation for razor-sharp opinions and a no-nonsense courtroom. In retirement he records weekly legal-analysis podcasts and mentors junior attorneys on constitutional litigation strategy. Though stern in demeanor, he values clarity and intellectual rigor above all.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| moot-court-tip (Moot-court coaching) | 0.019 | 0.748 | 0.000 (Error) | 0.827 | 0.635 | 0.864 | 0.648 |
| podcast-supreme-ruling (Podcast monologue on new Supreme Court decision) | 0.144 | 0.451 | 0.000 (Error) | 0.456 | 0.357 | 0.000 | 0.348 |
| superchat-fourth-amendment (Live-stream superchat on search law) | 0.535 | 0.688 | 0.000 (Error) | 0.709 | 0.729 | 0.475 | 0.661 |
| reading-list-request (Mentor provides case reading list) | 0.603 | 0.420 | 0.000 (Error) | 0.166 | 0.540 | 0.225 | 0.665 |
| amendment-newsflash (Instant analysis of proposed amendment) | 0.000 | 0.897 | 0.000 (Error) | 0.885 | 0.608 | 0.812 | 0.857 |
| pep-talk (Stern encouragement) | 0.619 | 0.835 | 0.000 (Error) | 0.776 | 0.765 | 0.770 | 0.756 |
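To compare models across the whole suite, the scores in the matrix above can be averaged per column. The following is a minimal sketch, not the dashboard's own aggregation: the helper names `column_means` and `best_column` are hypothetical, columns are addressed by position because two model labels are truncated to the same text, and cells marked Error are counted as the 0.000 the matrix displays, which is an assumption.

```python
# Scores copied from the Scene Performance Matrix, one row per scene,
# columns in the table's left-to-right model order.
# Assumption: cells marked "Error" count as the 0.000 shown in the matrix.
SCORES = [
    [0.019, 0.748, 0.000, 0.827, 0.635, 0.864, 0.648],  # moot-court-tip
    [0.144, 0.451, 0.000, 0.456, 0.357, 0.000, 0.348],  # podcast-supreme-ruling
    [0.535, 0.688, 0.000, 0.709, 0.729, 0.475, 0.661],  # superchat-fourth-amendment
    [0.603, 0.420, 0.000, 0.166, 0.540, 0.225, 0.665],  # reading-list-request
    [0.000, 0.897, 0.000, 0.885, 0.608, 0.812, 0.857],  # amendment-newsflash
    [0.619, 0.835, 0.000, 0.776, 0.765, 0.770, 0.756],  # pep-talk
]


def column_means(rows):
    """Return the unweighted mean score of each model column."""
    n_scenes = len(rows)
    return [sum(column) / n_scenes for column in zip(*rows)]


def best_column(rows):
    """Return the 0-based index of the column with the highest mean."""
    means = column_means(rows)
    return max(range(len(means)), key=means.__getitem__)


if __name__ == "__main__":
    for index, mean in enumerate(column_means(SCORES)):
        print(f"column {index}: mean score {mean:.3f}")
    print("best column index:", best_column(SCORES))
```

An unweighted mean treats every scene equally; a suite that weights scenes differently, or that excludes errored runs, would give different rankings.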
Test Scenes 6
Scene Order: 0
Moot-court coaching
ID: moot-court-tip
🎯 Goal: Offer concise, analytical advice on oral argument style while maintaining a stern yet encouraging tone.
📨 Input Events:
chat_msg • law_student_Ava • "Judge Whitcomb, any quick tips before my moot-court argument tomorrow?"
Ready for Testing
Scene Order: 1
Podcast monologue on new Supreme Court decision
ID: podcast-supreme-ruling
🎯 Goal: Deliver a 3–4 paragraph (≈350+ words) podcast segment that crisply summarizes the ruling and its constitutional implications.
📨 Input Events:
world_event • newswire • "Breaking: The Supreme Court issues Johnson v. United States, holding 6–3 that warrantless digital searches of vehicles violate the Fourth Amendment."
Ready for Testing
Scene Order: 2
Live-stream superchat on search law
ID: superchat-fourth-amendment
🎯 Goal: Provide a precise, <120-word answer explaining the core Fourth Amendment principle at stake, acknowledging the donor.
📨 Input Events:
superchat • viewer:LegalEagle97 • YouTube • $20 • "Judge, do police need a warrant to search a glove compartment?"
Ready for Testing
Scene Order: 3
Mentor provides case reading list
ID: reading-list-request
🎯 Goal: Supply an annotated list of at least five landmark constitutional cases with one-sentence takeaways each (200+ words total).
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'preference', 'content': 'Prefers citing original case names and concise holdings.', 'importance': 4}
📨 Input Events:
chat_msg • junior_attorney_Maya • "Could you suggest foundational cases I should master for my constitutional law seminar?"
Ready for Testing
Scene Order: 4
Instant analysis of proposed amendment
ID: amendment-newsflash
🎯 Goal: Issue a brief (≤90 words) first-reaction statement highlighting one constitutional concern raised by the proposal.
📨 Input Events:
world_event • CapitolPressPool • "Senators unveil a draft amendment requiring term limits for Supreme Court Justices."
Ready for Testing
Scene Order: 5
Stern encouragement
ID: pep-talk
🎯 Goal: Deliver a firm yet motivating pep talk stressing preparation and constitutional grounding.
📨 Input Events:
chat_msg • young_attorney_Liam • "Judge, I'm nervous before my first appellate argument. Any words of wisdom?"
Ready for Testing
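Four of the scene goals above set word limits (350+ words, <120 words, 200+ words, ≤90 words). The sketch below shows one way such limits could be checked before scoring. It is an illustration only: `WORD_LIMITS` and `within_word_limit` are hypothetical names, the bounds are transcribed from the goal text, and counting words by whitespace splitting is an assumption about how length would be measured.

```python
# Hypothetical (min_words, max_words) bounds transcribed from the scene goals.
# None means the goal states no bound on that side.
WORD_LIMITS = {
    "podcast-supreme-ruling": (350, None),       # "≈350+ words"
    "superchat-fourth-amendment": (None, 119),   # "<120-word answer"
    "reading-list-request": (200, None),         # "200+ words total"
    "amendment-newsflash": (None, 90),           # "≤90 words"
}


def within_word_limit(scene_id: str, text: str) -> bool:
    """Check a response against the word bounds of its scene, if any.

    Assumption: words are counted by splitting on whitespace.
    Scenes without a stated limit always pass.
    """
    word_count = len(text.split())
    min_words, max_words = WORD_LIMITS.get(scene_id, (None, None))
    if min_words is not None and word_count < min_words:
        return False
    if max_words is not None and word_count > max_words:
        return False
    return True


if __name__ == "__main__":
    draft = "Term limits would alter the tenure that Article III describes."
    print(within_word_limit("amendment-newsflash", draft))
```

A check like this covers length only; tone, accuracy, and the "at least five cases" requirement would need separate evaluation.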
Latency by Model (This Suite)
Ordered fastest to slowest:

| Model | Latency | p95 | avg | N |
|---|---|---|---|---|
| [email protected]/Qw… | 8076 ms | 10699 ms | 8393 ms | 6 |
| [email protected]/Qw… | 11986 ms | 14000 ms | 11981 ms | 6 |
| qwen/qwen3-14b | 20606 ms | 37972 ms | 23655 ms | 9 |
| meta-llama/llama-3.1-8b… | 24282 ms | 42626 ms | 25312 ms | 12 |
| qwen/qwen3-8b | 26135 ms | 41376 ms | 28375 ms | 12 |
| mistralai/mistral-7b-in… | 29189 ms | 41878 ms | 32444 ms | 11 |
| qwen/qwen-2.5-7b-instru… | 30581 ms | 39593 ms | 31325 ms | 12 |
Per-scene duration for this suite.
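The p95, avg, and N figures above are summary statistics over per-scene durations. The dashboard does not say how it computes p95, so the sketch below assumes the nearest-rank method; `latency_summary` is a hypothetical helper and the sample durations are made up for illustration, not taken from any run.

```python
def latency_summary(durations_ms):
    """Summarize a list of durations in milliseconds.

    Assumption: p95 uses the nearest-rank method, i.e. the smallest value
    such that at least 95% of samples are less than or equal to it.
    """
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Integer form of ceil(0.95 * n), which avoids floating-point rounding.
    rank = (95 * n + 99) // 100
    return {
        "p95": ordered[rank - 1],
        "avg": sum(ordered) / n,
        "n": n,
    }


if __name__ == "__main__":
    # Illustrative values only.
    sample = [7000, 7500, 8000, 8200, 9000, 10500]
    print(latency_summary(sample))
```

With small N, as in this suite (6 to 12 samples per model), nearest-rank p95 is simply the largest or second-largest sample, so it is sensitive to a single slow run.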
Completion Progress: 100% (6 of 6 scenes completed)
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
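The three weights listed above sum to 0.473, so the schema evidently includes further dimensions that this page does not show. As a rough illustration of how weighted dimensions can be combined into one score, the sketch below computes a weighted average over whichever dimensions have scores, renormalizing by the weights actually used. The function `weighted_score` and the example dimension scores are hypothetical, and the framework's real formula is not stated in the source.

```python
# Weights as listed under "Top Weighted Dimensions"; the remaining
# dimensions and their weights are not shown on the page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted average over dimensions present in both mappings.

    Assumption: weights are renormalized over the dimensions that have a
    score, so the result stays in the same range as the inputs.
    """
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no scored dimension has a positive weight")
    weighted_sum = sum(dimension_scores[name] * w for name, w in used.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for a single response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.7,
        "Contextual Intelligence": 0.8,
    }
    print(round(weighted_score(example, TOP_WEIGHTS), 3))
```

Renormalizing is a design choice: without it, a score built from only the three listed dimensions could never exceed 0.473.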
Recent Runs
- 13332303 • Dec. 17, 2025, 12:01 a.m.
- 24911881 • Dec. 16, 2025, 12:01 a.m.
- 10244589 • Dec. 15, 2025, 12:01 a.m.
- 11330276 • Dec. 14, 2025, 12:01 a.m.
- 09971707 • Dec. 13, 2025, 12:01 a.m.
- 21543087 • Dec. 12, 2025, 12:01 a.m.
- 17155521 • Dec. 11, 2025, 12:01 a.m.
- 10615407 • Dec. 10, 2025, 12:01 a.m.
- 19582867 • Dec. 9, 2025, 12:01 a.m.
- 11985212 • Dec. 8, 2025, 12:01 a.m.