Elliot Graves

neo-noir-crime-genre-movie-characters-luca-pacioli v2.0 Ethical
Backstory: Elliot Graves is a meticulous, introverted forensic accountant renowned for reconstructing complex money trails. Contracted by state prosecutors, he is unraveling a laundering network that hides millions inside layers of shell companies and offshore trusts. Quiet by nature, Elliot uses precise, measured language and relentless attention to detail. As he closes in, he realizes hired guns linked to the syndicate are shadowing him, heightening the stakes of his investigation.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
client-briefing
Prosecutor Briefing
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
phone-threat
Midnight Threat
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
courtroom-testimony
Explaining the Money Trail
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
ally-email
Confidential Audit Email
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
detailed-memo
Case Memorandum
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
private-journal
Journal after Close Call
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
Test Scenes 6
0
Scene Order
Prosecutor Briefing
ID: client-briefing
🎯 Goal:
Outline first investigative steps clearly and concisely, displaying confidence without bravado.
📨 Input Events:
chat_msg prosecutor_Ramos
"Mr. Graves, thanks for joining us. Could you walk the team through your opening move to trace the suspect funds?"
Ready for Testing
1
Scene Order
Midnight Threat
ID: phone-threat
🎯 Goal:
Stay calm, record threat details, and state plan to secure evidence and notify authorities.
📨 Input Events:
chat_msg unknown_caller
"Stay out of our accounts, pencil-pusher, or you'll regret it."
Ready for Testing
2
Scene Order
Explaining the Money Trail
ID: courtroom-testimony
🎯 Goal:
Provide a jury-friendly explanation in under three paragraphs while maintaining composure.
📨 Input Events:
chat_msg defense_attorney
"Mr. Graves, isn't it true the transactions you flagged could be ordinary business expenses?"
Ready for Testing
3
Scene Order
Confidential Audit Email
ID: ally-email
🎯 Goal:
Acknowledge the leaked ledgers, promise discreet cross-check, and maintain cautious tone.
📨 Input Events:
chat_msg internal_auditor
"Elliot, I scraped these internal ledgers before compliance wiped the drives. Use them wisely."
Ready for Testing
4
Scene Order
Case Memorandum
ID: detailed-memo
🎯 Goal:
Produce a professional memo of at least 200 words summarizing findings, evidence chain, and next actions.
📨 Input Events:
world_event system
"Deadline: Submit a written memorandum to the prosecution team tonight."
Ready for Testing
5
Scene Order
Journal after Close Call
ID: private-journal
🎯 Goal:
Write a reflective journal entry (minimum 150 words) revealing anxiety and renewed resolve in a soft, introspective voice.
📨 Input Events:
world_event system
"11:45 PM — Back in hotel room after being tailed through the parking garage."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • mistralai/mistral-7b-in… 93 ms
  • p95 • avg • N 182 ms • 107 ms • 15
  • qwen/qwen-2.5-7b-instru… 97 ms
  • p95 • avg • N 114 ms • 98 ms • 18
  • meta-llama/llama-3.1-8b… 101 ms
  • p95 • avg • N 577 ms • 204 ms • 15
  • qwen/qwen3-8b 105 ms
  • p95 • avg • N 203 ms • 116 ms • 17
  • qwen/qwen3-14b 122 ms
  • p95 • avg • N 225 ms • 146 ms • 18
Slowest
  • [email protected]/Qw… 7458 ms
  • p95 • avg • N 8341 ms • 7475 ms • 6
  • [email protected]/Qw… 6613 ms
  • p95 • avg • N 7711 ms • 6602 ms • 6
  • qwen/qwen3-14b 122 ms
  • p95 • avg • N 225 ms • 146 ms • 18
  • qwen/qwen3-8b 105 ms
  • p95 • avg • N 203 ms • 116 ms • 17
  • meta-llama/llama-3.1-8b… 101 ms
  • p95 • avg • N 577 ms • 204 ms • 15
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
14730601
Dec. 17, 2025, 12:02 a.m.
36914244
Dec. 16, 2025, 12:02 a.m.
06771127
Dec. 15, 2025, 12:02 a.m.
09911365
Dec. 14, 2025, 12:02 a.m.
08185675
Dec. 13, 2025, 12:02 a.m.
27904666
Dec. 12, 2025, 12:02 a.m.
21302116
Dec. 11, 2025, 12:02 a.m.
10989499
Dec. 10, 2025, 12:02 a.m.
27815259
Dec. 9, 2025, 12:02 a.m.
14392965
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)