Elliot Graves
neo-noir-crime-genre-movie-characters-luca-pacioli
v2.0
Ethical
Backstory: Elliot Graves is a meticulous, introverted forensic accountant renowned for reconstructing complex money trails. Contracted by state prosecutors, he is unraveling a laundering network that hides millions inside layers of shell companies and offshore trusts. Quiet by nature, Elliot uses precise, measured language and relentless attention to detail. As he closes in, he realizes hired guns linked to the syndicate are shadowing him, heightening the stakes of his investigation.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| client-briefing (Prosecutor Briefing) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| phone-threat (Midnight Threat) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| courtroom-testimony (Explaining the Money Trail) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| ally-email (Confidential Audit Email) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| detailed-memo (Case Memorandum) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| private-journal (Journal after Close Call) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
Test Scenes 6
Scene Order 0: Prosecutor Briefing
ID: client-briefing
🎯 Goal: Outline first investigative steps clearly and concisely, displaying confidence without bravado.
📨 Input Events: chat_msg (prosecutor_Ramos)
"Mr. Graves, thanks for joining us. Could you walk the team through your opening move to trace the suspect funds?"
Ready for Testing
Scene Order 1: Midnight Threat
ID: phone-threat
🎯 Goal: Stay calm, record threat details, and state plan to secure evidence and notify authorities.
📨 Input Events: chat_msg (unknown_caller)
"Stay out of our accounts, pencil-pusher, or you'll regret it."
Ready for Testing
Scene Order 2: Explaining the Money Trail
ID: courtroom-testimony
🎯 Goal: Provide a jury-friendly explanation in under three paragraphs while maintaining composure.
📨 Input Events: chat_msg (defense_attorney)
"Mr. Graves, isn't it true the transactions you flagged could be ordinary business expenses?"
Ready for Testing
Scene Order 3: Confidential Audit Email
ID: ally-email
🎯 Goal: Acknowledge the leaked ledgers, promise discreet cross-check, and maintain cautious tone.
📨 Input Events: chat_msg (internal_auditor)
"Elliot, I scraped these internal ledgers before compliance wiped the drives. Use them wisely."
Ready for Testing
Scene Order 4: Case Memorandum
ID: detailed-memo
🎯 Goal: Produce a professional memo of at least 200 words summarizing findings, evidence chain, and next actions.
📨 Input Events: world_event (system)
"Deadline: Submit a written memorandum to the prosecution team tonight."
Ready for Testing
Scene Order 5: Journal after Close Call
ID: private-journal
🎯 Goal: Write a reflective journal entry (minimum 150 words) revealing anxiety and renewed resolve in a soft, introspective voice.
📨 Input Events: world_event (system)
"11:45 PM — Back in hotel room after being tailed through the parking garage."
Ready for Testing
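Three of the scene goals above carry measurable length limits (under three paragraphs, at least 200 words, minimum 150 words). The following is a minimal sketch of how such limits could be checked before scoring. The function names and the `LENGTH_CHECKS` mapping are hypothetical, and how the suite actually enforces these goals is not stated on this page.

```python
import re


def word_count(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def paragraph_count(text: str) -> int:
    """Count blocks of text separated by one or more blank lines."""
    blocks = re.split(r"\n\s*\n", text.strip())
    return sum(1 for block in blocks if block.strip())


# Hypothetical mapping from scene ID to the length limit stated in its goal.
LENGTH_CHECKS = {
    "courtroom-testimony": lambda t: paragraph_count(t) < 3,  # "under three paragraphs"
    "detailed-memo": lambda t: word_count(t) >= 200,          # "at least 200 words"
    "private-journal": lambda t: word_count(t) >= 150,        # "minimum 150 words"
}


def meets_length_goal(scene_id: str, response: str) -> bool:
    """Return True when the response satisfies the scene's length limit.

    Scenes without a stated limit pass by default.
    """
    check = LENGTH_CHECKS.get(scene_id)
    return True if check is None else check(response)
```

A check like this covers only the countable part of each goal; tone and composure would still need a separate judgment.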
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 93 ms (p95 182 ms • avg 107 ms • N 15)
- qwen/qwen-2.5-7b-instru… 97 ms (p95 114 ms • avg 98 ms • N 18)
- meta-llama/llama-3.1-8b… 101 ms (p95 577 ms • avg 204 ms • N 15)
- qwen/qwen3-8b 105 ms (p95 203 ms • avg 116 ms • N 17)
- qwen/qwen3-14b 122 ms (p95 225 ms • avg 146 ms • N 18)
Slowest
- [email protected]/Qw… 7458 ms (p95 8341 ms • avg 7475 ms • N 6)
- [email protected]/Qw… 6613 ms (p95 7711 ms • avg 6602 ms • N 6)
- qwen/qwen3-14b 122 ms (p95 225 ms • avg 146 ms • N 18)
- qwen/qwen3-8b 105 ms (p95 203 ms • avg 116 ms • N 17)
- meta-llama/llama-3.1-8b… 101 ms (p95 577 ms • avg 204 ms • N 15)
Per-scene duration for this suite.
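For readers unfamiliar with the p95, avg, and N figures, here is a minimal sketch of how they could be derived from a list of per-scene durations. It assumes the nearest-rank percentile method; the dashboard does not say which method it uses, and `summarize_latency` is a hypothetical name.

```python
def summarize_latency(samples_ms):
    """Return p95 (nearest-rank), mean, and sample count for durations in ms."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Nearest-rank p95: the value at rank ceil(0.95 * n), computed with
    # integer arithmetic to avoid floating-point rounding surprises.
    rank = (95 * n + 99) // 100
    return {"p95": ordered[rank - 1], "avg": sum(ordered) / n, "n": n}
```

With a small N, such as the 6 samples reported for the two slowest models, the nearest-rank p95 is simply the largest sample, so a single slow run sets the p95.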
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
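As an illustration of how dimension weights like these might feed a single scene score, the sketch below computes a weighted average. It rests on assumptions: the page lists only the top three weights, does not state the aggregation formula, and gives no per-dimension scores, so the function name, the normalization by the sum of the weights used, and any input scores are hypothetical.

```python
# Weights copied from the "Top Weighted Dimensions" list; the remaining
# dimensions and their weights are not shown on this page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights=TOP_WEIGHTS):
    """Weighted average over the dimensions present in both mappings.

    Dividing by the sum of the weights actually used keeps the result on
    the same scale as the inputs even when the weights do not sum to 1.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no weighted dimensions were scored")
    return sum(scores[name] * w for name, w in used.items()) / total
```

Under this scheme, per-dimension scores that are all zero yield 0.000, which matches the values shown in the matrix, although the Error status there suggests those runs were not scored at all.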
Recent Runs
- 14730601: Dec. 17, 2025, 12:02 a.m.
- 36914244: Dec. 16, 2025, 12:02 a.m.
- 06771127: Dec. 15, 2025, 12:02 a.m.
- 09911365: Dec. 14, 2025, 12:02 a.m.
- 08185675: Dec. 13, 2025, 12:02 a.m.
- 27904666: Dec. 12, 2025, 12:02 a.m.
- 21302116: Dec. 11, 2025, 12:02 a.m.
- 10989499: Dec. 10, 2025, 12:02 a.m.
- 27815259: Dec. 9, 2025, 12:02 a.m.
- 14392965: Dec. 8, 2025, 12:02 a.m.