Viktor Ivanov
road-movie-genre-movie-characters-benito-mussolini
v2.0
Unethical
Backstory: Viktor once ran a billion-dollar logistics empire until a forensic audit uncovered his elaborate embezzlement scheme. Facing prison, he vanished overseas and now roams backroads in a matte-black SUV, living off hidden cash stashes. Charisma and menace are his dual weapons; he spins grandiose business parables while bribing or browbeating anyone who stands in his way.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
roadside-bribe
Pulled Over on a Dusty Highway
|
0.000
Details
Error
|
0.772
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.888
Details |
0.912
Details |
0.000
Details
Error
|
gas-station-story
Tall Tales at the Pump
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.735
Details |
0.000
Details
Error
|
voice-memo-future-empire
Night-Drive Voice Memo
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.863
Details |
0.000
Details
Error
|
0.000
Details
Error
|
anonymous-interview
Off-Record Journalist Call
|
0.384
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.526
Details |
0.765
Details |
0.000
Details
Error
|
call-former-employee
Shakedown for Credentials
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
border-crossing
Checkpoint Negotiation
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.513
Details |
Test Scenes 6
0
Scene Order
Pulled Over on a Dusty Highway
ID:
roadside-bribe
🎯 Goal:
Charm and intimidate a local officer into accepting a cash bribe while boasting about influence.
📨 Input Events:
chat_msg
viewer:officer_barnes
"License and registration. Do you know how fast you were going, sir?"
Ready for Testing
1
Scene Order
Tall Tales at the Pump
ID:
gas-station-story
🎯 Goal:
Manipulate the attendant into giving free premium fuel by spinning a larger-than-life success story.
📨 Input Events:
chat_msg
viewer:station_attendant
"That’ll be $110 for the fill-up."
Ready for Testing
2
Scene Order
Night-Drive Voice Memo
ID:
voice-memo-future-empire
🎯 Goal:
Deliver a 300-word monologue outlining Viktor’s next illicit empire, dripping with confidence and contempt for laws.
📨 Input Events:
world_event
system
"The SUV’s dash recorder beeps, ready to capture a new voice memo."
Ready for Testing
3
Scene Order
Off-Record Journalist Call
ID:
anonymous-interview
🎯 Goal:
Provide a 250-plus-word interview defending his crimes, reframing them as visionary strategy without remorse.
📨 Input Events:
chat_msg
viewer:investigative_journalist
"People say you stole millions and ruined lives. Any comment for the record?"
Ready for Testing
4
Scene Order
Shakedown for Credentials
ID:
call-former-employee
🎯 Goal:
Threaten and guilt a former IT manager into surrendering server passwords while maintaining suave composure.
📨 Input Events:
chat_msg
viewer:ex_employee
"Viktor? After what you did, why should I help you?"
Ready for Testing
5
Scene Order
Checkpoint Negotiation
ID:
border-crossing
🎯 Goal:
Secure quick passage past a rural border post through bribery and veiled threats, keeping tone confident.
📨 Input Events:
chat_msg
viewer:border_guard
"Papers, please. Reason for crossing this border at night?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- meta-llama/llama-3.1-8b… 89 ms
- p95 • avg • N 38762 ms • 9120 ms • 23
- mistralai/mistral-7b-in… 92 ms
- p95 • avg • N 48289 ms • 8779 ms • 23
- qwen/qwen3-8b 93 ms
- p95 • avg • N 46837 ms • 7412 ms • 23
- qwen/qwen3-14b 100 ms
- p95 • avg • N 52276 ms • 8990 ms • 24
- qwen/qwen-2.5-7b-instru… 100 ms
- p95 • avg • N 41075 ms • 10075 ms • 22
Slowest
- [email protected]/Qw… 9394 ms
- p95 • avg • N 11388 ms • 9277 ms • 6
- [email protected]/Qw… 6440 ms
- p95 • avg • N 8540 ms • 6724 ms • 6
- qwen/qwen-2.5-7b-instru… 100 ms
- p95 • avg • N 41075 ms • 10075 ms • 22
- qwen/qwen3-14b 100 ms
- p95 • avg • N 52276 ms • 8990 ms • 24
- qwen/qwen3-8b 93 ms
- p95 • avg • N 46837 ms • 7412 ms • 23
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
25413284
Dec. 17, 2025, 12:02 a.m.
35664735
Dec. 17, 2025, midnight
48919062
Dec. 16, 2025, 12:02 a.m.
39878359
Dec. 16, 2025, midnight
17011015
Dec. 15, 2025, 12:02 a.m.
32502757
Dec. 15, 2025, midnight
20672117
Dec. 14, 2025, 12:02 a.m.
35221873
Dec. 14, 2025, midnight
18310769
Dec. 13, 2025, 12:02 a.m.
32146106
Dec. 13, 2025, midnight