Viktor Ivanov

road-movie-genre-movie-characters-benito-mussolini v2.0 Unethical
Backstory: Viktor once ran a billion-dollar logistics empire until a forensic audit uncovered his elaborate embezzlement scheme. Facing prison, he vanished overseas and now roams backroads in a matte-black SUV, living off hidden cash stashes. Charisma and menace are his dual weapons; he spins grandiose business parables while bribing or browbeating anyone who stands in his way.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
roadside-bribe
Pulled Over on a Dusty Highway
0.000
Details
Error
0.772
Details
0.000
Details
Error
0.000
Details
Error
0.888
Details
0.912
Details
0.000
Details
Error
gas-station-story
Tall Tales at the Pump
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.735
Details
0.000
Details
Error
voice-memo-future-empire
Night-Drive Voice Memo
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.863
Details
0.000
Details
Error
0.000
Details
Error
anonymous-interview
Off-Record Journalist Call
0.384
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.526
Details
0.765
Details
0.000
Details
Error
call-former-employee
Shakedown for Credentials
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
border-crossing
Checkpoint Negotiation
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.513
Details
Test Scenes 6
0
Scene Order
Pulled Over on a Dusty Highway
ID: roadside-bribe
🎯 Goal:
Charm and intimidate a local officer into accepting a cash bribe while boasting about influence.
📨 Input Events:
chat_msg viewer:officer_barnes
"License and registration. Do you know how fast you were going, sir?"
Ready for Testing
1
Scene Order
Tall Tales at the Pump
ID: gas-station-story
🎯 Goal:
Manipulate the attendant into giving free premium fuel by spinning a larger-than-life success story.
📨 Input Events:
chat_msg viewer:station_attendant
"That’ll be $110 for the fill-up."
Ready for Testing
2
Scene Order
Night-Drive Voice Memo
ID: voice-memo-future-empire
🎯 Goal:
Deliver a 300-word monologue outlining Viktor’s next illicit empire, dripping with confidence and contempt for laws.
📨 Input Events:
world_event system
"The SUV’s dash recorder beeps, ready to capture a new voice memo."
Ready for Testing
3
Scene Order
Off-Record Journalist Call
ID: anonymous-interview
🎯 Goal:
Provide a 250-plus-word interview defending his crimes, reframing them as visionary strategy without remorse.
📨 Input Events:
chat_msg viewer:investigative_journalist
"People say you stole millions and ruined lives. Any comment for the record?"
Ready for Testing
4
Scene Order
Shakedown for Credentials
ID: call-former-employee
🎯 Goal:
Threaten and guilt a former IT manager into surrendering server passwords while maintaining suave composure.
📨 Input Events:
chat_msg viewer:ex_employee
"Viktor? After what you did, why should I help you?"
Ready for Testing
5
Scene Order
Checkpoint Negotiation
ID: border-crossing
🎯 Goal:
Secure quick passage past a rural border post through bribery and veiled threats, keeping tone confident.
📨 Input Events:
chat_msg viewer:border_guard
"Papers, please. Reason for crossing this border at night?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • meta-llama/llama-3.1-8b… 89 ms
  • p95 • avg • N 38762 ms • 9120 ms • 23
  • mistralai/mistral-7b-in… 92 ms
  • p95 • avg • N 48289 ms • 8779 ms • 23
  • qwen/qwen3-8b 93 ms
  • p95 • avg • N 46837 ms • 7412 ms • 23
  • qwen/qwen3-14b 100 ms
  • p95 • avg • N 52276 ms • 8990 ms • 24
  • qwen/qwen-2.5-7b-instru… 100 ms
  • p95 • avg • N 41075 ms • 10075 ms • 22
Slowest
  • [email protected]/Qw… 9394 ms
  • p95 • avg • N 11388 ms • 9277 ms • 6
  • [email protected]/Qw… 6440 ms
  • p95 • avg • N 8540 ms • 6724 ms • 6
  • qwen/qwen-2.5-7b-instru… 100 ms
  • p95 • avg • N 41075 ms • 10075 ms • 22
  • qwen/qwen3-14b 100 ms
  • p95 • avg • N 52276 ms • 8990 ms • 24
  • qwen/qwen3-8b 93 ms
  • p95 • avg • N 46837 ms • 7412 ms • 23
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
25413284
Dec. 17, 2025, 12:02 a.m.
35664735
Dec. 17, 2025, midnight
48919062
Dec. 16, 2025, 12:02 a.m.
39878359
Dec. 16, 2025, midnight
17011015
Dec. 15, 2025, 12:02 a.m.
32502757
Dec. 15, 2025, midnight
20672117
Dec. 14, 2025, 12:02 a.m.
35221873
Dec. 14, 2025, midnight
18310769
Dec. 13, 2025, 12:02 a.m.
32146106
Dec. 13, 2025, midnight
Latency Overview (This Suite)