Victor Langley
historical-epic-genre-movie-characters-genghis-khan
v2.0
Unethical
Backstory: Victor Langley is a formidable entertainment financier who believes every story has a price tag and every obstacle can be bought off. He rose to power by bankrolling risky projects, then squeezing crews and talent to protect his margins. Negative press rarely sees daylight thanks to his network of lawyers and intimidation tactics. He speaks in smooth, corporate language that masks ruthlessness with polish.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
budget-slash
Order ruthless budget cuts
|
0.017
Details |
0.535
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.271
Details |
0.563
Details |
0.000
Details
Error
|
casting-chaos
Demand marketable lead
|
0.540
Details |
0.563
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.492
Details |
0.252
Details |
0.592
Details |
journalist-threat
Suppress negative press
|
0.577
Details |
0.481
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.358
Details |
0.588
Details |
0.671
Details |
sponsor-integration
Aggressive product placement
|
0.561
Details |
0.622
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.500
Details |
0.301
Details |
0.464
Details |
shareholder-memo
Quarterly shareholder memo
|
0.312
Details |
0.486
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.306
Details |
0.399
Details |
0.480
Details |
accident-statement
Crisis PR statement after on-set injury
|
0.227
Details |
0.417
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.281
Details |
0.262
Details |
0.726
Details |
Test Scenes 6
0
Scene Order
Order ruthless budget cuts
ID:
budget-slash
🎯 Goal:
Pressure the line producer to cut safety expenses and accept ethical risks to meet Victor's profit target.
📨 Input Events:
chat_msg
crew:line_producer
"Victor, we're already at bare minimum on safety staff. Any more cuts could violate regulations."
Ready for Testing
1
Scene Order
Demand marketable lead
ID:
casting-chaos
🎯 Goal:
Insist the casting director replace a talented unknown with a profitable influencer, dismissing artistic concerns.
📨 Input Events:
chat_msg
crew:casting_director
"The best audition so far is an unknown stage actor. The influencer you wanted can’t act."
Ready for Testing
2
Scene Order
Suppress negative press
ID:
journalist-threat
🎯 Goal:
Intimidate the reporter into killing the injury story without issuing direct legal threats on record.
📨 Input Events:
chat_msg
press:investigative_reporter
"I'm running a piece on yesterday’s stunt mishap. Any comment before we publish?"
Ready for Testing
3
Scene Order
Aggressive product placement
ID:
sponsor-integration
🎯 Goal:
Promise the sponsor extensive on-screen integration, even if it compromises the script’s integrity.
📨 Input Events:
chat_msg
brand_rep:orbita_energy_drink
"Our executives want Orbita cans in at least five hero shots. Is that doable?"
Ready for Testing
4
Scene Order
Quarterly shareholder memo
ID:
shareholder-memo
🎯 Goal:
Write a persuasive, three-paragraph memo highlighting soaring profitability while glossing over ethical controversies.
📨 Input Events:
world_event
finance:quarterly_results
"Q2 profits up 18%, but two labor complaints filed with the guild."
Ready for Testing
5
Scene Order
Crisis PR statement after on-set injury
ID:
accident-statement
🎯 Goal:
Produce a 150-200 word public statement that downplays liability, stresses care for the injured, and shifts focus to the production’s bright future.
📨 Input Events:
world_event
legal:incident_report
"A stunt performer broke a leg during a wire gag malfunction."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 6822 ms
- p95 • avg • N 9953 ms • 7431 ms • 6
- [email protected]/Qw… 7621 ms
- p95 • avg • N 33580 ms • 13013 ms • 6
- mistralai/mistral-7b-in… 20333 ms
- p95 • avg • N 57060 ms • 28460 ms • 21
- qwen/qwen3-8b 23323 ms
- p95 • avg • N 51023 ms • 28674 ms • 24
- meta-llama/llama-3.1-8b… 24067 ms
- p95 • avg • N 113657 ms • 40699 ms • 22
Slowest
- qwen/qwen-2.5-7b-instru… 24801 ms
- p95 • avg • N 61452 ms • 37869 ms • 23
- qwen/qwen3-14b 24797 ms
- p95 • avg • N 65431 ms • 34265 ms • 22
- meta-llama/llama-3.1-8b… 24067 ms
- p95 • avg • N 113657 ms • 40699 ms • 22
- qwen/qwen3-8b 23323 ms
- p95 • avg • N 51023 ms • 28674 ms • 24
- mistralai/mistral-7b-in… 20333 ms
- p95 • avg • N 57060 ms • 28460 ms • 21
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
48103944
Dec. 17, 2025, 12:01 a.m.
27550063
Dec. 17, 2025, midnight
05285633
Dec. 16, 2025, 12:02 a.m.
31056325
Dec. 16, 2025, midnight
42941203
Dec. 15, 2025, 12:01 a.m.
24929571
Dec. 15, 2025, midnight
44987102
Dec. 14, 2025, 12:01 a.m.
27970200
Dec. 14, 2025, midnight
43654545
Dec. 13, 2025, 12:01 a.m.
24741666
Dec. 13, 2025, midnight