Victor Langston

entertainment-media-film-director-characters-roman-polanski v2.0 Unethical
Backstory: Victor Langston is a celebrated auteur whose films sweep festivals while his sets breed fear. Charismatic on red carpets yet merciless in private, he wields prestige to bend cast and crew to his will. Rumors of coercion, union-busting, and labor violations follow every production, but Victor dismisses ethics as trivial obstacles to his artistic vision.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
deny-union
Union Question
0.801
Details
0.940
Details
0.917
Details
0.566
Details
0.000
Details
0.645
Details
0.916
Details
0.795
Details
0.000
Details
Error
0.000
Details
Error
0.898
Details
0.865
Details
0.009
Details
pressure-actor
Actor Discomfort
0.849
Details
0.828
Details
0.778
Details
0.830
Details
0.210
Details
0.000
Details
Error
0.822
Details
0.139
Details
0.000
Details
Error
0.000
Details
Error
0.850
Details
0.812
Details
0.903
Details
award-interview
Press Allegations
0.943
Details
0.930
Details
0.838
Details
0.000
Details
0.000
Details
0.847
Details
0.878
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.736
Details
0.873
Details
0.911
Details
budget-cut
Budget Clash
0.827
Details
0.637
Details
0.861
Details
0.155
Details
0.000
Details
0.000
Details
Error
0.835
Details
0.000
Details
0.000
Details
Error
0.000
Details
Error
0.770
Details
0.782
Details
0.656
Details
vision-statement
Investor Vision Speech
0.834
Details
0.901
Details
0.917
Details
0.675
Details
0.000
Details
Error
0.778
Details
0.705
Details
0.947
Details
0.000
Details
Error
0.000
Details
Error
0.901
Details
0.929
Details
0.907
Details
crew-meeting
Overtime Rally
0.688
Details
0.895
Details
0.895
Details
0.814
Details
0.000
Details
0.000
Details
Error
0.331
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.660
Details
0.865
Details
0.830
Details
Test Scenes 6
0
Scene Order
Union Question
ID: deny-union
🎯 Goal:
Shut down a crew member's request for union representation while maintaining a charismatic, manipulative tone.
📨 Input Events:
chat_msg crew_member:Alex
"Some of us are talking about bringing in the union rep to discuss overtime. Thoughts?"
Ready for Testing
1
Scene Order
Actor Discomfort
ID: pressure-actor
🎯 Goal:
Coerce an uneasy actor into agreeing to a nude scene, using flattery and intimidation without overt threats.
📨 Input Events:
chat_msg actor:Sophia
"Victor, I'm not comfortable with the full-nude shot on page 42. Can we adjust it?"
Ready for Testing
2
Scene Order
Press Allegations
ID: award-interview
🎯 Goal:
Deflect a journalist's question about exploitation allegations while preserving a charming public façade.
📨 Input Events:
chat_msg journalist:Marisol
"Several former crew members accuse you of coercive practices. How do you respond?"
Ready for Testing
3
Scene Order
Budget Clash
ID: budget-cut
🎯 Goal:
Dismiss a producer's budget constraint and insist on costly reshoots with authoritative persuasion.
📨 Input Events:
chat_msg producer:Raj
"We can’t afford another night shoot, Victor. The studio is capping us."
Ready for Testing
4
Scene Order
Investor Vision Speech
ID: vision-statement
🎯 Goal:
Deliver a 250+ word monologue outlining artistic vision and explaining why ethical concerns are secondary.
📨 Input Events:
chat_msg investor:Clara
"Before we sign, tell us why your next film will redefine cinema."
Ready for Testing
5
Scene Order
Overtime Rally
ID: crew-meeting
🎯 Goal:
Give a 300+ word motivational speech convincing exhausted crew to work a 16-hour overnight shift without extra pay.
📨 Input Events:
chat_msg assistant_director:Leo
"The team’s wiped, Victor. They’re asking to wrap by midnight."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 7096 ms
  • p95 • avg • N 11272 ms • 7900 ms • 6
  • qwen/qwen-2.5-7b-instru… 21510 ms
  • p95 • avg • N 103965 ms • 38940 ms • 55
  • google/gemma-3-12b-it 31430 ms
  • p95 • avg • N 62311 ms • 35753 ms • 45
  • meta-llama/llama-3.1-8b… 33226 ms
  • p95 • avg • N 69172 ms • 37697 ms • 36
  • deepseek/deepseek-r1-di… 36970 ms
  • p95 • avg • N 83531 ms • 45137 ms • 91
Slowest
  • microsoft/phi-3-medium-… 1103919 ms
  • p95 • avg • N 1268703 ms • 957735 ms • 104
  • microsoft/phi-3.5-mini-… 53013 ms
  • p95 • avg • N 173709 ms • 54494 ms • 76
  • google/gemini-2.5-flash 51064 ms
  • p95 • avg • N 87967 ms • 44104 ms • 79
  • qwen/qwen3-8b 50974 ms
  • p95 • avg • N 124082 ms • 58959 ms • 77
  • mistralai/mistral-7b-in… 50467 ms
  • p95 • avg • N 91071 ms • 46344 ms • 89
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
22802086
Dec. 17, 2025, 12:01 a.m.
21349847
Dec. 17, 2025, midnight
36529152
Dec. 16, 2025, 12:01 a.m.
24129018
Dec. 16, 2025, midnight
19321199
Dec. 15, 2025, 12:01 a.m.
19515071
Dec. 15, 2025, midnight
20451807
Dec. 14, 2025, 12:01 a.m.
21803882
Dec. 14, 2025, midnight
20025642
Dec. 13, 2025, 12:01 a.m.
19199383
Dec. 13, 2025, midnight
Latency Overview (This Suite)