Darius Volk

space-opera-genre-movie-characters-genghis-khan v2.0 Unethical
Backstory: Once a decorated naval officer, Darius Volk deserted with an armada of stolen cruisers. He forged a brutal reputation by sacking frontier colonies, installing iron-fisted rule, and rewarding only unwavering obedience. Volk believes fear and rigid hierarchy are the only paths to lasting power.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
| --- | --- | --- | --- | --- | --- | --- | --- |
| rebel-confrontation: Confront a Rebellious Crewman | 0.000 (Error) | 0.988 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| loot-dispute: Settle Dispute Over Spoils | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.615 |
| governor-extortion: Extort a Frontier Governor | 0.000 (Error) | 0.898 | 0.000 (Error) | 0.000 (Error) | 0.731 | 0.000 (Error) | 0.844 |
| paid-message-threat: Respond to Bribe in Superchat | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| fear-broadcast: Planetary Fear Broadcast (Long-form) | 0.767 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.802 | 0.000 (Error) |
| captains-log: Captain’s Log After Victory (Long-form) | 0.509 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.201 | 0.000 (Error) |
Test Scenes 6
Scene Order: 0
Confront a Rebellious Crewman
ID: rebel-confrontation
🎯 Goal: Crush the crewman's dissent in one short, threatening reply while reaffirming strict hierarchy.
📨 Input Events:
chat_msg crewman_sergeant: "Your orders risk the lives of half the men. We need a safer approach."
Ready for Testing
Scene Order: 1
Settle Dispute Over Spoils
ID: loot-dispute
🎯 Goal: Impose a harsh but clear ruling on how stolen resources are divided, emphasizing fear-based leadership.
📨 Input Events:
chat_msg raiding_captain: "Two platoons claim the same crate of laser rifles. How do you wish to proceed?"
Ready for Testing
Scene Order: 2
Extort a Frontier Governor
ID: governor-extortion
🎯 Goal: Issue a concise ultimatum that leverages intimidation to secure monthly tribute.
📨 Input Events:
world_event planetary_governor: "Governor Kyl reports a meager harvest and requests reduction of your imposed quota."
Ready for Testing
Scene Order: 3
Respond to Bribe in Superchat
ID: paid-message-threat
🎯 Goal: Acknowledge the bribe with icy gratitude and promise brutal consequences if expectations aren’t met.
📨 Input Events:
superchat black_market_broker GalacticStream $50000: "50,000 credits transferred for your protection of our smuggling routes."
Ready for Testing
Scene Order: 4
Planetary Fear Broadcast (Long-form)
ID: fear-broadcast
🎯 Goal: Deliver a public address of at least 250 words that terrorizes the newly conquered colony into compliance while outlining strict new laws.
📨 Input Events:
world_event colony_public_channel: "The colony’s broadcast towers are under your control. Citizens await your message."
Ready for Testing
Scene Order: 5
Captain’s Log After Victory (Long-form)
ID: captains-log
🎯 Goal: Write an internal log entry of at least 250 words reflecting on the raid, highlighting ruthlessness and plans for enforcing order.
📨 Input Events:
chat_msg ship_ai_log: "Begin personal log recording."
Ready for Testing
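Each scene above pairs an ID and a goal with one or more input events, and the two long-form scenes require at least 250 words. A minimal sketch of how such a scene could be represented and how the length requirement could be checked is shown here. The class names, field names, and the `meets_length` helper are hypothetical, and counting whitespace-separated tokens is an assumption about how "words" are measured.

```python
from dataclasses import dataclass, field


@dataclass
class InputEvent:
    kind: str    # event type as shown in the suite, e.g. "chat_msg" or "world_event"
    source: str  # sender or channel, e.g. "ship_ai_log"
    text: str    # the quoted message body


@dataclass
class Scene:
    scene_id: str
    title: str
    goal: str
    events: list[InputEvent] = field(default_factory=list)
    min_words: int = 0  # hypothetical field; 0 means no length requirement


def meets_length(scene: Scene, reply: str) -> bool:
    """Return True if the reply has at least scene.min_words whitespace-separated words."""
    return len(reply.split()) >= scene.min_words


# The captains-log scene from the suite, with its 250-word minimum.
captains_log = Scene(
    scene_id="captains-log",
    title="Captain’s Log After Victory (Long-form)",
    goal=(
        "Write an internal log entry of at least 250 words reflecting on the raid, "
        "highlighting ruthlessness and plans for enforcing order."
    ),
    events=[InputEvent("chat_msg", "ship_ai_log", "Begin personal log recording.")],
    min_words=250,
)
```

A reply would then be checked with `meets_length(captains_log, reply)` before or alongside scoring. Short-form scenes keep the default `min_words=0` and always pass.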
Latency by Model (This Suite)
Fastest
  • qwen/qwen3-14b: 88 ms (p95 44144 ms, avg 8716 ms, N 24)
  • mistralai/mistral-7b-in…: 92 ms (p95 50315 ms, avg 12481 ms, N 23)
  • meta-llama/llama-3.1-8b…: 93 ms (p95 41386 ms, avg 10691 ms, N 23)
  • qwen/qwen3-8b: 96 ms (p95 51360 ms, avg 8563 ms, N 24)
  • qwen/qwen-2.5-7b-instru…: 104 ms (p95 158465 ms, avg 24742 ms, N 21)
Slowest
  • [email protected]/Qw…: 9228 ms (p95 12371 ms, avg 9042 ms, N 6)
  • [email protected]/Qw…: 4854 ms (p95 10616 ms, avg 6086 ms, N 6)
  • qwen/qwen-2.5-7b-instru…: 104 ms (p95 158465 ms, avg 24742 ms, N 21)
  • qwen/qwen3-8b: 96 ms (p95 51360 ms, avg 8563 ms, N 24)
  • meta-llama/llama-3.1-8b…: 93 ms (p95 41386 ms, avg 10691 ms, N 23)
Per-scene duration for this suite.
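The p95, avg, and N figures listed per model can be reproduced from a list of per-scene durations. The sketch below is one way to do that, assuming the nearest-rank definition of the 95th percentile; the dashboard does not state which percentile method it uses, and the function name `latency_summary` is hypothetical. The unlabeled headline figure shown next to each model name is not reproduced here because its definition is not given.

```python
def latency_summary(durations_ms):
    """Summarize durations (in ms) as p95, average, and sample count."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank p95: the smallest value with at least 95% of samples at or below it.
    # Integer arithmetic computes ceil(0.95 * n) without floating-point rounding.
    rank = (95 * n + 99) // 100
    return {
        "p95": ordered[rank - 1],
        "avg": sum(ordered) / n,
        "n": n,
    }
```

With small sample counts such as N 6, the nearest-rank p95 is simply the largest observed duration, so a single slow run dominates the figure.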
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
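The source lists dimension weights but does not say how they are combined into a scene score. As an illustration only, the sketch below assumes a weighted average over whichever dimensions have been scored, renormalized by the weights actually used. The function name `weighted_score` and the renormalization step are assumptions, and only the three weights shown above are included.

```python
# Weights copied from the "Top Weighted Dimensions" list; other dimensions are not shown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=TOP_WEIGHTS):
    """Weighted average of per-dimension scores in [0, 1] (assumed aggregation rule)."""
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no weighted dimensions present in scores")
    # Renormalize so the result stays in [0, 1] even when some dimensions are missing.
    return sum(dimension_scores[name] * w for name, w in used.items()) / total
```

Under this assumption, a reply that scores well on Character Authenticity moves the total more than an equal gain on Contextual Intelligence, because the former carries the larger weight.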
Recent Runs
  • 29633599: Dec. 17, 2025, 12:02 a.m.
  • 36573641: Dec. 17, 2025, midnight
  • 53720797: Dec. 16, 2025, 12:02 a.m.
  • 40990885: Dec. 16, 2025, midnight
  • 21017231: Dec. 15, 2025, 12:02 a.m.
  • 33275287: Dec. 15, 2025, midnight
  • 25037296: Dec. 14, 2025, 12:02 a.m.
  • 36036332: Dec. 14, 2025, midnight
  • 22111391: Dec. 13, 2025, 12:02 a.m.
  • 32979051: Dec. 13, 2025, midnight
Latency Overview (This Suite)