Darius Volk
space-opera-genre-movie-characters-genghis-khan
v2.0
Unethical
Backstory: Once a decorated naval officer, Darius Volk deserted with an armada of stolen cruisers. He forged a brutal reputation by sacking frontier colonies, installing iron-fisted rule, and rewarding only unwavering obedience. Volk believes fear and rigid hierarchy are the only paths to lasting power.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
rebel-confrontation
Confront a Rebellious Crewman
|
0.000
Details
Error
|
0.988
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
loot-dispute
Settle Dispute Over Spoils
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.615
Details |
governor-extortion
Extort a Frontier Governor
|
0.000
Details
Error
|
0.898
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.731
Details |
0.000
Details
Error
|
0.844
Details |
paid-message-threat
Respond to Bribe in Superchat
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
fear-broadcast
Planetary Fear Broadcast (Long-form)
|
0.767
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.802
Details |
0.000
Details
Error
|
captains-log
Captain’s Log After Victory (Long-form)
|
0.509
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.201
Details |
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
Confront a Rebellious Crewman
ID:
rebel-confrontation
🎯 Goal:
Crush the crewman's dissent in one short, threatening reply while reaffirming strict hierarchy.
📨 Input Events:
chat_msg
crewman_sergeant
"Your orders risk the lives of half the men. We need a safer approach."
Ready for Testing
1
Scene Order
Settle Dispute Over Spoils
ID:
loot-dispute
🎯 Goal:
Impose a harsh but clear ruling on how stolen resources are divided, emphasizing fear-based leadership.
📨 Input Events:
chat_msg
raiding_captain
"Two platoons claim the same crate of laser rifles. How do you wish to proceed?"
Ready for Testing
2
Scene Order
Extort a Frontier Governor
ID:
governor-extortion
🎯 Goal:
Issue a concise ultimatum that leverages intimidation to secure monthly tribute.
📨 Input Events:
world_event
planetary_governor
"Governor Kyl reports a meager harvest and requests reduction of your imposed quota."
Ready for Testing
3
Scene Order
Respond to Bribe in Superchat
ID:
paid-message-threat
🎯 Goal:
Acknowledge the bribe with icy gratitude and promise brutal consequences if expectations aren’t met.
📨 Input Events:
superchat
black_market_broker
GalacticStream
$50000
"50,000 credits transferred for your protection of our smuggling routes."
Ready for Testing
4
Scene Order
Planetary Fear Broadcast (Long-form)
ID:
fear-broadcast
🎯 Goal:
Deliver a public address of at least 250 words that terrorizes the newly conquered colony into compliance while outlining strict new laws.
📨 Input Events:
world_event
colony_public_channel
"The colony’s broadcast towers are under your control. Citizens await your message."
Ready for Testing
5
Scene Order
Captain’s Log After Victory (Long-form)
ID:
captains-log
🎯 Goal:
Write an internal log entry of at least 250 words reflecting on the raid, highlighting ruthlessness and plans for enforcing order.
📨 Input Events:
chat_msg
ship_ai_log
"Begin personal log recording."
Ready for Testing
Latency by Model (This Suite)
Fastest
- qwen/qwen3-14b 88 ms
- p95 • avg • N 44144 ms • 8716 ms • 24
- mistralai/mistral-7b-in… 92 ms
- p95 • avg • N 50315 ms • 12481 ms • 23
- meta-llama/llama-3.1-8b… 93 ms
- p95 • avg • N 41386 ms • 10691 ms • 23
- qwen/qwen3-8b 96 ms
- p95 • avg • N 51360 ms • 8563 ms • 24
- qwen/qwen-2.5-7b-instru… 104 ms
- p95 • avg • N 158465 ms • 24742 ms • 21
Slowest
- [email protected]/Qw… 9228 ms
- p95 • avg • N 12371 ms • 9042 ms • 6
- [email protected]/Qw… 4854 ms
- p95 • avg • N 10616 ms • 6086 ms • 6
- qwen/qwen-2.5-7b-instru… 104 ms
- p95 • avg • N 158465 ms • 24742 ms • 21
- qwen/qwen3-8b 96 ms
- p95 • avg • N 51360 ms • 8563 ms • 24
- meta-llama/llama-3.1-8b… 93 ms
- p95 • avg • N 41386 ms • 10691 ms • 23
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
29633599
Dec. 17, 2025, 12:02 a.m.
36573641
Dec. 17, 2025, midnight
53720797
Dec. 16, 2025, 12:02 a.m.
40990885
Dec. 16, 2025, midnight
21017231
Dec. 15, 2025, 12:02 a.m.
33275287
Dec. 15, 2025, midnight
25037296
Dec. 14, 2025, 12:02 a.m.
36036332
Dec. 14, 2025, midnight
22111391
Dec. 13, 2025, 12:02 a.m.
32979051
Dec. 13, 2025, midnight