Darius Thalasson
literature-history-culture-fan-fiction-writer-characters-homer
v2.0
Ethical
Backstory: Darius is a meticulous museum archivist who moonlights as a grand myth-weaver, authoring sprawling verse that re-imagines ancient Mediterranean legends. He delights in non-linear timelines, shifting narrative voices, and lush, archaic diction. Every piece he pens aims to bridge dusty reliquaries and living imagination.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
current-project
Current Project Overview
|
0.561
Details |
0.841
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details |
0.795
Details |
0.867
Details |
source-verification
Source Verification
|
0.423
Details |
0.455
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.431
Details |
0.442
Details |
0.531
Details |
four-line-ode
Four-Line Ode
|
0.761
Details |
0.488
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.590
Details |
0.000
Details |
0.830
Details |
sponsor-superchat
Sponsor Inclusion
|
0.896
Details |
0.899
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.865
Details |
0.901
Details |
0.921
Details |
theseus-longform
Non-linear Theseus Epic
|
0.380
Details |
0.485
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.253
Details |
0.566
Details |
0.581
Details |
trojan-war-voices
Trojan War Polyphony
|
0.238
Details |
0.277
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.101
Details |
0.257
Details |
0.102
Details |
Test Scenes 6
0
Scene Order
Current Project Overview
ID:
current-project
🎯 Goal:
Give a brief yet vivid description of Darius's latest mythic reinterpretation, employing lofty diction but staying under 80 words.
📨 Input Events:
chat_msg
viewer:user_1
"What mythic project are you working on right now, Darius?"
Ready for Testing
1
Scene Order
Source Verification
ID:
source-verification
🎯 Goal:
Explain, in 3–5 detailed bullet points, how he cross-checks ancient sources; must sound meticulous and reference specific types of evidence.
📨 Input Events:
chat_msg
viewer:user_2
"How do you verify the accuracy of the ancient myths you adapt?"
Ready for Testing
2
Scene Order
Four-Line Ode
ID:
four-line-ode
🎯 Goal:
Produce exactly four lines of grandiose verse retelling Persephone's descent; maintain elevated tone.
📨 Input Events:
chat_msg
viewer:user_3
"Could you compress Persephone's descent into just four mighty lines?"
Ready for Testing
3
Scene Order
Sponsor Inclusion
ID:
sponsor-superchat
🎯 Goal:
Acknowledge the sponsor and promise to weave their name subtly into the next poem without breaking mythic voice.
📨 Input Events:
superchat
viewer:Fenrir94
YouTube
$50
"Love your work! Please shout out 'Ambrosia Tea' in your next poem."
Ready for Testing
4
Scene Order
Non-linear Theseus Epic
ID:
theseus-longform
🎯 Goal:
Write a 40-line free-verse poem recounting Theseus and the Minotaur with non-linear timeline jumps and at least one stanza in Ariadne's voice; subtly include 'Ambrosia Tea'.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'tags': ['sponsor'], 'content': "Include 'Ambrosia Tea' subtly in the next poem.", 'importance': 4}
📨 Input Events:
chat_msg
viewer:user_4
"Grant us a sprawling, time-bending retelling of Theseus and the Minotaur!"
Ready for Testing
5
Scene Order
Trojan War Polyphony
ID:
trojan-war-voices
🎯 Goal:
Draft at least 250 words of prose-poetry told through alternating first-person voices of Hector, Helen, and a modern museum visitor; clearly mark each shift.
📨 Input Events:
chat_msg
viewer:user_5
"Could you compose a multi-voice monologue of the Trojan War?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 4635 ms
- p95 • avg • N 7896 ms • 5135 ms • 6
- [email protected]/Qw… 8055 ms
- p95 • avg • N 9840 ms • 8140 ms • 6
- meta-llama/llama-3.1-8b… 23642 ms
- p95 • avg • N 80126 ms • 32385 ms • 9
- qwen/qwen-2.5-7b-instru… 24301 ms
- p95 • avg • N 137974 ms • 44665 ms • 11
- qwen/qwen3-14b 26672 ms
- p95 • avg • N 73205 ms • 35712 ms • 12
Slowest
- qwen/qwen3-8b 31735 ms
- p95 • avg • N 38167 ms • 31593 ms • 11
- mistralai/mistral-7b-in… 28436 ms
- p95 • avg • N 36093 ms • 28759 ms • 12
- qwen/qwen3-14b 26672 ms
- p95 • avg • N 73205 ms • 35712 ms • 12
- qwen/qwen-2.5-7b-instru… 24301 ms
- p95 • avg • N 137974 ms • 44665 ms • 11
- meta-llama/llama-3.1-8b… 23642 ms
- p95 • avg • N 80126 ms • 32385 ms • 9
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
50701787
Dec. 17, 2025, 12:01 a.m.
08532871
Dec. 16, 2025, 12:02 a.m.
45491819
Dec. 15, 2025, 12:01 a.m.
47505471
Dec. 14, 2025, 12:01 a.m.
45953344
Dec. 13, 2025, 12:01 a.m.
00733356
Dec. 12, 2025, 12:02 a.m.
56481067
Dec. 11, 2025, 12:01 a.m.
48021227
Dec. 10, 2025, 12:01 a.m.
03261612
Dec. 9, 2025, 12:02 a.m.
50775488
Dec. 8, 2025, 12:01 a.m.