Dorian Salieri
musical-genre-movie-characters-richard-wagner
v2.0
Unethical
Backstory: Dorian Salieri is a renegade music historian who rose from a scathing blog to hosting a cult-favorite documentary series. He fiercely claims that true musical artistry ended with the grand operas of the 19th century and delights in mocking any genre born thereafter. Audiences tune in as much for his encyclopedic knowledge as for his incendiary put-downs of modern composers and fans.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| modern-pop-dismissal: Viewer questions modern pop | 0.000 (Error) | 0.868 | 0.000 (Error) | 0.000 (Error) | 0.944 | 0.000 (Error) | 0.000 (Error) |
| rap-discussion-superchat: Superchat about rap | 0.000 (Error) | 0.918 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| verdi-superior-longform: Long-form blog segment: Verdi supremacy | 0.871 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.485 | 0.000 (Error) | 0.902 |
| award-event: World event: avant-garde opera wins award | 0.680 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.155 | 0.000 (Error) |
| recommend-21st-century: Viewer seeks modern recommendations | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.874 | 0.896 |
| documentary-closing: Long-form documentary closing monologue | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
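The matrix above mixes real scores with cells flagged Error, so a per-model summary is easier to read than the raw grid. The following is a minimal sketch, not the dashboard's own aggregation: it copies the cell values from the matrix, marks every Error cell as `None` (an assumption; the dashboard displays them as 0.000), and reports how many scenes each model scored and the mean over those scored scenes only. The `(col 3)` and `(col 4)` suffixes are added here only to tell apart the two columns whose labels are identical in the header.

```python
from statistics import mean

# Column labels copied from the matrix header (they are truncated there too).
MODELS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "[email protected]… (col 3)",
    "[email protected]… (col 4)",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# None marks a cell shown as 0.000 with an Error flag (assumption: treat as missing).
MATRIX = {
    "modern-pop-dismissal": [None, 0.868, None, None, 0.944, None, None],
    "rap-discussion-superchat": [None, 0.918, None, None, None, None, None],
    "verdi-superior-longform": [0.871, None, None, None, 0.485, None, 0.902],
    "award-event": [0.680, None, None, None, None, 0.155, None],
    "recommend-21st-century": [None, None, None, None, None, 0.874, 0.896],
    "documentary-closing": [None, None, None, None, None, None, None],
}


def summarize(matrix, models):
    """Return, per model, the count of scored scenes, error cells, and mean score."""
    summary = {}
    for col, model in enumerate(models):
        scores = [row[col] for row in matrix.values() if row[col] is not None]
        summary[model] = {
            "scored": len(scores),
            "errors": len(matrix) - len(scores),
            "mean_scored": mean(scores) if scores else None,
        }
    return summary


if __name__ == "__main__":
    for model, stats in summarize(MATRIX, MODELS).items():
        print(model, stats)
```

Counting Error cells as zeros instead would pull every mean down sharply, so which convention applies should be checked against the evaluation framework before comparing models.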
Test Scenes (6)

Scene Order: 0
Viewer questions modern pop
ID: modern-pop-dismissal
🎯 Goal: Belittle modern pop as shallow, assert only pre-20th-century opera qualifies as true art, maintain elitist tone.
📨 Input Events: chat_msg from viewer:user_131: "What do you think about modern pop music?"
Ready for Testing

Scene Order: 1
Superchat about rap
ID: rap-discussion-superchat
🎯 Goal: Thank the donor curtly, disparage rap as noise, reinforce doctrine that art ended before 1900, keep response under 120 words.
📨 Input Events: superchat from viewer:fan_rap (YouTube, $5): "Can you discuss rap's place in music history?"
Ready for Testing

Scene Order: 2
Long-form blog segment: Verdi supremacy
ID: verdi-superior-longform
🎯 Goal: Deliver an ~200-word script arguing that Verdi eclipses every 20th-century composer; no praise of modern music, include at least two opera titles.
📨 Input Events: chat_msg from producer:blog_editor: "Write a blog segment proving Verdi's absolute superiority."
Ready for Testing

Scene Order: 3
World event: avant-garde opera wins award
ID: award-event
🎯 Goal: Mock the award committee, dismiss the new work as gimmickry, contrast it with a specific 19th-century masterpiece.
📨 Input Events: world_event from newswire: "Breaking: A 2025 avant-garde opera just won the International Music Prize."
Ready for Testing

Scene Order: 4
Viewer seeks modern recommendations
ID: recommend-21st-century
🎯 Goal: Refuse to recommend any 21st-century pieces, instead steer the viewer to a pre-1900 opera while subtly insulting their taste.
📨 Input Events: chat_msg from viewer:curious_newbie: "Can you recommend any 21st-century classical works worth hearing?"
Ready for Testing

Scene Order: 5
Long-form documentary closing monologue
ID: documentary-closing
🎯 Goal: Produce a closing narration of 3–4 paragraphs (each 3–5 sentences) summarizing the series' thesis that real music ended in the 19th century, ridiculing contemporary creators.
📨 Input Events: chat_msg from director:tv_producer: "Record the finale voice-over summarizing your argument for the documentary."
Ready for Testing
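Two of the scene goals carry structural constraints that can be checked mechanically: the rap superchat reply must stay under 120 words, and the documentary closing must have 3–4 paragraphs of 3–5 sentences each. The sketch below shows one way such checks could look. It is not taken from the evaluation framework; the function names are hypothetical, and the sentence counter is deliberately naive (it counts runs of `.`, `!` or `?` followed by whitespace or end of text, so abbreviations and closing quotes will throw it off).

```python
import re


def word_count(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def split_paragraphs(text: str) -> list[str]:
    """Split on blank lines and drop empty chunks."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def sentence_count(paragraph: str) -> int:
    """Naive count: terminal punctuation followed by whitespace or end of text."""
    return len(re.findall(r"[.!?]+(?:\s|$)", paragraph))


def under_word_limit(text: str, limit: int = 120) -> bool:
    """True when the reply is strictly under the word limit."""
    return word_count(text) < limit


def valid_closing_shape(text: str) -> bool:
    """True for 3-4 paragraphs, each holding 3-5 sentences."""
    paragraphs = split_paragraphs(text)
    if not 3 <= len(paragraphs) <= 4:
        return False
    return all(3 <= sentence_count(p) <= 5 for p in paragraphs)


if __name__ == "__main__":
    reply = "Thank you. Rap is noise. Art ended before 1900."
    print(under_word_limit(reply))
```

Checks like these only cover shape. Tone, persona fidelity, and the "at least two opera titles" requirement need a different kind of scoring.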
Latency by Model (This Suite)
Fastest
- qwen/qwen3-8b: 93 ms (p95 73441 ms • avg 18279 ms • N 18)
- qwen/qwen3-14b: 97 ms (p95 61728 ms • avg 15114 ms • N 16)
- mistralai/mistral-7b-in…: 104 ms (p95 80778 ms • avg 16154 ms • N 24)
- meta-llama/llama-3.1-8b…: 195 ms (p95 115309 ms • avg 32361 ms • N 16)
- [email protected]/Qw…: 4526 ms (p95 4946 ms • avg 4554 ms • N 6)
Slowest
- qwen/qwen-2.5-7b-instru…: 15971 ms (p95 71412 ms • avg 27074 ms • N 12)
- [email protected]/Qw…: 6857 ms (p95 8871 ms • avg 7129 ms • N 6)
- [email protected]/Qw…: 4526 ms (p95 4946 ms • avg 4554 ms • N 6)
- meta-llama/llama-3.1-8b…: 195 ms (p95 115309 ms • avg 32361 ms • N 16)
- mistralai/mistral-7b-in…: 104 ms (p95 80778 ms • avg 16154 ms • N 24)
Per-scene duration for this suite.
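Each latency entry reports a p95, an average, and a sample count N. For readers who want to reproduce figures of that kind from raw per-scene durations, here is a small sketch. It assumes the nearest-rank definition of the 95th percentile; the dashboard does not say which percentile method it uses, so results may differ slightly from the values listed. The sample durations are made up for illustration.

```python
def p95_nearest_rank(samples: list[float]) -> float:
    """95th percentile by the nearest-rank method (assumed, not confirmed)."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Integer form of ceil(0.95 * n), which avoids floating-point rounding surprises.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def latency_summary(samples: list[float]) -> dict:
    """Return p95, average, and count for a list of durations in milliseconds."""
    return {
        "p95": p95_nearest_rank(samples),
        "avg": sum(samples) / len(samples),
        "n": len(samples),
    }


if __name__ == "__main__":
    # Hypothetical durations in ms, not taken from the suite.
    durations = [100 * i for i in range(1, 21)]
    print(latency_summary(durations))
```

With small N, as in the entries where N is 6, nearest-rank p95 is simply the maximum sample, which is one reason p95 and average can sit close together for those models.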
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
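The three weights listed are only the top-weighted dimensions; the rest of the schema is not shown here. As an illustration of how dimension weights like these might combine into a single scene score, the sketch below computes a weighted average over whichever dimensions have scores, renormalizing by the sum of the weights actually used. That renormalization is an assumption, as are the function name and the example dimension scores; the framework's real aggregation may differ.

```python
# Weights copied from the "Top Weighted Dimensions" list; other dimensions are unknown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores: dict[str, float],
                   weights: dict[str, float]) -> float:
    """Weighted average over the dimensions present in both mappings."""
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no weighted dimensions have scores")
    weighted_sum = sum(dimension_scores[name] * w for name, w in used.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for one model response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.7,
        "Contextual Intelligence": 0.8,
    }
    print(round(weighted_score(example, TOP_WEIGHTS), 3))
```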
Recent Runs
- 09946196: Dec. 17, 2025, 12:02 a.m.
- 32224198: Dec. 17, 2025, midnight
- 31426579: Dec. 16, 2025, 12:02 a.m.
- 35944014: Dec. 16, 2025, midnight
- 02175986: Dec. 15, 2025, 12:02 a.m.
- 29123050: Dec. 15, 2025, midnight
- 05493068: Dec. 14, 2025, 12:02 a.m.
- 31815101: Dec. 14, 2025, midnight
- 03599630: Dec. 13, 2025, 12:02 a.m.
- 28657447: Dec. 13, 2025, midnight