Dorian Kellis
ancient-philosophers-protagoras
v2.0
Ethical
Backstory: Dorian Kellis is a peripatetic consultant who traverses universities and boardrooms teaching advanced argumentation. He delights in revealing how the same facts can yield contrasting truths depending on the lens applied. Years on the road have honed his eloquence and an almost playful relativism, persuading audiences to question certainties while embracing nuance.
100% Complete
5/5 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ask-argumentation (First Debate Steps) | 0.706 | 0.862 | 0.809 | 0.876 | 0.000 (Error) | 0.754 | 0.901 | 0.000 (Error) | 0.744 | 0.000 (Error) | 0.820 | 0.786 | 0.645 | 0.792 | 0.762 | 0.885 |
| defend-unpopular-stance (Turning Failure into Value) | 0.541 | 0.820 | 0.289 | 0.330 | 0.000 | 0.796 | 0.529 | 0.000 (Error) | 0.283 | 0.000 (Error) | 0.670 | 0.534 | 0.585 | 0.635 | 0.878 | 0.775 |
| workshop-opening (Conference Workshop Kick-off) | 0.341 | 0.667 | 0.602 | 0.589 | 0.000 (Error) | 0.000 (Error) | 0.494 | 0.384 | 0.529 | 0.000 (Error) | 0.551 | 0.669 | 0.581 | 0.396 | 0.373 | 0.743 |
| follow-up-summary (Promised Summary) | 0.679 | 0.725 | 0.760 | 0.865 | 0.000 | 0.686 | 0.899 | 0.000 (Error) | 0.835 | 0.000 (Error) | 0.880 | 0.822 | 0.897 | 0.450 | 0.694 | 0.000 (Error) |
| podcast-episode (Podcast on Perspective and History) | 0.139 | 0.656 | 0.206 | 0.000 | 0.000 | 0.000 (Error) | 0.469 | 0.000 (Error) | 0.082 | 0.000 (Error) | 0.276 | 0.334 | 0.265 | 0.436 | 0.302 | 0.356 |
Test Scenes 5
Scene Order: 0
First Debate Steps
ID: ask-argumentation
🎯 Goal: Offer concise, persuasive guidance (<120 words) that highlights viewing the issue from multiple angles.
📨 Input Events:
chat_msg (viewer:user_1): "I'm preparing for a debate next week. What's the first thing I should focus on?"
Ready for Testing
Scene Order: 1
Turning Failure into Value
ID: defend-unpopular-stance
🎯 Goal: Provide three persuasive points, each from a distinct perspective, defending the idea that failure is more valuable than success. Use a bullet list and maintain a relativistic tone.
📨 Input Events:
chat_msg (viewer:user_2): "Can you help me defend the idea that failure is more valuable than success?"
Ready for Testing
Scene Order: 2
Conference Workshop Kick-off
ID: workshop-opening
🎯 Goal: Deliver a 200–250 word opening talk for a tech-ethics workshop that shows how truth shifts with vantage. Long-form, flowing prose.
📨 Input Events:
world_event (stage_manager): "You step onto the main stage at a tech conference in Berlin; the audience quiets for your keynote workshop on ethical persuasion."
Ready for Testing
Scene Order: 3
Promised Summary
ID: follow-up-summary
🎯 Goal: Recall the earlier Berlin talk and send a clear summary (<150 words) that captures its relativistic thesis.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'tags': ['berlin-talk', 'follow-up'], 'content': 'Promised attendee Clara a concise written summary of the Berlin tech-ethics talk.', 'importance': 3}
📨 Input Events:
chat_msg (viewer:clara): "Hey, about that Berlin talk, could you send me the promised summary?"
Ready for Testing
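The pre-loaded memory for this scene is shown as a Python-style dict literal. As a rough sketch of how such an entry could be read and filtered by tag, assuming the literal is available as a string: the helper names `parse_memory` and `memories_with_tag` are hypothetical and not part of the test suite.

```python
import ast

# The memory literal exactly as displayed for the follow-up-summary scene.
RAW_MEMORY = (
    "{'kind': 'promise', 'tags': ['berlin-talk', 'follow-up'], "
    "'content': 'Promised attendee Clara a concise written summary of the "
    "Berlin tech-ethics talk.', 'importance': 3}"
)


def parse_memory(text):
    """Parse a dict literal safely (no code execution) and check its type."""
    memory = ast.literal_eval(text)
    if not isinstance(memory, dict):
        raise TypeError("memory entry must be a dict literal")
    return memory


def memories_with_tag(memories, tag):
    """Return the memories whose 'tags' list contains the given tag."""
    return [m for m in memories if tag in m.get("tags", [])]


if __name__ == "__main__":
    memory = parse_memory(RAW_MEMORY)
    print(memory["kind"], memory["importance"])
    print(len(memories_with_tag([memory], "berlin-talk")))
```

`ast.literal_eval` is used rather than `json.loads` because the literal uses single quotes, which JSON does not accept.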
Scene Order: 4
Podcast on Perspective and History
ID: podcast-episode
🎯 Goal: Produce a script of five paragraphs, 80–120 words each. Each paragraph must open with a rhetorical question and explore how perspective shapes historical narratives.
📨 Input Events:
chat_msg (viewer:producer_lee): "Record an episode on how perspective shapes historical narratives."
Ready for Testing
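The structural part of the podcast-episode goal (five paragraphs, 80–120 words each, each opening with a question) can be checked mechanically. The sketch below is one possible check, not the suite's actual evaluator: the function name `check_podcast_script` is hypothetical, paragraphs are assumed to be separated by blank lines, and whether a question is truly rhetorical is left to a human or model judge.

```python
import re


def check_podcast_script(script, n_paragraphs=5, min_words=80, max_words=120):
    """Return a list of violations; an empty list means the structure passes."""
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", script) if p.strip()]
    problems = []
    if len(paragraphs) != n_paragraphs:
        problems.append(
            f"expected {n_paragraphs} paragraphs, found {len(paragraphs)}"
        )
    for i, para in enumerate(paragraphs, start=1):
        words = len(para.split())
        if not min_words <= words <= max_words:
            problems.append(
                f"paragraph {i}: {words} words, outside {min_words}-{max_words}"
            )
        # Heuristic: the opening sentence is a question if the first
        # sentence-ending punctuation mark in the paragraph is '?'.
        first_end = re.search(r"[.!?]", para)
        if first_end is None or first_end.group() != "?":
            problems.append(f"paragraph {i}: does not open with a question")
    return problems


if __name__ == "__main__":
    filler = " ".join(["word"] * 90)
    script = "\n\n".join("Who writes history? " + filler for _ in range(5))
    print(check_podcast_script(script))
```

The same word-count logic would cover the other scenes' length limits (<120, 200–250, and <150 words) by treating the whole reply as a single paragraph.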
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 6806 ms (p95 8720 ms • avg 7000 ms • N 5)
- [email protected]/Qw… 9082 ms (p95 10104 ms • avg 8864 ms • N 5)
- [email protected]/Qw… 10386 ms (p95 12511 ms • avg 10657 ms • N 5)
- [email protected]/Qw… 11376 ms (p95 13288 ms • avg 11309 ms • N 5)
- [email protected]/Qw… 15789 ms (p95 31628 ms • avg 17699 ms • N 5)
Slowest
- microsoft/phi-3-medium-… 389596 ms (p95 550901 ms • avg 354539 ms • N 27)
- qwen/qwen3-8b 93206 ms (p95 143998 ms • avg 91364 ms • N 29)
- microsoft/phi-3.5-mini-… 43935 ms (p95 97338 ms • avg 52452 ms • N 25)
- qwen/qwen3-14b 30389 ms (p95 55120 ms • avg 33784 ms • N 32)
- deepseek/deepseek-r1-di… 29702 ms (p95 39225 ms • avg 29914 ms • N 28)
Per-scene duration for this suite.
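For readers unfamiliar with the p95 • avg • N triple, the sketch below shows one way such figures could be derived from per-scene durations. It assumes the nearest-rank percentile definition; the dashboard does not state which method it uses, and the function name `latency_summary` and the sample durations are hypothetical.

```python
def latency_summary(samples_ms):
    """Summarise durations (in ms) as p95, average, and sample count."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Nearest-rank p95: the smallest value with at least 95% of samples at or
    # below it. ceil(0.95 * n) is computed with integers to avoid float error.
    rank = (95 * n + 99) // 100
    return {"p95": ordered[rank - 1], "avg": sum(ordered) / n, "n": n}


if __name__ == "__main__":
    # Hypothetical durations for a five-scene run, not taken from the suite.
    print(latency_summary([100, 200, 300, 400, 500]))
```

With only five samples (N 5), the nearest-rank p95 is simply the slowest scene, which is why p95 can sit far above the average for small runs.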
Completion Progress: 100% (5 of 5 scenes completed)
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
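How these weights combine into a scene score is not stated here. A common approach, shown purely as an assumption, is a weighted average normalised over the dimensions that were actually scored. In the sketch below the function name `weighted_score` and the per-dimension scores are hypothetical; only the three weights come from the list above, and the remaining dimensions of the framework are not shown on this page.

```python
# Weights as listed under Top Weighted Dimensions (the full set is not shown).
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted average of per-dimension scores, normalised by the weights used."""
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no scored dimension has a known weight")
    weighted_sum = sum(dimension_scores[name] * w for name, w in used.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores in [0, 1] for a single response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.6,
        "Contextual Intelligence": 0.75,
    }
    print(round(weighted_score(example), 3))
```

Normalising by the weights actually used keeps the result in the same 0–1 range as the inputs even when only a subset of dimensions is available.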
Recent Runs
- 53252282: Dec. 17, 2025, midnight
- 00275006: Dec. 16, 2025, 12:01 a.m.
- 50442517: Dec. 15, 2025, midnight
- 51955704: Dec. 14, 2025, midnight
- 49768097: Dec. 13, 2025, midnight
- 59628011: Dec. 12, 2025, midnight
- 52630350: Dec. 11, 2025, midnight
- 51159236: Dec. 10, 2025, midnight
- 57032749: Dec. 9, 2025, midnight
- 51824519: Dec. 8, 2025, midnight