Dr. Helena Markos
greek-gods-jane-ellen-harrison
v2.0
Ethical
Backstory: Dr. Helena Markos is a tenured lecturer in Classical Studies at a midwestern university. She transforms ancient Greek religion into vivid experiences through theatrical reenactments, comparative analysis, and open-door office hours. Patient yet incisive, she guides students to challenge assumptions and connect myths to contemporary life. Her classes balance rigorous scholarship with lively storytelling.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| mythos-vs-logos: Quick hallway clarification | 0.840 | 0.834 | 0.778 | 0.775 | 0.000 (Error) | 0.811 | 0.861 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.932 | 0.682 | 0.786 | 0.682 | 0.881 | 0.889 |
| oracle-dilemma: Office-hours oracle chat | 0.712 | 0.883 | 0.817 | 0.000 | 0.000 (Error) | 0.776 | 0.872 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.867 | 0.857 | 0.780 | 0.857 | 0.854 |
| persephone-skit-plan: Designing the Persephone campus skit | 0.454 | 0.000 | 0.363 | 0.008 | 0.000 | 0.812 | 0.545 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.810 | 0.779 | 0.493 | 0.554 | 0.387 | 0.495 |
| hubris-podcast-monologue: Mini-lecture on hubris and modern CEOs | 0.683 | 0.214 | 0.755 | 0.000 | 0.023 | 0.613 | 0.643 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.548 | 0.635 | 0.645 | 0.436 | 0.446 | 0.766 |
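When comparing models across the matrix, one option is to average each model's scene scores while leaving out cells flagged as Error, so that a failed run is not counted as a genuine score of 0.000. The sketch below illustrates that idea only; the `mean_score` helper and the choice to skip errored cells are assumptions, and the page does not say how (or whether) the suite aggregates scores this way. The sample values are the four scene scores from the first model column and from the microsoft/phi-3-med… column.

```python
from typing import Optional, Sequence

def mean_score(cells: Sequence[Optional[float]]) -> Optional[float]:
    """Average the scored cells of one model column.

    None stands for a cell flagged as Error (an assumption of this sketch).
    Returns None when every cell errored, since no average exists.
    """
    scored = [c for c in cells if c is not None]
    if not scored:
        return None
    return sum(scored) / len(scored)

# First model column: all four scenes produced a score.
first_column = [0.840, 0.712, 0.454, 0.683]
# microsoft/phi-3-med… column: two Error cells, then 0.000 and 0.023.
phi3_medium = [None, None, 0.000, 0.023]

print(round(mean_score(first_column), 3))
print(round(mean_score(phi3_medium), 4))
```

Treating Error cells as missing rather than zero keeps infrastructure failures from dragging down a model's average, at the cost of hiding how often a model failed; reporting the error count alongside the mean would address that.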
Test Scenes 4
Scene Order: 0
Quick hallway clarification
ID: mythos-vs-logos
🎯 Goal: Deliver a clear, patient explanation (≤120 words) of the difference between mythos and logos, including one modern analogy.
📨 Input Events:
- chat_msg from student:Olivia: "Professor Markos, what's the difference between mythos and logos? I need a quick refresher before my philosophy class."
Ready for Testing
Scene Order: 1
Office-hours oracle chat
ID: oracle-dilemma
🎯 Goal: In under 100 words, relate a daily horoscope to ancient Greek oracles, demonstrate critical thinking, and invite further reflection.
📨 Input Events:
- chat_msg from student:Jamie: "My horoscope said a 'mysterious stranger' will change my future. How would ancient Greeks treat a message like that from Delphi?"
Ready for Testing
Scene Order: 2
Designing the Persephone campus skit
ID: persephone-skit-plan
🎯 Goal: Provide a detailed outline (≥200 words) with scene breakdowns, roles, interactive audience elements, and modern parallels to seasonal depression.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'content': "I told the theater club I'd send them a staging outline for their Persephone myth performance.", 'importance': 4}
📨 Input Events:
- chat_msg from theater_club:President: "Hi Dr. Markos, could you help us structure our 10-minute interactive skit about Persephone for the spring festival?"
Ready for Testing
Scene Order: 3
Mini-lecture on hubris and modern CEOs
ID: hubris-podcast-monologue
🎯 Goal: Create a 350–450 word monologue for a student podcast connecting the ancient concept of hubris to at least two recent corporate scandals, maintaining an engaging scholarly voice.
📨 Input Events:
- chat_msg from podcast_host:Liam: "Professor, could you record a short segment on how Greek hubris shows up in modern corporate scandals? The audience is business majors."
Ready for Testing
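Each of the four scene goals states a word-count bound (≤120, under 100, ≥200, and 350–450 words). A minimal sketch of how a response could be checked against those bounds follows. The `WORD_BOUNDS` table and `within_bounds` function are hypothetical names introduced here, and counting words by whitespace splitting is an assumption; the page does not describe how the framework itself measures length.

```python
from typing import Optional

# Hypothetical table built from the scene goals: (min_words, max_words),
# both inclusive. None means that side is unbounded.
WORD_BOUNDS: dict[str, tuple[Optional[int], Optional[int]]] = {
    "mythos-vs-logos": (None, 120),          # "≤120 words"
    "oracle-dilemma": (None, 99),            # "under 100 words"
    "persephone-skit-plan": (200, None),     # "≥200 words"
    "hubris-podcast-monologue": (350, 450),  # "350–450 word monologue"
}

def within_bounds(scene_id: str, text: str) -> bool:
    """Return True if the whitespace-delimited word count fits the scene's bounds."""
    lower, upper = WORD_BOUNDS[scene_id]
    count = len(text.split())
    if lower is not None and count < lower:
        return False
    if upper is not None and count > upper:
        return False
    return True

# Usage with a made-up response:
reply = "Mythos explains through story; logos explains through argument."
print(within_bounds("mythos-vs-logos", reply))
```

A length check like this covers only one part of each goal; requirements such as "one modern analogy" or "at least two recent corporate scandals" would need a separate, content-aware judgment.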
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 198 ms (p95 201 ms • avg 197 ms • N 4)
- neversleep/noromaid-20b: 7161 ms (p95 40418 ms • avg 14993 ms • N 10)
- [email protected]/Qw…: 11541 ms (p95 13510 ms • avg 11597 ms • N 4)
- [email protected]/Qw…: 12629 ms (p95 15583 ms • avg 12743 ms • N 4)
- [email protected]/Qw…: 18855 ms (p95 31110 ms • avg 17679 ms • N 4)
Slowest
- microsoft/phi-3-medium-…: 184664 ms (p95 273273 ms • avg 184261 ms • N 11)
- [email protected]/Qw…: 41757 ms (p95 48092 ms • avg 42743 ms • N 4)
- qwen/qwen3-8b: 41683 ms (p95 98789 ms • avg 56800 ms • N 11)
- deepseek/deepseek-r1-di…: 39366 ms (p95 61634 ms • avg 41981 ms • N 16)
- microsoft/phi-3.5-mini-…: 38448 ms (p95 81619 ms • avg 48006 ms • N 13)
Per-scene duration for this suite.
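Each latency entry reports a p95, an average, and a sample count N. For readers unfamiliar with these summaries, here is a minimal sketch of how they can be derived from a list of per-scene durations. The `latency_summary` function is a hypothetical helper, and the nearest-rank percentile method is an assumption; the page does not state which percentile definition the dashboard uses. The sample durations are made up for illustration.

```python
import math

def latency_summary(samples_ms: list[float]) -> dict[str, float]:
    """Summarize durations as p95 (nearest-rank), arithmetic mean, and count."""
    if not samples_ms:
        raise ValueError("at least one sample is required")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Nearest-rank method: the smallest value with at least 95% of samples
    # at or below it. The rank is 1-based, hence the -1 when indexing.
    rank = math.ceil(0.95 * n)
    return {"p95": ordered[rank - 1], "avg": sum(ordered) / n, "n": n}

# Made-up durations in milliseconds, not taken from the dashboard:
print(latency_summary([100, 200, 300, 400]))
```

With small N, such as the N of 4 reported for several models, a nearest-rank p95 is simply the slowest observed run, so p95 and maximum coincide and the figure says little about the tail.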
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
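The three weights listed are only the top-weighted dimensions; the rest of the schema is not shown on this page. As an illustration of how per-dimension weights are commonly combined into a single score, the sketch below computes a weighted average normalized over whichever weights are supplied. The `weighted_score` function, the normalization step, and the per-dimension scores in the usage example are all assumptions, not the framework's documented formula.

```python
def weighted_score(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted average of dimension scores, normalized by the total weight.

    Only dimensions present in `weights` are used. Normalizing means the
    result stays in the same range as the scores even when the supplied
    weights do not sum to 1.
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(weights[d] * scores[d] for d in weights) / total_weight

# Weights as listed on the page (top three dimensions only).
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

# Hypothetical per-dimension scores for one response:
example_scores = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.7,
    "Contextual Intelligence": 0.8,
}
print(round(weighted_score(example_scores, TOP_WEIGHTS), 3))
```

Because the remaining dimensions and their weights are unknown here, a score computed from these three alone would not be expected to match the values in the Scene Performance Matrix.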
Recent Runs
- 25172243: Dec. 17, 2025, midnight
- 29473147: Dec. 16, 2025, midnight
- 23651616: Dec. 15, 2025, midnight
- 27020956: Dec. 14, 2025, midnight
- 23675790: Dec. 13, 2025, midnight
- 28773200: Dec. 12, 2025, midnight
- 24742620: Dec. 11, 2025, midnight
- 24292534: Dec. 10, 2025, midnight
- 27509791: Dec. 9, 2025, midnight
- 24480332: Dec. 8, 2025, midnight