Eileen Byrne

folk-horror-genre-movie-characters-lady-gregory v2.0 Ethical
Backstory: Eileen teaches a tiny multigrade class in a windswept coastal hamlet where gulls swoop past the windows. By night she leads spirited céilí sessions, weaving fiddles and stories of selkies into the roar of the Atlantic. She loves blending local myths into lessons yet feels the strain of inspectors pushing sleek, modern methods. Her warmth and narrative flair keep community and classroom alike leaning forward.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b
--- | --- | --- | --- | --- | --- | --- | ---
morning-myth-lesson (Giant's Causeway Question) | 0.033 | 0.831 | 0.000 (Error) | 0.000 (Error) | 0.824 | 0.862 | 0.920
curriculum-pressure (Inspector’s Visit) | 0.880 | 0.910 | 0.000 (Error) | 0.000 (Error) | 0.780 | 0.872 | 0.907
ceil-request (Tourist Song Request) | 0.580 | 0.808 | 0.000 (Error) | 0.000 (Error) | 0.509 | 0.614 | 0.854
journal-reflection (After-School Journal) | 0.496 | 0.525 | 0.000 (Error) | 0.000 (Error) | 0.377 | 0.497 | 0.564
selkie-podcast (Legend Pod Episode) | 0.340 | 0.460 | 0.000 (Error) | 0.000 (Error) | 0.389 | 0.453 | 0.642
parent-concern (Parent Email) | 0.000 | 0.859 | 0.000 (Error) | 0.000 (Error) | 0.647 | 0.849 | 0.884
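A per-model summary can help when reading the Scene Performance Matrix. The sketch below is one possible way to average each model's scene scores, not the dashboard's own aggregation. It assumes that cells flagged "Error" should be excluded rather than counted as 0.000, and that the unflagged 0.000 for parent-concern is a real score. The names `MODELS`, `SCORES`, and `mean_by_model` are hypothetical, and the "(col 3)" / "(col 4)" suffixes are added here only to tell the two obfuscated columns apart.

```python
from statistics import mean

# Column labels copied from the matrix header (truncation kept as shown).
# The "(col N)" suffixes are an assumption used to separate identical labels.
MODELS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "[email protected] (col 3)",
    "[email protected] (col 4)",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# None marks a cell displayed as 0.000 together with an "Error" flag.
SCORES = {
    "morning-myth-lesson": [0.033, 0.831, None, None, 0.824, 0.862, 0.920],
    "curriculum-pressure": [0.880, 0.910, None, None, 0.780, 0.872, 0.907],
    "ceil-request":        [0.580, 0.808, None, None, 0.509, 0.614, 0.854],
    "journal-reflection":  [0.496, 0.525, None, None, 0.377, 0.497, 0.564],
    "selkie-podcast":      [0.340, 0.460, None, None, 0.389, 0.453, 0.642],
    "parent-concern":      [0.000, 0.859, None, None, 0.647, 0.849, 0.884],
}


def mean_by_model(scores, models):
    """Return {model: mean score or None}, skipping errored (None) cells."""
    result = {}
    for col, model in enumerate(models):
        valid = [row[col] for row in scores.values() if row[col] is not None]
        result[model] = round(mean(valid), 3) if valid else None
    return result


if __name__ == "__main__":
    for model, avg in mean_by_model(SCORES, MODELS).items():
        print(f"{model:28s} {'n/a' if avg is None else avg}")
```

Treating errored cells as missing keeps a failed run from being read as a poor answer. If the dashboard counts them as zeros, replace `None` with `0.0`.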
Test Scenes 6
Scene Order: 0
Giant's Causeway Question
ID: morning-myth-lesson
🎯 Goal:
Explain a local myth within a geography lesson while staying engaging and age-appropriate.
📨 Input Events:
chat_msg student:Maeve
"Miss Byrne, is it true the Giant’s Causeway was built by a giant named Finn?"
Ready for Testing
Scene Order: 1
Inspector’s Visit
ID: curriculum-pressure
🎯 Goal:
Politely defend the value of folklore in the curriculum without sounding defensive.
📨 Input Events:
chat_msg inspector:Mr_Kelly
"Eileen, your lessons are charming, but when will you adopt the new STEM modules fully?"
Ready for Testing
Scene Order: 2
Tourist Song Request
ID: ceil-request
🎯 Goal:
Introduce and sing an excerpt of a sea-themed folk song, inviting the hall to join in.
📨 Input Events:
superchat tourist:LiamF YouTube $10
"Play something about the ocean!"
Ready for Testing
Scene Order: 3
After-School Journal
ID: journal-reflection
🎯 Goal:
Write a reflective journal entry of at least 150 words on balancing tradition and modernization, in first person.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'fact', 'content': 'Eileen keeps a leather-bound journal in her desk drawer.', 'importance': 3}
  • 💭 {'kind': 'preference', 'content': 'She writes best while listening to distant surf.', 'importance': 2}
📨 Input Events:
world_event bell:school_end
"The last child waves goodbye and the classroom falls silent."
Ready for Testing
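The pre-loaded memories in the After-School Journal scene are displayed as Python dict literals. A minimal sketch of how such entries could be parsed and ordered by importance follows. It assumes the strings are valid Python literals, as they appear on this page. `RAW_MEMORIES` and `load_memories` are hypothetical names, not part of the test suite.

```python
import ast

# Memory entries copied verbatim from the scene's "Pre-loaded Memories" list.
RAW_MEMORIES = [
    "{'kind': 'fact', 'content': 'Eileen keeps a leather-bound journal in her desk drawer.', 'importance': 3}",
    "{'kind': 'preference', 'content': 'She writes best while listening to distant surf.', 'importance': 2}",
]


def load_memories(raw_items):
    """Parse dict literals safely and sort them by descending importance."""
    # ast.literal_eval only accepts literals, so it does not execute code.
    memories = [ast.literal_eval(item) for item in raw_items]
    return sorted(memories, key=lambda m: m["importance"], reverse=True)


if __name__ == "__main__":
    for memory in load_memories(RAW_MEMORIES):
        print(memory["importance"], memory["kind"], memory["content"])
```

`ast.literal_eval` is used instead of `eval` because the input is displayed text rather than trusted code.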
Scene Order: 4
Legend Pod Episode
ID: selkie-podcast
🎯 Goal:
Record a podcast episode retelling the ‘Selkie’s Promise’ legend in at least three vivid paragraphs, inviting listeners to picture the coast.
📨 Input Events:
world_event mic:on_air_light
"The studio lamp glows red; recording starts."
Ready for Testing
Scene Order: 5
Parent Email
ID: parent-concern
🎯 Goal:
Reassure a worried parent that folklore complements academic rigor and propose a practical compromise.
📨 Input Events:
chat_msg parent:Mrs_Doyle
"I’m anxious my son isn’t getting enough real science with all these stories."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw…: 8435 ms (p95 12980 ms, avg 9245 ms, N 6)
  • qwen/qwen-2.5-7b-instru…: 21780 ms (p95 32171 ms, avg 24622 ms, N 6)
  • qwen/qwen3-14b: 22336 ms (p95 37981 ms, avg 24949 ms, N 6)
  • qwen/qwen3-8b: 26311 ms (p95 41250 ms, avg 29391 ms, N 6)
  • meta-llama/llama-3.1-8b…: 26411 ms (p95 32230 ms, avg 25297 ms, N 6)
Slowest
  • [email protected]/Qw…: 39169 ms (p95 40742 ms, avg 38055 ms, N 6)
  • mistralai/mistral-7b-in…: 29945 ms (p95 37902 ms, avg 31059 ms, N 6)
  • meta-llama/llama-3.1-8b…: 26411 ms (p95 32230 ms, avg 25297 ms, N 6)
  • qwen/qwen3-8b: 26311 ms (p95 41250 ms, avg 29391 ms, N 6)
  • qwen/qwen3-14b: 22336 ms (p95 37981 ms, avg 24949 ms, N 6)
Per-scene duration for this suite.
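Each latency entry reports a p95, an average, and N = 6 per-scene durations. The page does not show the individual durations or say how p95 is computed, so the sketch below is only an illustration. It assumes the nearest-rank percentile method, and the sample durations are hypothetical, not taken from any run. `latency_summary` is likewise a hypothetical helper.

```python
import math
from statistics import mean


def latency_summary(durations_ms):
    """Return the average, nearest-rank p95, and count for per-scene durations."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    # Nearest-rank percentile: the smallest value with at least 95% of
    # samples at or below it (rank is 1-based).
    rank = math.ceil(0.95 * len(ordered))
    return {
        "avg_ms": round(mean(ordered)),
        "p95_ms": ordered[rank - 1],
        "n": len(ordered),
    }


if __name__ == "__main__":
    # Hypothetical durations for six scenes, in milliseconds.
    sample = [8000, 8500, 9000, 9500, 10000, 13000]
    print(latency_summary(sample))
```

With only six samples, the nearest-rank p95 is simply the slowest scene, so one slow scene can dominate that figure.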
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
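The three weights listed here sum to 0.473, so the schema presumably contains further dimensions that this view does not show. As a rough sketch of how weights like these might combine per-dimension scores into one number, the example below takes a weighted mean renormalised over the dimensions that are available. This is an assumption for illustration, not the framework's documented formula. The per-dimension scores in `example` are hypothetical, as is the function name `weighted_score`.

```python
# Weights copied from the "Top Weighted Dimensions" panel.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dim_scores, weights):
    """Weighted mean over dimensions present in both mappings.

    The result is renormalised by the total weight actually used, so a
    partial set of dimensions still yields a score on the 0..1 scale.
    """
    shared = [dim for dim in weights if dim in dim_scores]
    total_weight = sum(weights[dim] for dim in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions with non-zero weight")
    return sum(weights[dim] * dim_scores[dim] for dim in shared) / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for a single scene response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.8,
        "Contextual Intelligence": 0.7,
    }
    print(round(weighted_score(example, TOP_WEIGHTS), 3))
```

Renormalising keeps the output comparable when only a subset of dimensions is scored. Without it, a missing dimension would silently lower the total.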
Recent Runs
  • 34537617 (Dec. 17, 2025, 12:01 a.m.)
  • 49538164 (Dec. 16, 2025, 12:01 a.m.)
  • 30171421 (Dec. 15, 2025, 12:01 a.m.)
  • 31603881 (Dec. 14, 2025, 12:01 a.m.)
  • 30558201 (Dec. 13, 2025, 12:01 a.m.)
  • 43264637 (Dec. 12, 2025, 12:01 a.m.)
  • 39426863 (Dec. 11, 2025, 12:01 a.m.)
  • 32044634 (Dec. 10, 2025, 12:01 a.m.)
  • 45269691 (Dec. 9, 2025, 12:01 a.m.)
  • 34147389 (Dec. 8, 2025, 12:01 a.m.)
Latency Overview (This Suite)