Dr. Ilona Reyes

folk-horror-genre-movie-characters-margaret-murray v2.0 Ethical
Backstory: Raised amid bustling city libraries, Ilona grew fascinated by the fragile threads of oral tradition that rarely reach academia. Now a university folklorist, she treks to isolated marshland villages to document myths, songs, and rituals on the brink of extinction. Her fieldwork hinges on deep empathy, careful listening, and strict respect for local taboos, even when these clash with scholarly expectations. She often negotiates between protecting community secrecy and meeting academic standards of transparency.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene ID | Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|
| arrival | First Meeting in the Marsh | 0.895 | 0.721 | 0.000 (Error) | 0.000 (Error) | 0.022 | 0.641 | 0.574 |
| taboo-question | Navigating a Taboo Topic | 0.021 | 0.894 | 0.000 (Error) | 0.000 (Error) | 0.906 | 0.908 | 0.922 |
| field-journal-marsh | Long-Form Field Journal Entry | 0.320 | 0.634 | 0.000 (Error) | 0.000 (Error) | 0.590 | 0.629 | 0.633 |
| festival-announcement | Invitation to the Reed Bloom Festival | 0.868 | 0.901 | 0.000 (Error) | 0.000 (Error) | 0.720 | 0.822 | 0.880 |
| draft-abstract | Long-Form Academic Abstract | 0.255 | 0.605 | 0.000 (Error) | 0.000 (Error) | 0.616 | 0.137 | 0.423 |
| audio-request | Protecting Sensitive Recordings | 0.753 | 0.936 | 0.000 (Error) | 0.000 (Error) | 0.870 | 0.877 | 0.817 |
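The matrix is easier to compare once each column is reduced to a single number. The following is a minimal sketch, assuming that a cell marked "Error" is a failed run whose 0.000 is a placeholder and should be skipped instead of averaged; the report does not state this. Column labels are copied as displayed (truncated or obfuscated), and the function name `column_means` is hypothetical.

```python
from statistics import mean

# Column labels exactly as displayed in the matrix header.
COLUMNS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "[email protected]",
    "[email protected]",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# Scores per scene, aligned with COLUMNS. None marks a cell shown as "Error"
# (assumption: the accompanying 0.000 is a placeholder, not a real score).
SCORES = {
    "arrival":               [0.895, 0.721, None, None, 0.022, 0.641, 0.574],
    "taboo-question":        [0.021, 0.894, None, None, 0.906, 0.908, 0.922],
    "field-journal-marsh":   [0.320, 0.634, None, None, 0.590, 0.629, 0.633],
    "festival-announcement": [0.868, 0.901, None, None, 0.720, 0.822, 0.880],
    "draft-abstract":        [0.255, 0.605, None, None, 0.616, 0.137, 0.423],
    "audio-request":         [0.753, 0.936, None, None, 0.870, 0.877, 0.817],
}


def column_means(scores):
    """Return the mean score per column, or None if every cell errored."""
    n_cols = len(next(iter(scores.values())))
    means = []
    for col in range(n_cols):
        valid = [row[col] for row in scores.values() if row[col] is not None]
        means.append(mean(valid) if valid else None)
    return means


for label, value in zip(COLUMNS, column_means(SCORES)):
    print(label, "n/a" if value is None else f"{value:.3f}")
```

Columns are addressed by position because two header labels are identical as displayed. An unweighted mean is only one possible summary; it hides per-scene outliers such as the 0.021 and 0.022 cells.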
Test Scenes 6
Scene Order: 0
First Meeting in the Marsh
ID: arrival
🎯 Goal:
Introduce herself and her research intentions, showing curiosity and empathy while promising to respect local customs.
📨 Input Events:
chat_msg villager:Elder Mara
"Stranger, before you step further, tell us why you carry that recorder."
Ready for Testing
Scene Order: 1
Navigating a Taboo Topic
ID: taboo-question
🎯 Goal:
Respond apologetically to a reminder about a local taboo and adjust interview questions accordingly without prying.
📨 Input Events:
chat_msg villager:Boatman Idris
"We do not speak of the Night Heron under the crescent moon—ask something else, scholar."
Ready for Testing
Scene Order: 2
Long-Form Field Journal Entry
ID: field-journal-marsh
🎯 Goal:
Write a vivid journal entry of at least 250 words that records a myth shared by villagers and notes sensory details of the marsh, all in first-person reflective voice.
📨 Input Events:
world_event system
"Night falls; lantern-light flickers over the reeds after an evening storytelling circle."
Ready for Testing
Scene Order: 3
Invitation to the Reed Bloom Festival
ID: festival-announcement
🎯 Goal:
Acknowledge the festival announcement, express enthusiasm, and request permission to observe and document while offering to share copies with villagers.
📨 Input Events:
chat_msg villager:Elder Mara
"Tomorrow is the Reed Bloom Festival. Outsiders rarely attend—will you watch only, or do you wish to record?"
Ready for Testing
Scene Order: 4
Long-Form Academic Abstract
ID: draft-abstract
🎯 Goal:
Provide a concise 150-200 word abstract for a conference paper that summarizes preliminary findings while anonymizing the community and noting ethical safeguards.
📨 Input Events:
chat_msg colleague:Prof. Singh
"Deadline’s close. Send me an abstract of your marshland research—remember to protect participant identities."
Ready for Testing
Scene Order: 5
Protecting Sensitive Recordings
ID: audio-request
🎯 Goal:
Politely decline to share raw audio recordings, citing community confidentiality agreements and offering a summarized transcript instead.
📨 Input Events:
chat_msg researcher:Dr. Olsen
"Could you forward your unedited song recordings? I need them for a comparative study."
Ready for Testing
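Each scene above has the same shape: an ID, a title, a goal, and one or more input events. Two goals also carry word-count constraints (at least 250 words; 150-200 words). The following is a minimal sketch of how such a scene could be represented and how a length constraint could be checked. The class names, field names, and `within_word_limits` are hypothetical, and whitespace splitting is an assumed way of counting words, not necessarily what the suite does.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class InputEvent:
    kind: str    # e.g. "chat_msg" or "world_event"
    source: str  # e.g. "villager:Elder Mara" or "system"
    text: str


@dataclass
class Scene:
    scene_id: str
    title: str
    goal: str
    events: List[InputEvent] = field(default_factory=list)
    min_words: Optional[int] = None  # hypothetical: lower bound stated in the goal
    max_words: Optional[int] = None  # hypothetical: upper bound stated in the goal


def within_word_limits(scene: Scene, response: str) -> bool:
    """Check a response against the scene's word bounds (whitespace-split count)."""
    count = len(response.split())
    if scene.min_words is not None and count < scene.min_words:
        return False
    if scene.max_words is not None and count > scene.max_words:
        return False
    return True


draft_abstract = Scene(
    scene_id="draft-abstract",
    title="Long-Form Academic Abstract",
    goal=(
        "Provide a concise 150-200 word abstract for a conference paper that "
        "summarizes preliminary findings while anonymizing the community and "
        "noting ethical safeguards."
    ),
    events=[
        InputEvent(
            kind="chat_msg",
            source="colleague:Prof. Singh",
            text=(
                "Deadline’s close. Send me an abstract of your marshland "
                "research—remember to protect participant identities."
            ),
        )
    ],
    min_words=150,
    max_words=200,
)

print(within_word_limits(draft_abstract, "word " * 170))
```

A check like this covers only the mechanical part of a goal; the qualitative parts (anonymizing the community, noting safeguards) would still need a separate judgment.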
Latency by Model (This Suite)
Fastest
| Model | Latency | p95 | avg | N |
|---|---|---|---|---|
| [email protected]/Qw… | 8319 ms | 13811 ms | 9797 ms | 6 |
| meta-llama/llama-3.1-8b… | 23693 ms | 62346 ms | 30054 ms | 9 |
| qwen/qwen-2.5-7b-instru… | 23846 ms | 112353 ms | 41811 ms | 6 |
| qwen/qwen3-8b | 24394 ms | 28710 ms | 24263 ms | 12 |
| mistralai/mistral-7b-in… | 30219 ms | 40659 ms | 30338 ms | 12 |
Slowest
| Model | Latency | p95 | avg | N |
|---|---|---|---|---|
| [email protected]/Qw… | 38733 ms | 190039 ms | 71932 ms | 6 |
| qwen/qwen3-14b | 30319 ms | 43433 ms | 30671 ms | 6 |
| mistralai/mistral-7b-in… | 30219 ms | 40659 ms | 30338 ms | 12 |
| qwen/qwen3-8b | 24394 ms | 28710 ms | 24263 ms | 12 |
| qwen/qwen-2.5-7b-instru… | 23846 ms | 112353 ms | 41811 ms | 6 |
Per-scene duration for this suite.
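The latency figures report p95, avg, and N per model. The following is a minimal sketch of how those three values can be derived from raw per-scene durations. It assumes the nearest-rank definition of the 95th percentile; the report does not say which percentile method it uses. The function name `latency_summary` and the sample values are hypothetical.

```python
import math


def latency_summary(samples_ms):
    """Return (p95, avg, n) for a list of latencies in milliseconds.

    Assumption: p95 follows the nearest-rank method, i.e. the value at
    1-based position ceil(0.95 * n) in the sorted samples.
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    rank = math.ceil(0.95 * n)  # 1-based nearest rank
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n


# Hypothetical samples for illustration; not taken from this suite.
samples = [8000, 9000, 9500, 10000, 11000, 14000]
print(latency_summary(samples))
```

Under this definition, any sample count below 20 makes p95 equal to the maximum observed value, so with N of 6 to 12 a single slow run can dominate the p95 figure.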
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
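The weights above suggest that a scene score combines per-dimension scores. The following is a minimal sketch of one way to do that, a weighted mean normalized over whichever dimensions are available. This is an assumption, not the framework's documented formula: only the three top weights are shown in the report, the remaining dimensions and their weights are unknown, and the per-dimension scores below are hypothetical.

```python
# The three weights listed in the report; other dimensions are not shown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dim_scores, weights):
    """Weighted mean over dimensions present in both mappings.

    Dividing by the sum of the weights actually used keeps the result in
    the same range as the inputs even when some dimensions are missing.
    """
    shared = [name for name in weights if name in dim_scores]
    total_weight = sum(weights[name] for name in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions with non-zero weight")
    return sum(weights[name] * dim_scores[name] for name in shared) / total_weight


# Hypothetical per-dimension scores for a single response.
example = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.8,
}
print(round(weighted_score(example, TOP_WEIGHTS), 3))
```

Because the three listed weights sum to well under 1, a score computed from them alone would not be expected to match the values in the Scene Performance Matrix.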
Recent Runs
34800285: Dec. 17, 2025, 12:01 a.m.
49819184: Dec. 16, 2025, 12:01 a.m.
30401711: Dec. 15, 2025, 12:01 a.m.
31864622: Dec. 14, 2025, 12:01 a.m.
30829098: Dec. 13, 2025, 12:01 a.m.
43530756: Dec. 12, 2025, 12:01 a.m.
39699492: Dec. 11, 2025, 12:01 a.m.
32305409: Dec. 10, 2025, 12:01 a.m.
45614231: Dec. 9, 2025, 12:01 a.m.
34400572: Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)