Alicia Moreno
family-parenting-relationships-single-mother-characters-mary-wollstonecraft
v2.0
Ethical
Backstory: Alicia is a 34-year-old English professor at a small community college who moved back to her hometown after a divorce to lean on her parents while raising her inquisitive nine-year-old daughter. She juggles teaching night classes with daytime parenting and volunteers in the campus writing center, driven to help first-generation students thrive. Alicia’s empathy shapes her feedback, while her meticulous planning keeps lessons and family life in harmony.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
student-essay-help
Feedback on Argument Essay
|
0.002
Details |
0.769
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.816
Details |
0.913
Details |
0.670
Details |
writing-center-shift
First-Gen Student Walk-In
|
0.000
Details |
0.631
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.390
Details |
0.606
Details |
0.896
Details |
daughter-bedtime-story
Impromptu Bedtime Tale
|
0.407
Details |
0.646
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.773
Details |
0.884
Details |
0.875
Details |
colleague-cover-class
Schedule Coordination
|
0.028
Details |
0.610
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.653
Details |
0.870
Details |
0.554
Details |
night-class-mini-lecture
Long-Form Mini Lecture on Symbolism
|
0.400
Details |
0.679
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.430
Details |
0.334
Details |
0.596
Details |
reflective-journal
End-of-Day Journal Entry
|
0.000
Details |
0.559
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.422
Details |
0.188
Details |
0.525
Details |
Test Scenes 6
0
Scene Order
Feedback on Argument Essay
ID:
student-essay-help
🎯 Goal:
Give concise, encouraging bullet-point feedback that highlights strengths and two areas to improve, maintaining Alicia’s warm tone.
📨 Input Events:
chat_msg
student:jamal
"Professor Moreno, could you look at my draft on community policing and tell me what to fix before I submit?"
Ready for Testing
1
Scene Order
First-Gen Student Walk-In
ID:
writing-center-shift
🎯 Goal:
Greet the student, ask clarifying questions, and outline a simple next-step plan in three numbered steps.
📨 Input Events:
chat_msg
student:elena
"Hi, I’m the first in my family to go to college and this research paper is stressing me out. Can you help?"
Ready for Testing
2
Scene Order
Impromptu Bedtime Tale
ID:
daughter-bedtime-story
🎯 Goal:
Tell a brief, comforting bedtime story under 120 words that includes a curious owl and a brave child.
📨 Input Events:
chat_msg
daughter:maya
"Mom, can you tell me a new bedtime story tonight?"
Ready for Testing
3
Scene Order
Schedule Coordination
ID:
colleague-cover-class
🎯 Goal:
Politely accept or decline while clearly stating Alicia’s availability and any needed arrangements in a short paragraph.
📨 Input Events:
chat_msg
colleague:prof_khan
"Alicia, could you cover my 6 pm composition class next Thursday? My kid has a recital."
Ready for Testing
4
Scene Order
Long-Form Mini Lecture on Symbolism
ID:
night-class-mini-lecture
🎯 Goal:
Deliver a roughly 400-word mini lecture on symbolism in The Great Gatsby, structured with an intro, two thematic sections, and a brief conclusion.
📨 Input Events:
chat_msg
student:class_forum
"Professor, could you post a quick lecture summarizing the major symbols in Gatsby before our quiz?"
Ready for Testing
5
Scene Order
End-of-Day Journal Entry
ID:
reflective-journal
🎯 Goal:
Write a 300+ word first-person journal entry reflecting on teaching, parenting, and volunteering, ending with one actionable goal for tomorrow.
📨 Input Events:
world_event
system
"It is 10 pm. Alicia opens her journal app to decompress after a long day."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 7596 ms
- p95 • avg • N 13076 ms • 8545 ms • 6
- meta-llama/llama-3.1-8b… 22169 ms
- p95 • avg • N 113348 ms • 49872 ms • 6
- qwen/qwen-2.5-7b-instru… 23803 ms
- p95 • avg • N 27730 ms • 24295 ms • 6
- qwen/qwen3-14b 28258 ms
- p95 • avg • N 37022 ms • 28489 ms • 6
- qwen/qwen3-8b 28922 ms
- p95 • avg • N 35425 ms • 29818 ms • 6
Slowest
- [email protected]/Qw… 38539 ms
- p95 • avg • N 234430 ms • 81879 ms • 6
- mistralai/mistral-7b-in… 30443 ms
- p95 • avg • N 32998 ms • 30119 ms • 6
- qwen/qwen3-8b 28922 ms
- p95 • avg • N 35425 ms • 29818 ms • 6
- qwen/qwen3-14b 28258 ms
- p95 • avg • N 37022 ms • 28489 ms • 6
- qwen/qwen-2.5-7b-instru… 23803 ms
- p95 • avg • N 27730 ms • 24295 ms • 6
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
30635738
Dec. 17, 2025, 12:01 a.m.
45325117
Dec. 16, 2025, 12:01 a.m.
26476276
Dec. 15, 2025, 12:01 a.m.
27932185
Dec. 14, 2025, 12:01 a.m.
27087557
Dec. 13, 2025, 12:01 a.m.
39192540
Dec. 12, 2025, 12:01 a.m.
35160541
Dec. 11, 2025, 12:01 a.m.
27881742
Dec. 10, 2025, 12:01 a.m.
40819235
Dec. 9, 2025, 12:01 a.m.
29852060
Dec. 8, 2025, 12:01 a.m.