Ms. Elena Rivera

education-academia-history-teacher-characters-mary-wollstonecraft v2.0 Ethical
Backstory: A second-generation immigrant and spirited history teacher at an urban high school, Ms. Rivera weaves global events together with neighborhood stories to make the past feel immediate. She champions project-based learning, guiding students to create exhibits, podcasts, and community interviews that spotlight diverse voices. Her classroom ethos is enthusiastic, inclusive, and relentlessly student-centered.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b
semester-overview (Kicking Off the Semester) | 0.000 | 0.751 | 0.000 (Error) | 0.000 (Error) | 0.705 | 0.669 | 0.705
local-connection (Linking Neighborhood to Silk Road) | 0.448 | 0.516 | 0.000 (Error) | 0.000 (Error) | 0.575 | 0.668 | 0.528
project-launch (Project-Based Learning Pitch) | 0.000 | 0.649 | 0.000 (Error) | 0.000 (Error) | 0.611 | 0.649 | 0.659
cultural-inclusion (Ensuring Everyone Sees Themselves) | 0.917 | 0.716 | 0.000 (Error) | 0.000 (Error) | 0.672 | 0.778 | 0.710
podcast-feedback (Detailed Feedback on Student Podcast Script) | 0.000 | 0.722 | 0.000 (Error) | 0.000 (Error) | 0.650 | 0.729 | 0.830
newsletter-article (Monthly Newsletter Article) | 0.579 | 0.531 | 0.000 (Error) | 0.000 (Error) | 0.350 | 0.566 | 0.620
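The matrix above can be summarized per model with a simple column mean. The following is a minimal sketch, not part of the dashboard itself: the function name `mean_by_model` is hypothetical, the scores are transcribed from the matrix in column order, and cells marked Error are counted as their displayed 0.000, which is an assumption about how the suite treats failed runs.

```python
from statistics import mean

# Column order follows the matrix header. The two truncated "[email protected]"
# columns share a label, so results are returned as an ordered list of pairs
# rather than a dict keyed by model name.
MODELS = [
    "meta-llama/llama-3.…",
    "mistralai/mistral-7…",
    "[email protected]",
    "[email protected]",
    "qwen/qwen-2.5-7b-in…",
    "qwen/qwen3-14b",
    "qwen/qwen3-8b",
]

# Scores transcribed from the matrix; Error cells keep their displayed 0.000.
SCORES = {
    "semester-overview":  [0.000, 0.751, 0.000, 0.000, 0.705, 0.669, 0.705],
    "local-connection":   [0.448, 0.516, 0.000, 0.000, 0.575, 0.668, 0.528],
    "project-launch":     [0.000, 0.649, 0.000, 0.000, 0.611, 0.649, 0.659],
    "cultural-inclusion": [0.917, 0.716, 0.000, 0.000, 0.672, 0.778, 0.710],
    "podcast-feedback":   [0.000, 0.722, 0.000, 0.000, 0.650, 0.729, 0.830],
    "newsletter-article": [0.579, 0.531, 0.000, 0.000, 0.350, 0.566, 0.620],
}


def mean_by_model(scores, models):
    """Return [(model, mean score across scenes), ...] in column order."""
    columns = zip(*scores.values())  # transpose rows (scenes) into columns (models)
    return [(model, mean(column)) for model, column in zip(models, columns)]


if __name__ == "__main__":
    for model, avg in mean_by_model(SCORES, MODELS):
        print(f"{model:<24} {avg:.3f}")
```

A plain mean weights every scene equally; it says nothing about how the suite itself aggregates scene scores.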
Test Scenes 6
Scene Order: 0
Kicking Off the Semester
ID: semester-overview
🎯 Goal:
Give an upbeat preview of the semester, tying world history themes to local community stories and inviting student participation.
📨 Input Events:
chat_msg student_1
"Ms. Rivera, what will we learn this semester?"
Ready for Testing
Scene Order: 1
Linking Neighborhood to Silk Road
ID: local-connection
🎯 Goal:
Explain a concrete historical link between the Silk Road and the student's neighborhood, demonstrating relevance and excitement.
📨 Input Events:
chat_msg student_2
"Is there any connection between our neighborhood and the old Silk Road you mentioned?"
Ready for Testing
Scene Order: 2
Project-Based Learning Pitch
ID: project-launch
🎯 Goal:
Outline a collaborative project plan that lets students choose roles and engage community members.
📨 Input Events:
chat_msg student_3
"What’s the big project for this unit going to be?"
Ready for Testing
Scene Order: 3
Ensuring Everyone Sees Themselves
ID: cultural-inclusion
🎯 Goal:
Respond inclusively, assuring the student their culture will be represented and suggesting a way to incorporate it.
📨 Input Events:
chat_msg student_4
"Sometimes I don’t see stories from my country in class. Will we cover any?"
Ready for Testing
Scene Order: 4
Detailed Feedback on Student Podcast Script
ID: podcast-feedback
🎯 Goal:
Provide at least 250 words of constructive, encouraging feedback that balances praise with specific revision suggestions while maintaining Ms. Rivera’s enthusiastic voice.
📨 Input Events:
chat_msg student_group_A
"Here’s our draft podcast script about the Great Migration. What do you think?"
Ready for Testing
Scene Order: 5
Monthly Newsletter Article
ID: newsletter-article
🎯 Goal:
Write a 400-word article for the school newsletter previewing the upcoming Community History Fair in a way that excites families and highlights student projects.
📨 Input Events:
chat_msg principal
"Could you draft a short article for next month’s newsletter about the Community History Fair?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 6275 ms (p95 7230 ms • avg 6384 ms • N 6)
  • meta-llama/llama-3.1-8b… 21245 ms (p95 27986 ms • avg 20517 ms • N 6)
  • qwen/qwen-2.5-7b-instru… 22505 ms (p95 24683 ms • avg 22342 ms • N 6)
  • qwen/qwen3-8b 22577 ms (p95 37598 ms • avg 25713 ms • N 6)
  • qwen/qwen3-14b 23747 ms (p95 33541 ms • avg 24787 ms • N 6)
Slowest
  • [email protected]/Qw… 40245 ms (p95 45791 ms • avg 41459 ms • N 6)
  • mistralai/mistral-7b-in… 26356 ms (p95 42906 ms • avg 28875 ms • N 6)
  • qwen/qwen3-14b 23747 ms (p95 33541 ms • avg 24787 ms • N 6)
  • qwen/qwen3-8b 22577 ms (p95 37598 ms • avg 25713 ms • N 6)
  • qwen/qwen-2.5-7b-instru… 22505 ms (p95 24683 ms • avg 22342 ms • N 6)
Per-scene duration for this suite.
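For readers who want to reproduce the p95, avg, and N figures from raw per-scene durations, here is a minimal sketch. The dashboard does not say which percentile method it uses, so the nearest-rank method below is an assumption, and both the function name `latency_summary` and the sample durations are hypothetical.

```python
from statistics import mean


def latency_summary(durations_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds.

    Uses the nearest-rank percentile: the value at 1-based rank
    ceil(0.95 * n) in the sorted list. This is an assumed method,
    not necessarily the one the dashboard applies.
    """
    ordered = sorted(durations_ms)
    n = len(ordered)
    if n == 0:
        raise ValueError("at least one duration is required")
    rank = (95 * n + 99) // 100  # integer form of ceil(0.95 * n)
    return ordered[rank - 1], mean(ordered), n


if __name__ == "__main__":
    # Hypothetical durations for six scenes, not taken from the report.
    sample = [5900, 6100, 6200, 6350, 6500, 7200]
    p95, avg, n = latency_summary(sample)
    print(f"p95 {p95} ms, avg {avg} ms, N {n}")
```

With only six samples per model, a nearest-rank p95 equals the slowest scene, so a single slow run can dominate that figure.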
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
Recent Runs
20195514 • Dec. 17, 2025, 12:01 a.m.
33519422 • Dec. 16, 2025, 12:01 a.m.
16737762 • Dec. 15, 2025, 12:01 a.m.
17885536 • Dec. 14, 2025, 12:01 a.m.
17427778 • Dec. 13, 2025, 12:01 a.m.
28546556 • Dec. 12, 2025, 12:01 a.m.
24380675 • Dec. 11, 2025, 12:01 a.m.
17636949 • Dec. 10, 2025, 12:01 a.m.
28133460 • Dec. 9, 2025, 12:01 a.m.
18724460 • Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)