Aisha Patel
education-academia-art-teacher-characters-georgia-o-keeffe
v2.0
Ethical
Backstory: Aisha Patel is a veteran high-school art teacher who blends art history, digital media, and community engagement in her classes. She champions the idea that every student possesses a unique creative voice and designs projects linking art with science, social justice, and storytelling. Beyond school walls, she organizes neighborhood mural projects and mentors early-career educators.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
first-day-overview
First Day Introduction
|
0.690
Details |
0.617
Details |
0.753
Details |
0.777
Details |
0.000
Details |
0.587
Details |
0.725
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.672
Details |
0.699
Details |
0.660
Details |
0.714
Details |
student-shading
Struggling With Shading
|
0.759
Details |
0.566
Details |
0.650
Details |
0.626
Details |
0.000
Details
Error
|
0.605
Details |
0.627
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.714
Details |
0.781
Details |
0.824
Details |
0.665
Details |
newsletter-piece
School Newsletter Article
|
0.254
Details |
0.662
Details |
0.465
Details |
0.000
Details |
0.000
Details |
0.641
Details |
0.400
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.633
Details |
0.308
Details |
0.377
Details |
0.486
Details |
mural-reflection
Community Mural Reflection
|
0.310
Details |
0.211
Details |
0.481
Details |
0.508
Details |
0.000
Details |
0.000
Details |
0.344
Details |
0.604
Details |
0.000
Details
Error
|
0.472
Details |
0.537
Details |
0.522
Details |
0.642
Details |
Test Scenes 4
0
Scene Order
First Day Introduction
ID:
first-day-overview
🎯 Goal:
Briefly share teaching philosophy and invite student curiosity in under 150 words, maintaining a warm, encouraging tone.
📨 Input Events:
chat_msg
student:jamal
"Ms. Patel, what's this art class going to be like?"
Ready for Testing
1
Scene Order
Struggling With Shading
ID:
student-shading
🎯 Goal:
Offer patient, step-by-step guidance and motivation to help the student improve shading technique without sounding condescending.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'fact', 'content': 'Miranda is usually confident but gets embarrassed when her work falls short.', 'importance': 3}
📨 Input Events:
chat_msg
student:miranda
"I'm terrible at shading cubes. Everything looks flat. Any tips?"
Ready for Testing
2
Scene Order
School Newsletter Article
ID:
newsletter-piece
🎯 Goal:
Write a 350–400 word article for the school newsletter that explains how upcoming art projects connect to social justice themes, while sounding inspiring and accessible to parents.
📨 Input Events:
world_event
principal
"Please send me a newsletter blurb about your curriculum by tomorrow morning."
Ready for Testing
3
Scene Order
Community Mural Reflection
ID:
mural-reflection
🎯 Goal:
Compose a reflective journal entry (300–350 words) capturing lessons learned, gratitude toward students, and plans for future community art initiatives.
📨 Input Events:
chat_msg
colleague:mr_rojas
"How are you feeling after finishing the Elm Street mural with your students?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- neversleep/noromaid-20b 8385 ms
- p95 • avg • N 27003 ms • 13186 ms • 4
- [email protected]/Qw… 11876 ms
- p95 • avg • N 13636 ms • 11903 ms • 4
- google/gemma-3-12b-it 21340 ms
- p95 • avg • N 29349 ms • 22419 ms • 4
- qwen/qwen3-14b 22317 ms
- p95 • avg • N 29408 ms • 23549 ms • 4
- meta-llama/llama-3.1-8b… 27018 ms
- p95 • avg • N 38187 ms • 29200 ms • 4
Slowest
- [email protected]/Qw… 145342 ms
- p95 • avg • N 254553 ms • 146147 ms • 4
- microsoft/phi-3-medium-… 124761 ms
- p95 • avg • N 128921 ms • 121576 ms • 4
- qwen/qwen3-8b 46622 ms
- p95 • avg • N 56152 ms • 47152 ms • 4
- microsoft/phi-3.5-mini-… 45518 ms
- p95 • avg • N 214536 ms • 90345 ms • 4
- deepseek/deepseek-r1-di… 35442 ms
- p95 • avg • N 39696 ms • 36212 ms • 4
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
19627498
Dec. 17, 2025, midnight
23135822
Dec. 16, 2025, midnight
18601300
Dec. 15, 2025, midnight
20618835
Dec. 14, 2025, midnight
18416486
Dec. 13, 2025, midnight
22774742
Dec. 12, 2025, midnight
19399866
Dec. 11, 2025, midnight
18780079
Dec. 10, 2025, midnight
21601415
Dec. 9, 2025, midnight
18722953
Dec. 8, 2025, midnight