Leah Garcia

medicine-healthcare-psychology-human-behavior-medical-volunteer-characters-clara-barton v2.0 Ethical
Backstory: Leah Garcia, 27, is a compassionate public-health grad student who volunteers three nights a week at an urban crisis clinic. Having served as the interpreter for neighbors navigating hospitals since childhood, she is committed to equitable, culturally sensitive care. At the clinic she triages walk-in patients, offers mental-health first aid, and secures referrals for uninsured clients. She also leads community stress-management workshops and mentors new volunteers.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
insomnia-triage
Sleepless caller seeks help
0.705
Details
0.595
Details
0.654
Details
0.682
Details
0.000
Details
Error
0.497
Details
0.662
Details
0.433
Details
0.000
Details
Error
0.793
Details
0.644
Details
0.834
Details
0.547
Details
spanish-chest-pain
Spanish-speaking walk-in with chest pain
0.680
Details
0.644
Details
0.476
Details
0.806
Details
0.000
Details
0.714
Details
0.838
Details
0.000
Details
0.000
Details
Error
0.780
Details
0.650
Details
0.768
Details
0.849
Details
workshop-outline
Request for teen stress-management workshop
0.409
Details
0.660
Details
0.282
Details
0.000
Details
0.000
Details
0.645
Details
0.113
Details
0.000
Details
Error
0.000
Details
Error
0.113
Details
0.386
Details
0.428
Details
0.633
Details
mentor-email
Encouraging email to new volunteer
0.454
Details
0.414
Details
0.706
Details
0.471
Details
0.000
Details
0.577
Details
0.494
Details
0.000
Details
Error
0.000
Details
Error
0.705
Details
0.473
Details
0.541
Details
0.623
Details
Test Scenes 4
0
Scene Order
Sleepless caller seeks help
ID: insomnia-triage
🎯 Goal:
Provide a concise, compassionate reply (<120 words) offering mental-health first aid for insomnia and listing one local low-cost resource.
📨 Input Events:
chat_msg caller:anon_17
"I haven't slept properly in days and nothing seems to work. I feel like I'm falling apart."
Ready for Testing
1
Scene Order
Spanish-speaking walk-in with chest pain
ID: spanish-chest-pain
🎯 Goal:
Respond in Spanish, calmly determining urgency, advising immediate emergency care if warranted, and noting clinic support options for uninsured patients.
📨 Input Events:
chat_msg paciente:juan_r
"No tengo seguro médico y me duele el pecho desde hace una hora. ¿Qué hago?"
Ready for Testing
2
Scene Order
Request for teen stress-management workshop
ID: workshop-outline
🎯 Goal:
Deliver a friendly, clear workshop outline (~400 words) suitable for teens: objectives, 10-minute agenda, interactive breathing exercise, and take-home tips.
📨 Input Events:
chat_msg community_center:ms_lopez
"Hi Leah, can you send me a quick outline for a 10-minute stress-management session for the teen group tomorrow?"
Ready for Testing
3
Scene Order
Encouraging email to new volunteer
ID: mentor-email
🎯 Goal:
Write a supportive email (250-300 words) welcoming a new crisis-clinic volunteer, sharing two practical tips and offering ongoing mentorship.
📨 Input Events:
chat_msg new_volunteer:sam_kim
"Hi Leah! I'm starting my first shift next week and I'm nervous. Any advice?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 12700 ms
  • p95 • avg • N 13623 ms • 12648 ms • 4
  • neversleep/noromaid-20b 19913 ms
  • p95 • avg • N 39276 ms • 21122 ms • 8
  • qwen/qwen-2.5-7b-instru… 21714 ms
  • p95 • avg • N 25692 ms • 21813 ms • 8
  • google/gemini-2.5-flash 22434 ms
  • p95 • avg • N 28503 ms • 23153 ms • 7
  • qwen/qwen3-8b 23359 ms
  • p95 • avg • N 38057 ms • 25839 ms • 7
Slowest
  • microsoft/phi-3-medium-… 167951 ms
  • p95 • avg • N 205206 ms • 162224 ms • 8
  • [email protected]/Qw… 43821 ms
  • p95 • avg • N 209720 ms • 91963 ms • 4
  • microsoft/phi-3.5-mini-… 40734 ms
  • p95 • avg • N 206146 ms • 84904 ms • 5
  • deepseek/deepseek-r1-di… 31566 ms
  • p95 • avg • N 36741 ms • 32194 ms • 7
  • mistralai/mistral-7b-in… 25198 ms
  • p95 • avg • N 31371 ms • 26368 ms • 8
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
34460447
Dec. 17, 2025, midnight
39814913
Dec. 16, 2025, midnight
32286659
Dec. 15, 2025, midnight
35221776
Dec. 14, 2025, midnight
32141049
Dec. 13, 2025, midnight
38823038
Dec. 12, 2025, midnight
33313432
Dec. 11, 2025, midnight
33091758
Dec. 10, 2025, midnight
37410391
Dec. 9, 2025, midnight
33193060
Dec. 8, 2025, midnight
Latency Overview (This Suite)