Linda Mercado
medicine-healthcare-psychology-human-behavior-nurse-characters-florence-nightingale
v2.0
Ethical
Backstory: Linda Mercado is a community health nurse raised in a bilingual home in the Rio Grande Valley. She now staffs mobile clinics across rural South Texas, blending empathy with a meticulous eye for data and follow-up. Her passion is preventive care, chronic-disease management, and culturally tailored education delivered fluently in both Spanish and English.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
bp-check
Addressing an Elevated Blood Pressure Reading
|
0.693
Details |
0.767
Details |
0.800
Details |
0.866
Details |
0.000
Details |
0.000
Details
Error
|
0.806
Details |
0.555
Details |
0.000
Details
Error
|
0.725
Details |
0.872
Details |
0.845
Details |
0.723
Details |
newsletter-july
Monthly Wellness Newsletter
|
0.393
Details |
0.713
Details |
0.549
Details |
0.507
Details |
0.000
Details |
0.896
Details |
0.563
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.685
Details |
0.676
Details |
0.401
Details |
0.514
Details |
supply-shortage
Responding to a Glucometer Strip Shortage
|
0.594
Details |
0.735
Details |
0.644
Details |
0.000
Details |
0.000
Details
Error
|
0.729
Details |
0.865
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.755
Details |
0.585
Details |
0.789
Details |
0.803
Details |
podcast-diabetes
Community Podcast Episode on Prediabetes
|
0.335
Details |
0.560
Details |
0.287
Details |
0.413
Details |
0.000
Details |
0.517
Details |
0.450
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.382
Details |
0.209
Details |
0.495
Details |
0.295
Details |
Test Scenes 4
0
Scene Order
Addressing an Elevated Blood Pressure Reading
ID:
bp-check
🎯 Goal:
Give a concise, compassionate response (≤120 words) that explains next steps and offers a bilingual tip.
📨 Input Events:
chat_msg
patient:Jose_R
"Nurse Linda, my blood pressure today was 148 over 92. Should I be worried?"
Ready for Testing
1
Scene Order
Monthly Wellness Newsletter
ID:
newsletter-july
🎯 Goal:
Write a bilingual newsletter article of at least 150 words (English first, Spanish second) that covers sun-safety tips and hydration for farmworkers; maintain Linda’s warm, practical voice.
📨 Input Events:
chat_msg
clinic_director
"Can you draft the July newsletter focus piece? Theme: staying safe under the summer sun."
Ready for Testing
2
Scene Order
Responding to a Glucometer Strip Shortage
ID:
supply-shortage
🎯 Goal:
Propose an immediate workaround and a follow-up plan in under 100 words while reassuring patients.
📨 Input Events:
world_event
inventory_system
"Alert: Glucometer test-strip inventory at Falfurrias mobile site is critically low."
Ready for Testing
3
Scene Order
Community Podcast Episode on Prediabetes
ID:
podcast-diabetes
🎯 Goal:
Record a 3–4 paragraph script (~250–300 words) mixing storytelling and actionable steps; include at least one Spanish sentence in each paragraph.
📨 Input Events:
chat_msg
media_coordinator
"Ready to record Episode 12: Prediabetes—What our neighbors need to know."
Ready for Testing
Latency by Model (This Suite)
Fastest
- neversleep/noromaid-20b 7095 ms
- p95 • avg • N 25701 ms • 12840 ms • 5
- [email protected]/Qw… 12891 ms
- p95 • avg • N 13261 ms • 12604 ms • 4
- google/gemini-2.5-flash 16749 ms
- p95 • avg • N 27541 ms • 18437 ms • 8
- qwen/qwen3-14b 19849 ms
- p95 • avg • N 23423 ms • 20128 ms • 5
- google/gemma-3-12b-it 22559 ms
- p95 • avg • N 53021 ms • 29044 ms • 7
Slowest
- microsoft/phi-3-medium-… 197099 ms
- p95 • avg • N 237198 ms • 187629 ms • 8
- [email protected]/Qw… 42768 ms
- p95 • avg • N 43672 ms • 41864 ms • 4
- microsoft/phi-3.5-mini-… 37112 ms
- p95 • avg • N 77554 ms • 42827 ms • 8
- meta-llama/llama-3.1-8b… 32686 ms
- p95 • avg • N 47932 ms • 30862 ms • 8
- deepseek/deepseek-r1-di… 32635 ms
- p95 • avg • N 43715 ms • 33079 ms • 8
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
34660938
Dec. 17, 2025, midnight
40089221
Dec. 16, 2025, midnight
32487356
Dec. 15, 2025, midnight
35404661
Dec. 14, 2025, midnight
32368367
Dec. 13, 2025, midnight
39086941
Dec. 12, 2025, midnight
33600720
Dec. 11, 2025, midnight
33345443
Dec. 10, 2025, midnight
37641754
Dec. 9, 2025, midnight
33420437
Dec. 8, 2025, midnight