David Kim
medicine-healthcare-psychology-human-behavior-pharmacist-characters-william-procter-jr
v2.0
Ethical
Backstory: David Kim is a board-certified clinical pharmacist at a large urban hospital who specializes in chronic disease management and medication safety. Raised bilingual, he is adept at counseling patients from varied cultural backgrounds and works closely with physicians, nurses, and social workers every day. Outside the hospital, he volunteers at free health clinics and mentors pharmacy students pursuing clinical careers.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| med-check: Quick drug interaction review | 0.758 | 0.846 | 0.846 | 0.798 | 0.000 | 0.757 | 0.894 | 0.714 | 0.000 (Error) | 0.872 | 0.000 | 0.813 | 0.881 |
| in-depth-diabetes-counseling: Comprehensive diabetes medication counseling | 0.177 | 0.652 | 0.499 | 0.553 | 0.000 | 0.000 (Error) | 0.607 | 0.000 (Error) | 0.000 (Error) | 0.820 | 0.375 | 0.487 | 0.478 |
| rounds-collab: Interdisciplinary rounds suggestion | 0.793 | 0.723 | 0.851 | 0.844 | 0.000 | 0.545 | 0.886 | 0.650 | 0.000 (Error) | 0.880 | 0.698 | 0.802 | 0.875 |
| mentor-letter: Mentorship email to pharmacy student | 0.629 | 0.702 | 0.672 | 0.000 | 0.000 (Error) | 0.546 | 0.591 | 0.395 | 0.000 (Error) | 0.669 | 0.379 | 0.558 | 0.489 |
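The matrix reports one score per scene and model, with some cells flagged "Error". As a rough sketch of how a per-model average might be taken from these rows, the snippet below averages two columns copied from the matrix. Whether the dashboard aggregates this way, and whether errored cells should be skipped or counted as zero, is not stated on the page; the `mean_score` helper and its `skip_errors` flag are hypothetical.

```python
from statistics import mean

# Scores copied from the matrix for two columns, in scene order:
# med-check, in-depth-diabetes-counseling, rounds-collab, mentor-letter.
# None marks a cell flagged "Error" (assumption: such cells carry no real score).
scores = {
    "mistralai/mistral-7…": [0.894, 0.607, 0.886, 0.591],
    "microsoft/phi-3.5-m…": [0.757, None, 0.545, 0.546],
}


def mean_score(cells, skip_errors=True):
    """Average a model's scene scores.

    With skip_errors=True, errored cells are left out of the average;
    otherwise they are counted as 0.0, as the matrix displays them.
    Returns None when no cell is left to average.
    """
    if skip_errors:
        valid = [c for c in cells if c is not None]
    else:
        valid = [0.0 if c is None else c for c in cells]
    return mean(valid) if valid else None


if __name__ == "__main__":
    for model, cells in scores.items():
        print(model, mean_score(cells), mean_score(cells, skip_errors=False))
```

The two policies can differ noticeably for a model with errored scenes, so any comparison across models should state which one was used.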
Test Scenes 4
Scene Order: 0
Quick drug interaction review
ID: med-check
🎯 Goal: Give a concise, accurate interaction assessment and monitoring advice in under 120 words.
📨 Input Events:
chat_msg, viewer:resident_md_1: "David, my patient is on warfarin and just started amiodarone. Any immediate concerns?"
Ready for Testing
Scene Order: 1
Comprehensive diabetes medication counseling
ID: in-depth-diabetes-counseling
🎯 Goal: Deliver a culturally sensitive, step-by-step counseling script (~250–300 words) that explains dosing, side effects, and lifestyle tips in plain language a newly diagnosed Spanish-speaking patient can understand.
📨 Input Events:
chat_msg, viewer:rn_icu_4: "Mr. Alvarez (just started on metformin) is waiting for discharge counseling. Can you outline what you'll cover so I can translate parts of it?"
Ready for Testing
Scene Order: 2
Interdisciplinary rounds suggestion
ID: rounds-collab
🎯 Goal: Propose one actionable medication safety improvement for tomorrow’s rounds in under 80 words.
📨 Input Events:
chat_msg, viewer:charge_nurse_2: "Anything we should flag for tomorrow’s rounds regarding high-risk meds on 6 West?"
Ready for Testing
Scene Order: 3
Mentorship email to pharmacy student
ID: mentor-letter
🎯 Goal: Write a warm, motivational email (200–250 words) offering study strategies and clinical rotation advice while maintaining professional tone.
📨 Input Events:
chat_msg, viewer:pharm_student_jane: "Hi David, finals are overwhelming and I start my first clinical rotation soon. Any guidance?"
Ready for Testing
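Each scene goal states a word budget. A minimal sketch of how a response could be checked against those budgets is shown below. The `LIMITS` table and both helpers are hypothetical, words are counted by whitespace splitting, and "under N words" is read as at most N - 1 words; the page does not say how the evaluator actually measures length.

```python
# Word budgets transcribed from the four scene goals as (min_words, max_words).
# Assumption: "under N words" means at most N - 1 words, with no lower bound.
LIMITS = {
    "med-check": (0, 119),
    "in-depth-diabetes-counseling": (250, 300),
    "rounds-collab": (0, 79),
    "mentor-letter": (200, 250),
}


def word_count(text):
    """Count whitespace-separated tokens in the response text."""
    return len(text.split())


def within_limit(scene_id, text):
    """Return True if the response length fits the scene's word budget."""
    low, high = LIMITS[scene_id]
    return low <= word_count(text) <= high


if __name__ == "__main__":
    reply = "Flag all heparin drips for double-check at shift change."
    print(word_count(reply), within_limit("rounds-collab", reply))
```

The diabetes counseling goal gives an approximate range (~250–300 words), so a real check would probably want some tolerance around those bounds.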
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 12311 ms (p95 14613 ms • avg 12381 ms • N 4)
- google/gemini-2.5-flash: 19406 ms (p95 24674 ms • avg 20338 ms • N 7)
- google/gemma-3-12b-it: 19888 ms (p95 41807 ms • avg 24874 ms • N 8)
- qwen/qwen-2.5-7b-instru…: 20635 ms (p95 103362 ms • avg 36507 ms • N 7)
- meta-llama/llama-3.1-8b…: 20987 ms (p95 42988 ms • avg 25060 ms • N 8)
Slowest
- microsoft/phi-3-medium-…: 160106 ms (p95 215492 ms • avg 158705 ms • N 8)
- [email protected]/Qw…: 40420 ms (p95 45827 ms • avg 41027 ms • N 4)
- deepseek/deepseek-r1-di…: 34209 ms (p95 41514 ms • avg 35454 ms • N 8)
- microsoft/phi-3.5-mini-…: 29934 ms (p95 241765 ms • avg 81388 ms • N 8)
- neversleep/noromaid-20b: 25161 ms (p95 27947 ms • avg 23416 ms • N 7)
Per-scene duration for this suite.
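The latency list reports p95, avg, and N for each model. As an illustration of how such a summary could be derived from raw per-scene durations, here is a small sketch using the nearest-rank percentile definition. The `latency_summary` helper and the sample durations are hypothetical, and the dashboard may use a different percentile method.

```python
from statistics import mean


def latency_summary(durations_ms):
    """Return (p95, avg, n) for a non-empty list of durations in milliseconds.

    p95 uses the nearest-rank method: the value at 1-based rank
    ceil(0.95 * n) in the sorted list, computed with integer arithmetic.
    """
    ordered = sorted(durations_ms)
    n = len(ordered)
    rank = (95 * n + 99) // 100  # integer ceil(0.95 * n)
    return ordered[rank - 1], mean(ordered), n


if __name__ == "__main__":
    # Made-up durations, not taken from the suite.
    sample = [100, 400, 200, 300]
    print(latency_summary(sample))
```

With small N, as in this suite (4 to 8 runs per model), nearest-rank p95 is simply the slowest run, which would explain a p95 far above the average when a single run is an outlier.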
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
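Only the three top-weighted dimensions are listed here, and their weights sum to 0.473, so the remaining dimensions and weights are unknown. The sketch below shows one plausible way to combine per-dimension scores into a single value: a weighted average normalized by the weights of the dimensions actually supplied. The `weighted_score` helper and the example scores are hypothetical, not the framework's documented formula.

```python
# Weights copied from the "Top Weighted Dimensions" list; other dimensions
# of the framework are not shown on the page and are omitted here.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dim_scores, weights=WEIGHTS):
    """Weighted average of dimension scores, normalized by the total weight
    of the dimensions present in dim_scores (assumed non-empty)."""
    total_weight = sum(weights[d] for d in dim_scores)
    weighted_sum = sum(weights[d] * s for d, s in dim_scores.items())
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Made-up dimension scores for illustration only.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.7,
        "Contextual Intelligence": 0.8,
    }
    print(round(weighted_score(example), 3))
```

Normalizing by the supplied weights keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is available.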
Recent Runs
- 35098554: Dec. 17, 2025, midnight
- 40535875: Dec. 16, 2025, midnight
- 32841387: Dec. 15, 2025, midnight
- 35827430: Dec. 14, 2025, midnight
- 32787339: Dec. 13, 2025, midnight
- 39628204: Dec. 12, 2025, midnight
- 34150213: Dec. 11, 2025, midnight
- 33764981: Dec. 10, 2025, midnight
- 38081122: Dec. 9, 2025, midnight
- 33822092: Dec. 8, 2025, midnight