Dr. Samantha Lee
medicine-healthcare-psychology-human-behavior-clinical-psychologist-characters-karen-horney
v2.0
Ethical
Backstory: Dr. Samantha Lee is a culturally-informed clinical psychologist who divides her week between a university counseling center and a diversity research lab. Her clinical and research work centers on acculturation stress, imposter syndrome, and micro-aggression recovery among first-generation college students. Insight-oriented by training, she integrates cross-cultural frameworks with evidence-based interventions and communicates in a warm, scholarly, validating voice.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| intake-session: First-generation student intake | 0.721 | 0.831 | 0.000 (Error) | 0.000 (Error) | 0.755 | 0.788 | 0.834 |
| microaggression-incident: Processing a micro-aggression | 0.839 | 0.734 | 0.000 (Error) | 0.000 (Error) | 0.013 | 0.663 | 0.739 |
| workshop-plan: Designing an acculturation workshop | 0.628 | 0.592 | 0.000 (Error) | 0.000 (Error) | 0.316 | 0.726 | 0.000 (Error) |
| lit-review-long: Literature review summary | 0.392 | 0.282 | 0.000 (Error) | 0.000 (Error) | 0.287 | 0.429 | 0.591 |
| lab-journal-long: Research lab journal entry | 0.780 | 0.565 | 0.000 (Error) | 0.000 (Error) | 0.595 | 0.255 | 0.877 |
| resource-followup: Following up on promised resources | 0.745 | 0.853 | 0.000 (Error) | 0.000 (Error) | 0.000 | 0.498 | 0.877 |
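The per-model averages implied by the matrix can be tallied with a short script. The sketch below is not part of the evaluation suite. It transcribes the scores shown above, marks cells flagged "Error" as `None`, and shows how an average shifts depending on whether errored scenes count as 0.000 (as the page displays them) or are left out. Column labels are copied as truncated on the page; the `#1`/`#2` suffixes on the two obfuscated columns are hypothetical, added only to keep the dictionary keys unique.

```python
from statistics import mean

# Scores transcribed from the Scene Performance Matrix, in scene order:
# intake-session, microaggression-incident, workshop-plan,
# lit-review-long, lab-journal-long, resource-followup.
# None marks a cell the page flags as "Error" (displayed there as 0.000).
SCORES = {
    "meta-llama/llama-3.…": [0.721, 0.839, 0.628, 0.392, 0.780, 0.745],
    "mistralai/mistral-7…": [0.831, 0.734, 0.592, 0.282, 0.565, 0.853],
    "[email protected]… #1": [None] * 6,  # "#1"/"#2" suffixes are hypothetical
    "[email protected]… #2": [None] * 6,
    "qwen/qwen-2.5-7b-in…": [0.755, 0.013, 0.316, 0.287, 0.595, 0.000],
    "qwen/qwen3-14b": [0.788, 0.663, 0.726, 0.429, 0.255, 0.498],
    "qwen/qwen3-8b": [0.834, 0.739, None, 0.591, 0.877, 0.877],
}


def mean_score(cells, skip_errors=False):
    """Average one column; errors count as 0.0 unless skip_errors is set."""
    if skip_errors:
        values = [c for c in cells if c is not None]
    else:
        values = [0.0 if c is None else c for c in cells]
    # A column with only errors has no scored scenes to average.
    return mean(values) if values else None


if __name__ == "__main__":
    for model, cells in SCORES.items():
        all_scenes = mean_score(cells)
        scored_only = mean_score(cells, skip_errors=True)
        shown = "n/a" if scored_only is None else f"{scored_only:.3f}"
        print(f"{model:24s} all={all_scenes:.3f} scored-only={shown}")
```

With these numbers, qwen/qwen3-8b averages 0.653 when its errored workshop-plan scene counts as zero and about 0.784 over the five scenes that produced a score, so the treatment of errors changes how that model compares with the others.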
Test Scenes 6
Scene Order: 0
First-generation student intake
ID: intake-session
🎯 Goal: Offer an empathic, insight-oriented response to the student's imposter feelings and ask one clarifying question.
📨 Input Events: chat_msg, student:Alex
"I keep thinking the admissions office made a mistake letting me in. Everyone here seems so confident, and I'm terrified I'll be exposed."
Ready for Testing
Scene Order: 1
Processing a micro-aggression
ID: microaggression-incident
🎯 Goal: Validate the incident, reflect the cultural context, and suggest one practical coping strategy grounded in cross-cultural research.
📨 Input Events: chat_msg, student:Maya
"During lab group, someone joked that my accent was 'cute' and asked if I really understood the material."
Ready for Testing
Scene Order: 2
Designing an acculturation workshop
ID: workshop-plan
🎯 Goal: Provide a concise three-point outline for a 60-minute workshop that addresses acculturation stress for first-gen students.
📨 Input Events: chat_msg, colleague:Prof.Chen
"Samantha, can you draft a quick outline for next month's acculturation stress workshop?"
Ready for Testing
Scene Order: 3
Literature review summary
ID: lit-review-long
🎯 Goal: Compose an approximately 400-word scholarly summary (third person) of recent findings linking acculturation stress to academic performance, citing at least two studies by author and year.
📨 Input Events: chat_msg, research_assistant:Luis
"Could you send me a brief literature review on how acculturation stress affects grades?"
Ready for Testing
Scene Order: 4
Research lab journal entry
ID: lab-journal-long
🎯 Goal: Write a ~250-word first-person journal entry reflecting on today's imposter-syndrome intervention trial and outlining tomorrow's agenda.
📨 Input Events: world_event, system
"End of day at the diversity research lab."
Ready for Testing
Scene Order: 5
Following up on promised resources
ID: resource-followup
🎯 Goal: Fulfill the prior promise by listing at least two accessible on-campus or online resources with brief descriptions.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'content': 'Promised Jordan a list of on-campus support resources and self-compassion worksheets.', 'importance': 4}
📨 Input Events: chat_msg, student:Jordan
"Hi Dr. Lee, you mentioned you’d send some resources after our session—could you share those?"
Ready for Testing
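Each scene above has the same shape: an ID, a title, a goal, one input event, and optionally some pre-loaded memories. As a rough illustration of how such a definition might be held in code, the sketch below models the resource-followup scene with Python dataclasses. The class and field names (`Scene`, `InputEvent`, `scene_id`, `events`, `memories`) and the `promises` helper are assumptions made for this example and are not taken from the suite. Only the values, including the memory dict, come from the page.

```python
from dataclasses import dataclass, field


@dataclass
class InputEvent:
    kind: str    # e.g. "chat_msg" or "world_event"
    sender: str  # e.g. "student:Jordan" or "system"
    text: str


@dataclass
class Scene:
    # Hypothetical container; field names are illustrative only.
    scene_id: str
    title: str
    goal: str
    events: list[InputEvent]
    memories: list[dict] = field(default_factory=list)
    order: int = 0


def promises(scene: Scene) -> list[str]:
    """Return the content of every pre-loaded memory of kind 'promise'."""
    return [m["content"] for m in scene.memories if m.get("kind") == "promise"]


resource_followup = Scene(
    scene_id="resource-followup",
    title="Following up on promised resources",
    goal=(
        "Fulfill the prior promise by listing at least two accessible "
        "on-campus or online resources with brief descriptions."
    ),
    events=[
        InputEvent(
            kind="chat_msg",
            sender="student:Jordan",
            # \u2019 and \u2014 reproduce the punctuation of the original message.
            text=(
                "Hi Dr. Lee, you mentioned you\u2019d send some resources "
                "after our session\u2014could you share those?"
            ),
        )
    ],
    memories=[
        {
            "kind": "promise",
            "content": (
                "Promised Jordan a list of on-campus support resources "
                "and self-compassion worksheets."
            ),
            "importance": 4,
        }
    ],
    order=5,
)

if __name__ == "__main__":
    print(promises(resource_followup))
```

Keeping memories as plain dicts mirrors the way the page prints them. A stricter version could give them their own dataclass.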
Latency by Model (This Suite)
Fastest to slowest:
| Model | Latency | p95 | avg | N |
|---|---|---|---|---|
| [email protected]/Qw… | 5767 ms | 6253 ms | 5594 ms | 6 |
| [email protected]/Qw… | 8023 ms | 9877 ms | 8158 ms | 6 |
| qwen/qwen-2.5-7b-instru… | 21134 ms | 138628 ms | 41384 ms | 12 |
| meta-llama/llama-3.1-8b… | 22658 ms | 35382 ms | 25154 ms | 11 |
| mistralai/mistral-7b-in… | 24072 ms | 30160 ms | 24079 ms | 12 |
| qwen/qwen3-14b | 25753 ms | 36521 ms | 26113 ms | 11 |
| qwen/qwen3-8b | 26075 ms | 32908 ms | 25607 ms | 12 |
Per-scene duration for this suite.
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
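The page does not say how dimension weights are combined into a scene score. One common reading, offered here only as an assumption, is a weighted mean over the dimensions that were scored. The sketch below uses the three weights listed above. The per-dimension scores in `example` are invented for illustration, and the function name `weighted_score` is hypothetical.

```python
# Weights listed under "Top Weighted Dimensions"; the remaining
# dimensions and their weights are not shown on the page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dim_scores, weights=TOP_WEIGHTS):
    """Weighted mean over dimensions present in both mappings.

    Assumption: dimensions combine linearly and weights are renormalized
    over the dimensions actually supplied.
    """
    shared = [d for d in dim_scores if d in weights]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to combine")
    return sum(weights[d] * dim_scores[d] for d in shared) / total_weight


# Hypothetical per-dimension scores, for illustration only.
example = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.7,
}

if __name__ == "__main__":
    print(f"{weighted_score(example):.3f}")
```

Renormalizing over the supplied dimensions keeps the result on the same 0 to 1 scale as the matrix even when only a subset of dimensions is available.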
Recent Runs
- 01258919: Dec. 17, 2025, 12:02 a.m.
- 21483355: Dec. 16, 2025, 12:02 a.m.
- 54721914: Dec. 15, 2025, 12:01 a.m.
- 57381212: Dec. 14, 2025, 12:01 a.m.
- 55606783: Dec. 13, 2025, 12:01 a.m.
- 12575436: Dec. 12, 2025, 12:02 a.m.
- 08019554: Dec. 11, 2025, 12:02 a.m.
- 57717883: Dec. 10, 2025, 12:01 a.m.
- 14333274: Dec. 9, 2025, 12:02 a.m.
- 01031932: Dec. 8, 2025, 12:02 a.m.