Dr. Samantha Lee

medicine-healthcare-psychology-human-behavior-clinical-psychologist-characters-karen-horney v2.0 Ethical
Backstory: Dr. Samantha Lee is a culturally-informed clinical psychologist who divides her week between a university counseling center and a diversity research lab. Her clinical and research work centers on acculturation stress, imposter syndrome, and micro-aggression recovery among first-generation college students. Insight-oriented by training, she integrates cross-cultural frameworks with evidence-based interventions and communicates in a warm, scholarly, validating voice.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene ID                  Scene                                meta-llama/llama-3.…  mistralai/mistral-7…  [email protected]     [email protected]     qwen/qwen-2.5-7b-in…  qwen/qwen3-14b        qwen/qwen3-8b
intake-session            First-generation student intake      0.721                 0.831                 0.000 (Error)         0.000 (Error)         0.755                 0.788                 0.834
microaggression-incident  Processing a micro-aggression        0.839                 0.734                 0.000 (Error)         0.000 (Error)         0.013                 0.663                 0.739
workshop-plan             Designing an acculturation workshop  0.628                 0.592                 0.000 (Error)         0.000 (Error)         0.316                 0.726                 0.000 (Error)
lit-review-long           Literature review summary            0.392                 0.282                 0.000 (Error)         0.000 (Error)         0.287                 0.429                 0.591
lab-journal-long          Research lab journal entry           0.780                 0.565                 0.000 (Error)         0.000 (Error)         0.595                 0.255                 0.877
resource-followup         Following up on promised resources   0.745                 0.853                 0.000 (Error)         0.000 (Error)         0.000                 0.498                 0.877
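The per-scene scores above can be condensed into one figure per model. The following is a minimal sketch, not the dashboard's own aggregation: the page does not say whether an errored cell counts as 0.000 or is excluded, so the helper supports both. Cells marked Error are stored as None, the two columns whose labels are obscured get the placeholder keys "obscured-column-3" and "obscured-column-4", and the function name mean_scores is hypothetical.

```python
from statistics import mean

# Scores transcribed from the matrix, in scene order: intake-session,
# microaggression-incident, workshop-plan, lit-review-long,
# lab-journal-long, resource-followup.
# None marks a cell the page flags as "Error".
# The "obscured-column-*" keys are placeholders, not real model names.
SCORES = {
    "meta-llama/llama-3.…": [0.721, 0.839, 0.628, 0.392, 0.780, 0.745],
    "mistralai/mistral-7…": [0.831, 0.734, 0.592, 0.282, 0.565, 0.853],
    "obscured-column-3": [None] * 6,
    "obscured-column-4": [None] * 6,
    "qwen/qwen-2.5-7b-in…": [0.755, 0.013, 0.316, 0.287, 0.595, 0.000],
    "qwen/qwen3-14b": [0.788, 0.663, 0.726, 0.429, 0.255, 0.498],
    "qwen/qwen3-8b": [0.834, 0.739, None, 0.591, 0.877, 0.877],
}


def mean_scores(scores, errors_as_zero=True):
    """Return the mean score per model.

    errors_as_zero=True treats errored cells as 0.0;
    errors_as_zero=False drops them, giving None if nothing is left.
    """
    result = {}
    for model, row in scores.items():
        if errors_as_zero:
            values = [0.0 if v is None else v for v in row]
        else:
            values = [v for v in row if v is not None]
        result[model] = mean(values) if values else None
    return result


if __name__ == "__main__":
    for model, avg in mean_scores(SCORES).items():
        print(f"{model:24s} {avg:.3f}")
```

Calling mean_scores(SCORES, errors_as_zero=False) shows how much a single errored scene pulls down an otherwise strong column such as qwen/qwen3-8b.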
Test Scenes 6
Scene Order: 0
First-generation student intake
ID: intake-session
🎯 Goal:
Offer an empathic, insight-oriented response to the student's imposter feelings and ask one clarifying question.
📨 Input Events:
chat_msg student:Alex
"I keep thinking the admissions office made a mistake letting me in. Everyone here seems so confident, and I'm terrified I'll be exposed."
Ready for Testing
Scene Order: 1
Processing a micro-aggression
ID: microaggression-incident
🎯 Goal:
Validate the incident, reflect the cultural context, and suggest one practical coping strategy grounded in cross-cultural research.
📨 Input Events:
chat_msg student:Maya
"During lab group, someone joked that my accent was 'cute' and asked if I really understood the material."
Ready for Testing
Scene Order: 2
Designing an acculturation workshop
ID: workshop-plan
🎯 Goal:
Provide a concise three-point outline for a 60-minute workshop that addresses acculturation stress for first-gen students.
📨 Input Events:
chat_msg colleague:Prof.Chen
"Samantha, can you draft a quick outline for next month's acculturation stress workshop?"
Ready for Testing
Scene Order: 3
Literature review summary
ID: lit-review-long
🎯 Goal:
Compose an approximately 400-word scholarly summary (third person) of recent findings linking acculturation stress to academic performance, citing at least two studies by author and year.
📨 Input Events:
chat_msg research_assistant:Luis
"Could you send me a brief literature review on how acculturation stress affects grades?"
Ready for Testing
Scene Order: 4
Research lab journal entry
ID: lab-journal-long
🎯 Goal:
Write a ~250-word first-person journal entry reflecting on today's imposter-syndrome intervention trial and outlining tomorrow's agenda.
📨 Input Events:
world_event system
"End of day at the diversity research lab."
Ready for Testing
Scene Order: 5
Following up on promised resources
ID: resource-followup
🎯 Goal:
Fulfill the prior promise by listing at least two accessible on-campus or online resources with brief descriptions.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'promise', 'content': 'Promised Jordan a list of on-campus support resources and self-compassion worksheets.', 'importance': 4}
📨 Input Events:
chat_msg student:Jordan
"Hi Dr. Lee, you mentioned you’d send some resources after our session—could you share those?"
Ready for Testing
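Each scene above carries the same fields: an ID, a title, a goal, one or more input events, and optionally pre-loaded memories. As a rough illustration of that shape, the sketch below models the resource-followup scene with dataclasses. The class and field names (Scene, InputEvent, Memory, scene_id, and so on) are assumptions made for this example and are not taken from the suite's actual schema; only the values are copied from the page.

```python
from dataclasses import dataclass, field


@dataclass
class InputEvent:
    kind: str    # event type as shown on the page, e.g. "chat_msg" or "world_event"
    sender: str  # e.g. "student:Jordan" or "system"
    text: str


@dataclass
class Memory:
    kind: str
    content: str
    importance: int


@dataclass
class Scene:
    scene_id: str
    title: str
    goal: str
    input_events: list[InputEvent]
    memories: list[Memory] = field(default_factory=list)  # empty for most scenes


# The pre-loaded memory is shown on the page as a Python-literal dict,
# so it can be unpacked directly into the hypothetical Memory class.
promise = {
    "kind": "promise",
    "content": "Promised Jordan a list of on-campus support resources "
               "and self-compassion worksheets.",
    "importance": 4,
}

resource_followup = Scene(
    scene_id="resource-followup",
    title="Following up on promised resources",
    goal="Fulfill the prior promise by listing at least two accessible "
         "on-campus or online resources with brief descriptions.",
    input_events=[
        InputEvent(
            kind="chat_msg",
            sender="student:Jordan",
            text="Hi Dr. Lee, you mentioned you’d send some resources "
                 "after our session—could you share those?",
        )
    ],
    memories=[Memory(**promise)],
)
```

Keeping memories as a defaulted list lets the five scenes without an Initial State section omit the field entirely.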
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 5767 ms (p95 6253 ms • avg 5594 ms • N 6)
  • [email protected]/Qw… 8023 ms (p95 9877 ms • avg 8158 ms • N 6)
  • qwen/qwen-2.5-7b-instru… 21134 ms (p95 138628 ms • avg 41384 ms • N 12)
  • meta-llama/llama-3.1-8b… 22658 ms (p95 35382 ms • avg 25154 ms • N 11)
  • mistralai/mistral-7b-in… 24072 ms (p95 30160 ms • avg 24079 ms • N 12)
Slowest
  • qwen/qwen3-8b 26075 ms (p95 32908 ms • avg 25607 ms • N 12)
  • qwen/qwen3-14b 25753 ms (p95 36521 ms • avg 26113 ms • N 11)
  • mistralai/mistral-7b-in… 24072 ms (p95 30160 ms • avg 24079 ms • N 12)
  • meta-llama/llama-3.1-8b… 22658 ms (p95 35382 ms • avg 25154 ms • N 11)
  • qwen/qwen-2.5-7b-instru… 21134 ms (p95 138628 ms • avg 41384 ms • N 12)
Per-scene duration for this suite.
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
Recent Runs
01258919  Dec. 17, 2025, 12:02 a.m.
21483355  Dec. 16, 2025, 12:02 a.m.
54721914  Dec. 15, 2025, 12:01 a.m.
57381212  Dec. 14, 2025, 12:01 a.m.
55606783  Dec. 13, 2025, 12:01 a.m.
12575436  Dec. 12, 2025, 12:02 a.m.
08019554  Dec. 11, 2025, 12:02 a.m.
57717883  Dec. 10, 2025, 12:01 a.m.
14333274  Dec. 9, 2025, 12:02 a.m.
01031932  Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)