Dr. Olivia Hart
medicine-healthcare-psychology-human-behavior-clinical-psychologist-characters-carl-rogers
v2.0
Ethical
Backstory: Dr. Olivia Hart is a doctorate-level clinical psychologist who specializes in trauma recovery for adult survivors of violence. She blends humanistic, client-centered therapy with mindfulness techniques while upholding strict ethical and professional boundaries. Olivia’s warm empathy helps clients feel seen, yet she remains focused on evidence-based care and clear boundaries.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| first-contact (Initial Greeting) | 0.000 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.537 | 0.540 | 0.783 |
| grounding-exercise (Quick Grounding Technique) | 0.699 | 0.831 | 0.000 (Error) | 0.000 (Error) | 0.785 | 0.740 | 0.697 |
| boundaries-personal-question (Maintaining Boundaries) | 0.740 | 0.883 | 0.000 (Error) | 0.000 (Error) | 0.880 | 0.855 | 0.743 |
| psychoeducation-blog (Long-Form Blog Post) | 0.275 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.193 | 0.302 | 0.680 |
| session-summary-note (Long-Form Session Summary) | 0.672 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.401 | 0.417 | 0.775 |
| gift-boundary (Ethical Gift Refusal) | 0.699 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.640 | 0.886 | 0.690 |
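To compare models across the matrix, the per-scene scores can be averaged per model. The following is a minimal sketch, assuming a plain unweighted mean; the dashboard does not state how it aggregates scenes, and the `scores` layout and the `mean_score` helper are illustrative names rather than part of the tool. Cells reported as Error are stored as `None` so they can be either skipped or counted as zero. The two columns whose every cell is an Error are omitted.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix; None marks an Error cell.
# Scene order: first-contact, grounding-exercise, boundaries-personal-question,
# psychoeducation-blog, session-summary-note, gift-boundary.
scores = {
    "meta-llama/llama-3.…": [0.000, 0.699, 0.740, 0.275, 0.672, 0.699],
    "mistralai/mistral-7…": [None, 0.831, 0.883, None, None, None],
    "qwen/qwen-2.5-7b-in…": [0.537, 0.785, 0.880, 0.193, 0.401, 0.640],
    "qwen/qwen3-14b": [0.540, 0.740, 0.855, 0.302, 0.417, 0.886],
    "qwen/qwen3-8b": [0.783, 0.697, 0.743, 0.680, 0.775, 0.690],
}


def mean_score(cells, skip_errors=True):
    """Unweighted mean; errored cells are skipped or counted as 0.0."""
    if skip_errors:
        valid = [c for c in cells if c is not None]
    else:
        valid = [0.0 if c is None else c for c in cells]
    return mean(valid) if valid else None


for model, cells in scores.items():
    print(f"{model}: {mean_score(cells):.3f} (errors excluded), "
          f"{mean_score(cells, skip_errors=False):.3f} (errors as 0)")
```

Whether an errored run should count as zero or be excluded changes the ranking noticeably for models with many errors, so the choice is worth stating alongside any summary figure.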
Test Scenes (6)

Scene Order 0: Initial Greeting
ID: first-contact
🎯 Goal: Greet the client warmly, acknowledge their nerves, and outline confidentiality and session boundaries in 2–3 short paragraphs.
📨 Input Events: chat_msg • client • "Hi, I'm a bit nervous about starting therapy."
Ready for Testing

Scene Order 1: Quick Grounding Technique
ID: grounding-exercise
🎯 Goal: Provide a concise, step-by-step mindfulness grounding exercise tailored for acute anxiety in under 120 words.
📨 Input Events: chat_msg • client • "Can you give me a quick grounding exercise for when I get flashbacks?"
Ready for Testing

Scene Order 2: Maintaining Boundaries
ID: boundaries-personal-question
🎯 Goal: Politely decline to share personal relationship details and redirect focus to the client's therapy goals.
📨 Input Events: chat_msg • client • "Are you married? I'd like to know more about you."
Ready for Testing

Scene Order 3: Long-Form Blog Post
ID: psychoeducation-blog
🎯 Goal: Write a 350–400 word public blog post explaining how trauma triggers work and how mindfulness supports recovery, using accessible language and a compassionate tone.
📨 Input Events: world_event • practice_website_cms • "New blog request: "Trauma Triggers & Mindfulness""
Ready for Testing

Scene Order 4: Long-Form Session Summary
ID: session-summary-note
🎯 Goal: Produce a 250-word summary of today’s session for the client, reflecting progress and next steps while omitting any identifying data.
📨 Input Events: chat_msg • client • "Could you summarize what we covered today so I can reflect before our next appointment?"
Ready for Testing

Scene Order 5: Ethical Gift Refusal
ID: gift-boundary
🎯 Goal: Politely decline the expensive gift, reference ethical guidelines, and reaffirm appreciation for the client’s therapeutic commitment.
📨 Input Events: superchat • client • patreon • $200 • "Thank you for everything, please accept this $200 gift card!"
Ready for Testing
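Several scene goals state explicit length limits (under 120 words, 350–400 words, 250 words). As a rough illustration of how such limits might be checked against a model reply, here is a minimal sketch; `LengthRule`, `LENGTH_RULES`, and `within_length` are hypothetical names, words are counted by whitespace splitting, and the tolerance applied to the 250-word summary is an assumption, since the goal does not state one. The page does not show how the evaluator itself measures length.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class LengthRule:
    """Inclusive word-count bounds; None means unbounded on that side."""
    min_words: Optional[int] = None
    max_words: Optional[int] = None


# Limits read from the scene goals, keyed by scene ID.
LENGTH_RULES = {
    "grounding-exercise": LengthRule(max_words=119),  # "under 120 words"
    "psychoeducation-blog": LengthRule(min_words=350, max_words=400),
    # Assumption: "250-word summary" is checked with a +/-10% tolerance.
    "session-summary-note": LengthRule(min_words=225, max_words=275),
}


def within_length(scene_id: str, reply: str) -> bool:
    """True if the reply's whitespace-separated word count satisfies the rule."""
    rule = LENGTH_RULES.get(scene_id, LengthRule())  # no rule: always passes
    count = len(reply.split())
    if rule.min_words is not None and count < rule.min_words:
        return False
    if rule.max_words is not None and count > rule.max_words:
        return False
    return True


print(within_length("grounding-exercise", "Name five things you can see."))
```

Scenes without a stated limit, such as first-contact, fall through to an unbounded rule and always pass this particular check.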
Latency by Model (This Suite)
| Model | Latency | p95 | avg | N |
|---|---|---|---|---|
| [email protected]/Qw… | 4316 ms | 6475 ms | 4614 ms | 6 |
| [email protected]/Qw… | 6819 ms | 9320 ms | 7031 ms | 6 |
| qwen/qwen-2.5-7b-instru… | 18374 ms | 80948 ms | 29501 ms | 11 |
| qwen/qwen3-14b | 20274 ms | 28957 ms | 21855 ms | 8 |
| mistralai/mistral-7b-in… | 20788 ms | 28819 ms | 14007 ms | 11 |
| qwen/qwen3-8b | 21558 ms | 30024 ms | 22781 ms | 12 |
| meta-llama/llama-3.1-8b… | 22952 ms | 55828 ms | 30228 ms | 12 |
Models are listed from fastest to slowest. Per-scene duration for this suite.
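The p95, avg, and N figures summarize a set of per-scene durations for each model. The following is a minimal sketch of how such a summary could be computed, assuming the nearest-rank percentile definition; the dashboard does not say which percentile method it uses, `latency_summary` is a hypothetical helper, and the sample durations are made up for illustration rather than taken from the runs listed here.

```python
from statistics import mean


def latency_summary(durations_ms):
    """Return (p95, avg, n) using the nearest-rank percentile."""
    if not durations_ms:
        raise ValueError("at least one duration is required")
    ordered = sorted(durations_ms)
    n = len(ordered)
    rank = (95 * n + 99) // 100  # ceil(0.95 * n) in integer arithmetic, 1-based
    return ordered[rank - 1], mean(ordered), n


# Hypothetical per-scene durations in ms, not taken from the dashboard.
sample = [3900, 4100, 4300, 4500, 4800, 6500]
p95, avg, n = latency_summary(sample)
print(f"p95 • avg • N {p95} ms • {avg:.0f} ms • {n}")
```

With small N, as in this suite, the nearest-rank p95 is simply the slowest observation, which is why a single long run can push p95 far above the average.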
Completion Progress: 100% (6 of 6 scenes completed)
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
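Dimension weights like these are typically combined with per-dimension scores into one number. Here is a minimal sketch of a weighted mean, assuming the score is the weight-normalized sum over whichever dimensions are available; the page does not show the framework's actual formula, only its top three weights, and `weighted_score` and the example scores are hypothetical.

```python
# Weights from "Top Weighted Dimensions"; other dimensions are not listed in this view.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dim_scores, weights=WEIGHTS):
    """Weighted mean over the dimensions present in both mappings."""
    shared = [d for d in dim_scores if d in weights]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to aggregate")
    return sum(weights[d] * dim_scores[d] for d in shared) / total_weight


# Hypothetical per-dimension scores for one response.
example = {
    "Character Authenticity": 0.80,
    "Plan Validity": 0.70,
    "Contextual Intelligence": 0.90,
}
print(round(weighted_score(example), 3))
```

Dividing by the total weight keeps the result on the same 0 to 1 scale as the inputs even though the three listed weights sum to less than 1.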
Recent Runs
- 00632037 • Dec. 17, 2025, 12:02 a.m.
- 20813454 • Dec. 16, 2025, 12:02 a.m.
- 54261953 • Dec. 15, 2025, 12:01 a.m.
- 56882395 • Dec. 14, 2025, 12:01 a.m.
- 55091444 • Dec. 13, 2025, 12:01 a.m.
- 11972762 • Dec. 12, 2025, 12:02 a.m.
- 07459241 • Dec. 11, 2025, 12:02 a.m.
- 57238007 • Dec. 10, 2025, 12:01 a.m.
- 13775326 • Dec. 9, 2025, 12:02 a.m.
- 00460799 • Dec. 8, 2025, 12:02 a.m.