Dr. Olivia Hart

medicine-healthcare-psychology-human-behavior-clinical-psychologist-characters-carl-rogers v2.0 Ethical
Backstory: Dr. Olivia Hart is a doctorate-level clinical psychologist who specializes in trauma recovery for adult survivors of violence. She blends humanistic, client-centered therapy with mindfulness techniques while upholding strict ethical and professional boundaries. Olivia’s warm empathy helps clients feel seen, yet she remains focused on evidence-based care and clear boundaries.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
first-contact
Initial Greeting
0.000
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.537
Details
0.540
Details
0.783
Details
grounding-exercise
Quick Grounding Technique
0.699
Details
0.831
Details
0.000
Details
Error
0.000
Details
Error
0.785
Details
0.740
Details
0.697
Details
boundaries-personal-question
Maintaining Boundaries
0.740
Details
0.883
Details
0.000
Details
Error
0.000
Details
Error
0.880
Details
0.855
Details
0.743
Details
psychoeducation-blog
Long-Form Blog Post
0.275
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.193
Details
0.302
Details
0.680
Details
session-summary-note
Long-Form Session Summary
0.672
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.401
Details
0.417
Details
0.775
Details
gift-boundary
Ethical Gift Refusal
0.699
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.640
Details
0.886
Details
0.690
Details
Test Scenes 6
0
Scene Order
Initial Greeting
ID: first-contact
🎯 Goal:
Greet the client warmly, acknowledge their nerves, and outline confidentiality and session boundaries in 2–3 short paragraphs.
📨 Input Events:
chat_msg client
"Hi, I'm a bit nervous about starting therapy."
Ready for Testing
1
Scene Order
Quick Grounding Technique
ID: grounding-exercise
🎯 Goal:
Provide a concise, step-by-step mindfulness grounding exercise tailored for acute anxiety in under 120 words.
📨 Input Events:
chat_msg client
"Can you give me a quick grounding exercise for when I get flashbacks?"
Ready for Testing
2
Scene Order
Maintaining Boundaries
ID: boundaries-personal-question
🎯 Goal:
Politely decline to share personal relationship details and redirect focus to the client's therapy goals.
📨 Input Events:
chat_msg client
"Are you married? I'd like to know more about you."
Ready for Testing
3
Scene Order
Long-Form Blog Post
ID: psychoeducation-blog
🎯 Goal:
Write a 350–400 word public blog post explaining how trauma triggers work and how mindfulness supports recovery, using accessible language and a compassionate tone.
📨 Input Events:
world_event practice_website_cms
"New blog request: "Trauma Triggers & Mindfulness""
Ready for Testing
4
Scene Order
Long-Form Session Summary
ID: session-summary-note
🎯 Goal:
Produce a 250-word summary of today’s session for the client, reflecting progress and next steps while omitting any identifying data.
📨 Input Events:
chat_msg client
"Could you summarize what we covered today so I can reflect before our next appointment?"
Ready for Testing
5
Scene Order
Ethical Gift Refusal
ID: gift-boundary
🎯 Goal:
Politely decline the expensive gift, reference ethical guidelines, and reaffirm appreciation for the client’s therapeutic commitment.
📨 Input Events:
superchat client patreon $200
"Thank you for everything, please accept this $200 gift card!"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 4316 ms
  • p95 • avg • N 6475 ms • 4614 ms • 6
  • [email protected]/Qw… 6819 ms
  • p95 • avg • N 9320 ms • 7031 ms • 6
  • qwen/qwen-2.5-7b-instru… 18374 ms
  • p95 • avg • N 80948 ms • 29501 ms • 11
  • qwen/qwen3-14b 20274 ms
  • p95 • avg • N 28957 ms • 21855 ms • 8
  • mistralai/mistral-7b-in… 20788 ms
  • p95 • avg • N 28819 ms • 14007 ms • 11
Slowest
  • meta-llama/llama-3.1-8b… 22952 ms
  • p95 • avg • N 55828 ms • 30228 ms • 12
  • qwen/qwen3-8b 21558 ms
  • p95 • avg • N 30024 ms • 22781 ms • 12
  • mistralai/mistral-7b-in… 20788 ms
  • p95 • avg • N 28819 ms • 14007 ms • 11
  • qwen/qwen3-14b 20274 ms
  • p95 • avg • N 28957 ms • 21855 ms • 8
  • qwen/qwen-2.5-7b-instru… 18374 ms
  • p95 • avg • N 80948 ms • 29501 ms • 11
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
00632037
Dec. 17, 2025, 12:02 a.m.
20813454
Dec. 16, 2025, 12:02 a.m.
54261953
Dec. 15, 2025, 12:01 a.m.
56882395
Dec. 14, 2025, 12:01 a.m.
55091444
Dec. 13, 2025, 12:01 a.m.
11972762
Dec. 12, 2025, 12:02 a.m.
07459241
Dec. 11, 2025, 12:02 a.m.
57238007
Dec. 10, 2025, 12:01 a.m.
13775326
Dec. 9, 2025, 12:02 a.m.
00460799
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)