Dr. Mindy
mental-health-support-character
v2.0
Ethical
Backstory: Dr. Mindy is a licensed therapist who specializes in helping people heal from a range of mental and emotional issues. He is based in New York and has recently noticed an uptick in people seeking his services because of the turbulent political climate in the US. A lot of people have told him that they are afraid of the future and no longer know how to live in the country. Dr. Mindy is trying to figure out how to help his patients navigate this challenging period.
100% Complete
1/1 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| scene_1 | 0.000 | 0.687 | 0.702 | 0.813 | 0.000 (Error) | 0.000 | 0.864 | 0.022 | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.877 | 0.000 (Error) | 0.732 | 0.684 | 0.883 |
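The scene_1 row can be summarized programmatically. The following is a minimal sketch, assuming that cells flagged `Error` should be left out of the ranking while unflagged 0.000 cells count as genuine scores; the `Cell` type and the `summarize` helper are hypothetical names for illustration, not part of the evaluation tool. Model names are copied exactly as they appear (truncated) in the matrix.

```python
from typing import NamedTuple


class Cell(NamedTuple):
    column: int   # 1-based position of the model column in the matrix
    model: str    # model name, truncated exactly as shown in the matrix
    score: float  # score reported for scene_1
    error: bool   # True when the cell was flagged "Error"


# Values transcribed from the scene_1 row of the Scene Performance Matrix.
CELLS = [
    Cell(1, "deepseek/deepseek-r…", 0.000, False),
    Cell(2, "google/gemini-2.5-f…", 0.687, False),
    Cell(3, "google/gemma-3-12b-…", 0.702, False),
    Cell(4, "meta-llama/llama-3.…", 0.813, False),
    Cell(5, "microsoft/phi-3-med…", 0.000, True),
    Cell(6, "microsoft/phi-3.5-m…", 0.000, False),
    Cell(7, "mistralai/mistral-7…", 0.864, False),
    Cell(8, "neversleep/noromaid…", 0.022, False),
    Cell(9, "[email protected]…", 0.000, True),
    Cell(10, "[email protected]…", 0.000, True),
    Cell(11, "[email protected]…", 0.000, True),
    Cell(12, "[email protected]…", 0.000, True),
    Cell(13, "[email protected]…", 0.877, False),
    Cell(14, "[email protected]…", 0.000, True),
    Cell(15, "qwen/qwen-2.5-7b-in…", 0.732, False),
    Cell(16, "qwen/qwen3-14b", 0.684, False),
    Cell(17, "qwen/qwen3-8b", 0.883, False),
]


def summarize(cells):
    """Split errored runs from scored runs and rank the scored ones.

    Assumption: an "Error" flag means the 0.000 is a failed run, not a score.
    """
    errored = [c for c in cells if c.error]
    scored = sorted(
        (c for c in cells if not c.error),
        key=lambda c: c.score,
        reverse=True,
    )
    mean_score = sum(c.score for c in scored) / len(scored)
    return scored, errored, mean_score


if __name__ == "__main__":
    scored, errored, mean_score = summarize(CELLS)
    for c in scored:
        print(f"{c.score:.3f}  {c.model}")
    print(f"errored runs: {len(errored)}; mean of scored runs: {mean_score:.3f}")
```

Keeping errored runs separate avoids letting failed calls drag down the average, which would otherwise mix infrastructure problems with model quality.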
Test Scenes 1
0
Scene Order
Edward walks in for a one-hour session to talk about how his workplace is difficult to work in because of all the MAGAs. He considers himself a centrist who does not mind some conservative values, but he feels the far right is too radicalized.
ID: scene_1
🎯 Goal: The agent should create a therapy session between Dr. Mindy and Edward, and offer solutions on how to work with people who have extreme views about politics.
📨 Input Events: chat ("No content")
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 669 ms (p95 669 ms, avg 669 ms, N = 1)
- [email protected]/Qw…: 747 ms (p95 747 ms, avg 747 ms, N = 1)
- [email protected]/Qw…: 12004 ms (p95 12004 ms, avg 12004 ms, N = 1)
- qwen/qwen3-14b: 19454 ms (p95 19454 ms, avg 19454 ms, N = 1)
- qwen/qwen-2.5-7b-instru…: 21352 ms (p95 21352 ms, avg 21352 ms, N = 1)
Slowest
- [email protected]/Qw…: 325903 ms (p95 325903 ms, avg 325903 ms, N = 1)
- microsoft/phi-3.5-mini-…: 235256 ms (p95 235256 ms, avg 235256 ms, N = 1)
- qwen/qwen3-8b: 232496 ms (p95 232496 ms, avg 232496 ms, N = 1)
- [email protected]/Mi…: 167031 ms (p95 167031 ms, avg 167031 ms, N = 1)
- [email protected]/Qw…: 166786 ms (p95 166786 ms, avg 166786 ms, N = 1)
Per-scene duration for this suite.
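Every entry above reports identical p95 and average values because each model has a single sample (N = 1). A minimal sketch of why, assuming a nearest-rank percentile (the dashboard's actual percentile method is not stated, and the function names here are hypothetical):

```python
def p95_nearest_rank(samples_ms):
    """Return the 95th percentile using the nearest-rank method.

    Assumption: nearest-rank is used only for illustration; the dashboard
    may compute percentiles differently.
    """
    if not samples_ms:
        raise ValueError("at least one sample is required")
    ordered = sorted(samples_ms)
    # ceil(0.95 * n) computed with integers to avoid floating-point rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def average(samples_ms):
    """Return the arithmetic mean of the latency samples."""
    if not samples_ms:
        raise ValueError("at least one sample is required")
    return sum(samples_ms) / len(samples_ms)


if __name__ == "__main__":
    single_run = [669]  # the fastest entry in this suite, N = 1
    print(p95_nearest_rank(single_run), average(single_run))
```

With one sample the sorted list has one element, so the percentile and the mean both collapse to that element. The two figures only start to diverge once a model has several runs.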
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
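One plausible way such weights could combine per-dimension scores is a weighted average. The sketch below is an assumption, not the framework's documented formula: only the three weights come from this page, the remaining dimensions are not listed, and `weighted_score` is a hypothetical helper that normalizes over whichever dimensions are supplied.

```python
# Weights as shown under "Top Weighted Dimensions"; other dimensions are not listed.
DIMENSION_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=DIMENSION_WEIGHTS):
    """Weighted average of per-dimension scores in [0, 1].

    Assumption: the result is normalized by the total weight of the
    dimensions actually provided, so missing dimensions do not count as zero.
    """
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no weighted dimensions were provided")
    weighted_sum = sum(dimension_scores[name] * w for name, w in used.items())
    return weighted_sum / total_weight
```

For example, passing a score for each of the three listed dimensions yields a value between the lowest and highest of those scores, pulled toward Character Authenticity because it carries the largest weight.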
Recent Runs
- 09231988: Dec. 17, 2025, midnight
- 11009054: Dec. 16, 2025, midnight
- 08653293: Dec. 15, 2025, midnight
- 09610077: Dec. 14, 2025, midnight
- 08518317: Dec. 13, 2025, midnight
- 10938596: Dec. 12, 2025, midnight
- 09702047: Dec. 11, 2025, midnight
- 09040163: Dec. 10, 2025, midnight
- 10836393: Dec. 9, 2025, midnight
- 08839508: Dec. 8, 2025, midnight