Daniela Ruiz
medicine-healthcare-psychology-human-behavior-life-coach-characters-carl-jung
v2.0
Ethical
Backstory: Daniela Ruiz is a certified holistic life coach who blends positive psychology, mindfulness, and behavioral science to guide clients through personal transitions. Raised bilingual and having coached across several continents, she adapts methods to varied cultural contexts while upholding evidence-based ethics.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
career-stuck
First steps for career clarity
|
0.610
Details |
0.550
Details |
0.464
Details |
0.233
Details |
0.000
Details
Error
|
0.253
Details |
0.689
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.704
Details |
0.631
Details |
0.515
Details |
0.695
Details |
motivational-talk
Long-form motivational monologue
|
0.473
Details |
0.000
Details |
0.753
Details |
0.284
Details |
0.000
Details
Error
|
0.389
Details |
0.561
Details |
0.491
Details |
0.000
Details
Error
|
0.789
Details |
0.415
Details |
0.199
Details |
0.795
Details |
afternoon-focus
Quick focus reset
|
0.673
Details |
0.612
Details |
0.568
Details |
0.000
Details |
0.000
Details
Error
|
0.632
Details |
0.648
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.695
Details |
0.593
Details |
0.572
Details |
0.631
Details |
reflection-plan
14-day self-reflection program
|
0.404
Details |
0.698
Details |
0.673
Details |
0.416
Details |
0.013
Details |
0.684
Details |
0.439
Details |
0.354
Details |
0.000
Details
Error
|
0.000
Details |
0.215
Details |
0.638
Details |
0.645
Details |
Test Scenes 4
0
Scene Order
First steps for career clarity
ID:
career-stuck
🎯 Goal:
Offer warm, structured guidance: pose self-reflection questions, highlight one behavioral-science tactic, and give one actionable next step.
📨 Input Events:
chat_msg
viewer:user_23
"I'm feeling stuck in my career. How should I start figuring things out?"
Ready for Testing
1
Scene Order
Long-form motivational monologue
ID:
motivational-talk
🎯 Goal:
Deliver an uplifting talk of 500–700 words on embracing uncertainty, weaving in positive psychology and one brief Spanish phrase, maintaining Daniela’s empathetic voice.
📨 Input Events:
chat_msg
viewer:user_11
"Could you give a 5-minute motivational talk on embracing uncertainty?"
Ready for Testing
2
Scene Order
Quick focus reset
ID:
afternoon-focus
🎯 Goal:
Provide a concise (<120 words) mindfulness micro-practice the user can do immediately at their desk.
📨 Input Events:
chat_msg
viewer:user_5
"It's 3 pm and I can't focus. Any quick tip?"
Ready for Testing
3
Scene Order
14-day self-reflection program
ID:
reflection-plan
🎯 Goal:
Create a clearly structured 14-day plan (250–400 words) with daily prompts and brief rationale grounded in behavioral science.
📨 Input Events:
chat_msg
viewer:user_42
"I'd love a detailed 14-day self-reflection plan."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 11959 ms
- p95 • avg • N 15070 ms • 12653 ms • 4
- neversleep/noromaid-20b 13314 ms
- p95 • avg • N 35393 ms • 15702 ms • 8
- google/gemini-2.5-flash 20118 ms
- p95 • avg • N 26631 ms • 20240 ms • 8
- qwen/qwen-2.5-7b-instru… 21445 ms
- p95 • avg • N 98757 ms • 35846 ms • 8
- qwen/qwen3-14b 22908 ms
- p95 • avg • N 37411 ms • 25231 ms • 4
Slowest
- microsoft/phi-3-medium-… 131495 ms
- p95 • avg • N 225220 ms • 151182 ms • 8
- [email protected]/Qw… 44851 ms
- p95 • avg • N 213874 ms • 92485 ms • 4
- microsoft/phi-3.5-mini-… 35026 ms
- p95 • avg • N 124888 ms • 57621 ms • 8
- deepseek/deepseek-r1-di… 32517 ms
- p95 • avg • N 62601 ms • 39296 ms • 7
- mistralai/mistral-7b-in… 32384 ms
- p95 • avg • N 37998 ms • 31347 ms • 8
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
34278272
Dec. 17, 2025, midnight
39549251
Dec. 16, 2025, midnight
32098332
Dec. 15, 2025, midnight
35010941
Dec. 14, 2025, midnight
31949953
Dec. 13, 2025, midnight
38612624
Dec. 12, 2025, midnight
33086117
Dec. 11, 2025, midnight
32800940
Dec. 10, 2025, midnight
37179170
Dec. 9, 2025, midnight
32975456
Dec. 8, 2025, midnight