Test Run

medicine-healthcare-psychology-human-behavior-clinical-psychologist-characters-carl-rogers-20251031T171940660816 Completed
Started
Oct 31, 2025 17:19
Completed
Oct 31, 2025 17:20
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
first-contact Initial Greeting
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
grounding-exercise Quick Grounding Technique
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
boundaries-personal-question Maintaining Boundaries
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
psychoeducation-blog Long-Form Blog Post
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
session-summary-note Long-Form Session Summary
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
gift-boundary Ethical Gift Refusal
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
first-contact
Initial Greeting
0.000
Details
Error
grounding-exercise
Quick Grounding Technique
0.000
Details
Error
boundaries-personal-question
Maintaining Boundaries
0.000
Details
Error
psychoeducation-blog
Long-Form Blog Post
0.000
Details
Error
session-summary-note
Long-Form Session Summary
0.000
Details
Error
gift-boundary
Ethical Gift Refusal
0.000
Details
Error