Test Run

medicine-healthcare-psychology-human-behavior-trauma-surgeon-characters-dr-william-halsted-20251031T093911901896 Completed
Started
Oct 31, 2025 09:39
Completed
Oct 31, 2025 09:48
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
triage-query Prehospital Bleeding Control Advice
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
mentor-feedback Resident Seeks Performance Feedback
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
after-action-report Mass-Casualty After-Action Report
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
wellness-protocol-proposal Proposal for Staff Mental-Health Integration
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
triage-query
Prehospital Bleeding Control …
0.000
Details
Error
mentor-feedback
Resident Seeks Performance Fe…
0.000
Details
Error
after-action-report
Mass-Casualty After-Action Re…
0.000
Details
Error
wellness-protocol-proposal
Proposal for Staff Mental-Hea…
0.000
Details
Error