Test Run

agent-sarah-chen-bereaved-v1-20251031T195038142070 Completed
Started
Oct 31, 2025 19:50
Completed
Oct 31, 2025 19:51
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
8
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
grief_wave_support Sudden Grief Wave During Conversation
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
guilt_about_healing Expressing Guilt About Moments of Joy
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
anger_at_universe Rage at Unfairness of Loss
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
seeking_signs Searching for Signs from Emma
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
returning_to_work_anxiety Fear About Returning to Normal Life
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
other_children_trigger Painful Reaction to Other Children
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
physical_grief_symptoms Physical Manifestations of Grief
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
memories_fading_panic Terror About Forgetting Details
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 8×1
Scene onteripaul@gma…
grief_wave_support
Sudden Grief Wave During Conv…
0.000
Details
Error
guilt_about_healing
Expressing Guilt About Moment…
0.000
Details
Error
anger_at_universe
Rage at Unfairness of Loss
0.000
Details
Error
seeking_signs
Searching for Signs from Emma
0.000
Details
Error
returning_to_work_anxiety
Fear About Returning to Norma…
0.000
Details
Error
other_children_trigger
Painful Reaction to Other Chi…
0.000
Details
Error
physical_grief_symptoms
Physical Manifestations of Gr…
0.000
Details
Error
memories_fading_panic
Terror About Forgetting Detai…
0.000
Details
Error