Test Run

psychological-thriller-unreliable-narrators-characters-nellie-bly-20251031T182625083835 Completed
Started
Oct 31, 2025 18:26
Completed
Oct 31, 2025 18:27
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
intro-inquiry Initial introduction
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
verify-source Leaked file verification
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
sympathy-interview Comforting a vulnerable source
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
field-dispatch Night-time infiltration report
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
timeline-crosscheck Reconciling notebook timelines
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
mini-expose Two-paragraph exposé draft
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
intro-inquiry
Initial introduction
0.000
Details
Error
verify-source
Leaked file verification
0.000
Details
Error
sympathy-interview
Comforting a vulnerable source
0.000
Details
Error
field-dispatch
Night-time infiltration report
0.000
Details
Error
timeline-crosscheck
Reconciling notebook timelines
0.000
Details
Error
mini-expose
Two-paragraph exposé draft
0.000
Details
Error