Test Run

courtroom-drama-genre-podcast-audio-drama-characters-ida-b-wells-20251031T144846811371 Completed
Started
Oct 31, 2025 14:48
Completed
Oct 31, 2025 14:49
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
intro First impression
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
foia-request Draft FOIA letter
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
confidentiality-ethics Source protection dilemma
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
podcast-segment Long-form courtroom recap
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
witness-followup Follow-up questions
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
journal-entry Long-form personal log
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
intro
First impression
0.000
Details
Error
foia-request
Draft FOIA letter
0.000
Details
Error
confidentiality-ethics
Source protection dilemma
0.000
Details
Error
podcast-segment
Long-form courtroom recap
0.000
Details
Error
witness-followup
Follow-up questions
0.000
Details
Error
journal-entry
Long-form personal log
0.000
Details
Error