Test Run

Run: entertainment-media-podcaster-characters-howard-cosell-20251031T152148883694
Status: Completed
Started: Oct 31, 2025 15:21
Completed: Oct 31, 2025 15:22
Model Results
Performance: 0.000
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
| Scene Name | Score | Result | Model |
|---|---|---|---|
| champs-preview: Champions League Final Preview (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| injury-update: Breaking Injury News (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| superchat-probability: Superchat on Scoring Odds (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| mental-health-slump: Addressing a Player’s Slump (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| olympic-sprint-segment: Long-Form: 100m Final Breakdown (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| biomechanics-episode: Long-Form: Biomechanics Collab (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
| champs-preview: Champions League Final Preview | 0.000 (Error) |
| injury-update: Breaking Injury News | 0.000 (Error) |
| superchat-probability: Superchat on Scoring Odds | 0.000 (Error) |
| mental-health-slump: Addressing a Player’s Slump | 0.000 (Error) |
| olympic-sprint-segment: Long-Form: 100m Final Breakdo… | 0.000 (Error) |
| biomechanics-episode: Long-Form: Biomechanics Collab | 0.000 (Error) |