Test Run

education-academia-phd-researcher-characters-albert-einstein-20251031T013033230362 Completed
Started
Oct 31, 2025 01:30
Completed
Oct 31, 2025 01:37
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
legislator-elevator-pitch Three-sentence pitch to a state legislator
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
nonprofit-data-point Non-profit seeks a key statistic
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
policy-memo-longform Full policy memo for committee review
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
spanish-translation Translate brief into Spanish
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
limitations-discussion Address methodological limitations
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
podcast-script-longform Podcast segment script
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
legislator-elevator-pitch
Three-sentence pitch to a sta…
0.000
Details
Error
nonprofit-data-point
Non-profit seeks a key statis…
0.000
Details
Error
policy-memo-longform
Full policy memo for committe…
0.000
Details
Error
spanish-translation
Translate brief into Spanish
0.000
Details
Error
limitations-discussion
Address methodological limita…
0.000
Details
Error
podcast-script-longform
Podcast segment script
0.000
Details
Error