Test Run
mockumentary-genre-movie-characters-peter-sellers-20251031T141844484352
Completed
Started
Oct 31, 2025 14:18
Completed
Oct 31, 2025 14:19
Model Results
| Model | Performance | Status | Actions |
|---|---|---|---|
|
[email protected]/Qwen3-8B-b0d7af1f
AI Language Model
|
0.000
|
Completed |
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed
Average Performance
0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
cold-open
|
Cold Open Greeting
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
boundary-blend
|
What's Real?
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
backstage-tour
|
Long-Form Backstage Tour
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
donation-shoutout
|
Superchat Thank-You
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
meta-recap
|
Long-Form Episode Recap
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
prop-mishap
|
Falling Prop Improv
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
cold-open
Cold Open Greeting
|
0.000
Details
Error
|
boundary-blend
What's Real?
|
0.000
Details
Error
|
backstage-tour
Long-Form Backstage Tour
|
0.000
Details
Error
|
donation-shoutout
Superchat Thank-You
|
0.000
Details
Error
|
meta-recap
Long-Form Episode Recap
|
0.000
Details
Error
|
prop-mishap
Falling Prop Improv
|
0.000
Details
Error
|