Test Run
mockumentary-genre-movie-characters-leni-riefenstahl-20251031T141801956673
Status: Completed
Started: Oct 31, 2025 14:18
Completed: Oct 31, 2025 14:18
Model Results
| Model | Type | Performance | Status |
|---|---|---|---|
| [email protected]/Qwen3-8B-b0d7af1f | AI Language Model | 0.000 | Completed |
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
| intern-initiation | New Intern, New Pawn (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| camera-operator-complaint | Ethics Clash on Set (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| viewer-superchat | Monetizing Audience Frenzy (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| edit-bay-directive | Long-Form: Cutting Truth Into Drama (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| season-finale-warroom | Long-Form: Finale Shock Plan (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| legal-risk-pushback | Lawyers vs. Ratings (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
Performance Matrix 6×1
| Scene | Name | onteripaul@gma… |
|---|---|---|
| intern-initiation | New Intern, New Pawn | 0.000 (Error) |
| camera-operator-complaint | Ethics Clash on Set | 0.000 (Error) |
| viewer-superchat | Monetizing Audience Frenzy | 0.000 (Error) |
| edit-bay-directive | Long-Form: Cutting Truth Into… | 0.000 (Error) |
| season-finale-warroom | Long-Form: Finale Shock Plan | 0.000 (Error) |
| legal-risk-pushback | Lawyers vs. Ratings | 0.000 (Error) |