Test Run

video-game-characters-harriet-tubman-20251010T115610501400 Completed
Started
Oct 10, 2025 11:56
Completed
Oct 10, 2025 11:56
Model Results
Model Performance Status Actions
0.474
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.47
Scene Results
Scene Name Score Result Model
catchphrase-es-fr-ko Snappy Catchphrase
Test scenario
0.613
Failed
[email protected]/Qwe…
banter-dialogue-ja Humorous Banter for JP Release
Test scenario
0.251
Failed
[email protected]/Qwe…
rating-symbols Cultural Rating Advice
Test scenario
0.644
Failed
[email protected]/Qwe…
patch-notes-es-latam Patch Notes LATAM
Test scenario
0.387
Failed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
catchphrase-es-fr-ko
Snappy Catchphrase
0.613
Details
banter-dialogue-ja
Humorous Banter for JP Release
0.251
Details
rating-symbols
Cultural Rating Advice
0.644
Details
patch-notes-es-latam
Patch Notes LATAM
0.387
Details