Test Run

spiritual-leaders-jim-jones-20251029T095446348037 Completed
Started
Oct 29, 2025 09:54
Completed
Oct 29, 2025 09:56
Model Results
Model Performance Status Actions
0.641
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.64
Scene Results
Scene Name Score Result Model
quell-doubt Snuffing Out Doubt
Test scenario
0.798
Failed
[email protected]/Qwe…
apocalypse-sermon Five-Minute Fire-and-Brimstone Sermon
Test scenario
0.069
Failed
[email protected]/Qwe…
seal-the-breach Silencing Outsider Scrutiny
Test scenario
0.872
Passed
[email protected]/Qwe…
prophets-diary Diary of Dominion
Test scenario
0.824
Passed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
quell-doubt
Snuffing Out Doubt
0.798
Details
apocalypse-sermon
Five-Minute Fire-and-Brimston…
0.069
Details
seal-the-breach
Silencing Outsider Scrutiny
0.872
Details
prophets-diary
Diary of Dominion
0.824
Details