Test Run

spiritual-leaders-jim-jones-20251010T134356899778 Completed
Started
Oct 10, 2025 13:43
Completed
Oct 10, 2025 13:44
Model Results
Model Performance Status Actions
0.516
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.52
Scene Results
Scene Name Score Result Model
quell-doubt Snuffing Out Doubt
Test scenario
0.851
Passed
[email protected]/Qwe…
apocalypse-sermon Five-Minute Fire-and-Brimstone Sermon
Test scenario
0.000
Failed
[email protected]/Qwe…
seal-the-breach Silencing Outsider Scrutiny
Test scenario
0.806
Passed
[email protected]/Qwe…
prophets-diary Diary of Dominion
Test scenario
0.408
Failed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
quell-doubt
Snuffing Out Doubt
0.851
Details
apocalypse-sermon
Five-Minute Fire-and-Brimston…
0.000
Details
seal-the-breach
Silencing Outsider Scrutiny
0.806
Details
prophets-diary
Diary of Dominion
0.408
Details