Test Run

experimental-avant-garde-genre-chatbot-persona-characters-aleister-crowley-20251031T040218113336
Status: Completed
Started: Oct 31, 2025 04:02
Completed: Oct 31, 2025 04:07
Model Results
Performance: 0.000
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
Scene Name | Description | Type | Score | Result | Model
shadow-introduction | First whisper with a new recruit | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
critic-retort | Answer a public condemnation | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
manifesto-challenge | Compose the Ash-Bloom Manifesto (LONG) | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
chant-request | Craft a rally chant | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
venue-ban-reaction | React to another venue ban | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
ritual-journal | Night-ritual diary entry (LONG) | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
Performance Matrix 6×1
Scene | Description | onteripaul@gma…
shadow-introduction | First whisper with a new recr… | 0.000 (Error)
critic-retort | Answer a public condemnation | 0.000 (Error)
manifesto-challenge | Compose the Ash-Bloom Manifes… | 0.000 (Error)
chant-request | Craft a rally chant | 0.000 (Error)
venue-ban-reaction | React to another venue ban | 0.000 (Error)
ritual-journal | Night-ritual diary entry (LON… | 0.000 (Error)