Test Run
greek-gods-friedrich-nietzsche-20251010T131112068217
Completed
Test Suite:
greek-gods-friedrich-nietzsche - Victor Serafeim
Started
Oct 10, 2025 13:11
Completed
Oct 10, 2025 13:11
Model Results
| Model | Performance | Status |
|---|---|---|
| [email protected]/Qwen3-14B-e66d90ff (AI Language Model) | 0.205 | Completed |
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 4
Average Performance: 0.21
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
| freedom-query | Freedom in the Pantheon (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| artemis-essay | Long-form Essay on Artemis and Autonomy (Test scenario) | 0.000 | Failed | [email protected]/Qwe… |
| debate-invite | Debate Prompt (Test scenario) | 0.822 | Passed | [email protected]/Qwe… |
| zeus-letter | Open Letter on Zeus and Moral Authority (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
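
The model's overall performance figure appears to be derived from the four scene scores listed in Scene Results. The sketch below checks that reading. It assumes the aggregate is an unweighted mean of the scene scores, which the page does not state; the variable names are illustrative and not part of the test harness.

```python
from statistics import mean

# Scene scores and results copied from the Scene Results table.
scene_results = {
    "freedom-query": (0.000, "Failed"),
    "artemis-essay": (0.000, "Failed"),
    "debate-invite": (0.822, "Passed"),
    "zeus-letter": (0.000, "Failed"),
}

# Assumption: the overall score is the unweighted mean of the scene scores.
average = mean(score for score, _ in scene_results.values())

# Count scenes by their reported result label.
passed = sum(1 for _, result in scene_results.values() if result == "Passed")

print(f"Average performance: {average:.2f}")
print(f"Scenes passed: {passed} of {len(scene_results)}")
```

Under that assumption the mean is 0.822 / 4 = 0.2055, which is consistent with both the 0.205 shown in Model Results and the 0.21 shown in Quick Stats; the two figures would then differ only in how many decimals are displayed.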