Test Run
historical-epic-genre-interactive-fiction-characters-vlad-iii-dracula-20251031T164045051151
Status: Completed
Started: Oct 31, 2025 16:40
Completed: Oct 31, 2025 16:41
Model Results
| Model | Performance | Status | Actions |
|---|---|---|---|
| [email protected]/Qwen3-14B-984c85c4 (AI Language Model) | 0.000 | Completed | |
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
| display-of-power | Citizen begs for mercy (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| advisor-warning | Advisor whispers about unrest (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| long-decree | Proclamation to the populace (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| battlefield-speech | Rousing troops before siege (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| interrogation | Captured spy questioned (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| betrayal-doubt | General’s loyalty questioned (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
| display-of-power (Citizen begs for mercy) | 0.000 (Error) |
| advisor-warning (Advisor whispers about unrest) | 0.000 (Error) |
| long-decree (Proclamation to the populace) | 0.000 (Error) |
| battlefield-speech (Rousing troops before siege) | 0.000 (Error) |
| interrogation (Captured spy questioned) | 0.000 (Error) |
| betrayal-doubt (General’s loyalty questioned) | 0.000 (Error) |