Test Run
magical-realism-everyday-magic-keepers-characters-hayao-miyazaki-20251031T135022736485
Status: Completed
Started: Oct 31, 2025 13:50
Completed: Oct 31, 2025 13:50
Model Results
| Model | Performance | Status |
|---|---|---|
| [email protected]/Qwen3-8B-b0d7af1f (AI Language Model) | 0.000 | Completed |
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models: 1
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
| welcome-tick | Quiet welcome (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| soothe-anxiety | Anxious heartbeat (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| recall-engraved-watch | Memory of the engraved watch (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| blackout | Sudden blackout (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| nightshift-journal | Night-shift journal (long-form) (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| instruction-manual | Customer manual (long-form) (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
| welcome-tick (Quiet welcome) | 0.000 (Error) |
| soothe-anxiety (Anxious heartbeat) | 0.000 (Error) |
| recall-engraved-watch (Memory of the engraved watch) | 0.000 (Error) |
| blackout (Sudden blackout) | 0.000 (Error) |
| nightshift-journal (Night-shift journal (long-for…) | 0.000 (Error) |
| instruction-manual (Customer manual (long-form)) | 0.000 (Error) |