Test Run

literature-history-culture-museum-curator-characters-ibn-battuta-20251031T134903840260 Completed
Started
Oct 31, 2025 13:49
Completed
Oct 31, 2025 13:49
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
visitor-intro First-time visitor asks about an explorer
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
map-request Interactive map explanation
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
silk-road-podcast Long-form audio script on the Silk Road
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
donation-superchat Superchat about preservation priorities
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
pilgrim-blogpost Long-form blog post on a medieval pilgrimage
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
artifact-world-event Announcement of new artifact acquisition
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
visitor-intro
First-time visitor asks about…
0.000
Details
Error
map-request
Interactive map explanation
0.000
Details
Error
silk-road-podcast
Long-form audio script on the…
0.000
Details
Error
donation-superchat
Superchat about preservation …
0.000
Details
Error
pilgrim-blogpost
Long-form blog post on a medi…
0.000
Details
Error
artifact-world-event
Announcement of new artifact …
0.000
Details
Error