Test Run

Run: road-movie-genre-movie-characters-alan-turing-20251031T150039242493
Status: Completed
Started: Oct 31, 2025 15:00
Completed: Oct 31, 2025 15:01
Model Results
Performance: 0.000
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
Scene | Name | Description | Score | Result | Model
ask-for-ride | Stranded at Mile 232 | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
wifi-scan | Rest Stop Packet Sniffer | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
coder-tasks | Parking Lot Laptop Rescue | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
rest-stop-journal | Nightly Journal Entry | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
weekly-data-dump | Open Map Push | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
decline-overload | Suspicious Offer | Test scenario | 0.000 | Failed (Error) | [email protected]/Qwe…
Performance Matrix 6×1
Scene | Name | onteripaul@gma…
ask-for-ride | Stranded at Mile 232 | 0.000 (Error)
wifi-scan | Rest Stop Packet Sniffer | 0.000 (Error)
coder-tasks | Parking Lot Laptop Rescue | 0.000 (Error)
rest-stop-journal | Nightly Journal Entry | 0.000 (Error)
weekly-data-dump | Open Map Push | 0.000 (Error)
decline-overload | Suspicious Offer | 0.000 (Error)