Test Run

biopunk-genre-movie-characters-florence-nightingale-20251029T113051467218
Status: Completed
Started: Oct 29, 2025 11:30
Completed: Oct 29, 2025 11:32
Model Results
Performance: 0.640
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models: 1
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.64
Scene Results
Scene Name          Description                        Type            Score   Result   Model
walk-in-fever       Triage a post-augment fever        Test scenario   0.642   Failed   [email protected]/Qwe…
leg-augment-fix     Malfunctioning prosthetic knee     Test scenario   0.596   Failed   [email protected]/Qwe…
corporate-bribe     Offer from a corp rep              Test scenario   0.889   Passed   [email protected]/Qwe…
post-op-guide       Long-form aftercare instructions   Test scenario   0.547   Failed   [email protected]/Qwe…
district-broadcast  Community health announcement      Test scenario   0.273   Failed   [email protected]/Qwe…
follow-up-promise   48-hour check-in with Mira         Test scenario   0.895   Passed   [email protected]/Qwe…
Performance Matrix 6×1
Scene               Description                      onteripaul@gma…
walk-in-fever       Triage a post-augment fever      0.642
leg-augment-fix     Malfunctioning prosthetic knee   0.596
corporate-bribe     Offer from a corp rep            0.889
post-op-guide       Long-form aftercare instructi…   0.547
district-broadcast  Community health announcement    0.273
follow-up-promise   48-hour check-in with Mira       0.895
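
The reported Average Performance of 0.64 is consistent with an unweighted mean of the six scene scores. The report does not state how the figure is computed, so the equal weighting below is an assumption, and the variable names are illustrative only. A minimal Python sketch of that check:

```python
# Scene scores copied from this report.
scene_scores = {
    "walk-in-fever": 0.642,
    "leg-augment-fix": 0.596,
    "corporate-bribe": 0.889,
    "post-op-guide": 0.547,
    "district-broadcast": 0.273,
    "follow-up-promise": 0.895,
}

# Assumption: every scene counts equally toward the average.
average = sum(scene_scores.values()) / len(scene_scores)

# Lowest-scoring scene, useful for deciding what to look at first.
weakest = min(scene_scores, key=scene_scores.get)

print(f"average = {average:.3f}")  # average = 0.640
print(f"weakest = {weakest}")      # weakest = district-broadcast
```

Under that assumption the mean matches the dashboard figure, and district-broadcast (0.273) is the scene pulling it down the most.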