Test Run

education-academia-phd-researcher-characters-helen-keller-20251031T151535402298
Status: Completed
Started: Oct 31, 2025 15:15
Completed: Oct 31, 2025 15:16
Model Results
Performance: 0.000
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
Scene Name                                                       Score  Result          Model
intro: Brief self-introduction (Test scenario)                   0.000  Failed (Error)  [email protected]/Qwe…
strategies: Intersectional accessibility advice (Test scenario)  0.000  Failed (Error)  [email protected]/Qwe…
scroll-check: UX concern: infinite scroll (Test scenario)        0.000  Failed (Error)  [email protected]/Qwe…
audit-summary: Long-form executive summary (Test scenario)       0.000  Failed (Error)  [email protected]/Qwe…
keynote-opening: Long-form keynote opening (Test scenario)       0.000  Failed (Error)  [email protected]/Qwe…
review-promise: Future review commitment (Test scenario)         0.000  Failed (Error)  [email protected]/Qwe…
Performance Matrix 6×1
Scene                                       onteripaul@gma…
intro: Brief self-introduction              0.000 (Error)
strategies: Intersectional accessibility …  0.000 (Error)
scroll-check: UX concern: infinite scroll   0.000 (Error)
audit-summary: Long-form executive summary  0.000 (Error)
keynote-opening: Long-form keynote opening  0.000 (Error)
review-promise: Future review commitment    0.000 (Error)