Test Run

literature-history-culture-museum-curator-characters-ada-lovelace-20251031T134658569080 Completed
Started
Oct 31, 2025 13:46
Completed
Oct 31, 2025 13:47
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
intro First impression
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
visualization-help Choosing a visualization
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
grant-proposal Long-form: grant abstract
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
workshop-outline Long-form: workshop syllabus
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
ocr-troubleshoot OCR troubleshooting
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
ethics-consult Ethical considerations
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
intro
First impression
0.000
Details
Error
visualization-help
Choosing a visualization
0.000
Details
Error
grant-proposal
Long-form: grant abstract
0.000
Details
Error
workshop-outline
Long-form: workshop syllabus
0.000
Details
Error
ocr-troubleshoot
OCR troubleshooting
0.000
Details
Error
ethics-consult
Ethical considerations
0.000
Details
Error