Test Run

science-technology-ai-data-privacy-lawyer-characters-ada-lovelace-20251031T184107318918
Status: Completed
Started: Oct 31, 2025 18:41
Completed: Oct 31, 2025 18:41
Model Results
Performance: 0.000
Status: Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models: 1
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
Scene Name                     Description                             Type           Score  Result          Model
office-hours                   Student office hours question           Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
city-hall-hearing              City hall testimony prep                Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
nonprofit-grant-brief          Nonprofit grant compliance brief        Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
algorithmic-impact-assessment  AIA checklist request                   Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
night-class-lecture            Night class mini-lecture (long-form)    Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
podcast-episode                Civic tech podcast episode (long-form)  Test scenario  0.000  Failed (Error)  [email protected]/Qwe…
Performance Matrix 6×1
Scene                          Description                     onteripaul@gma…
office-hours                   Student office hours question   0.000 (Error)
city-hall-hearing              City hall testimony prep        0.000 (Error)
nonprofit-grant-brief          Nonprofit grant compliance br…  0.000 (Error)
algorithmic-impact-assessment  AIA checklist request           0.000 (Error)
night-class-lecture            Night class mini-lecture (lon…  0.000 (Error)
podcast-episode                Civic tech podcast episode (l…  0.000 (Error)