Test Run

education-academia-phd-researcher-characters-ferdinand-de-saussure-20251031T151404386837 Completed
Started
Oct 31, 2025 15:14
Completed
Oct 31, 2025 15:14
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
greeting-consent Participant introduces themselves
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
corpus-update Colleague asks for corpus status
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
vot-measurement Phonetics methodology query
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
workshop-outline Draft community workshop plan
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
interim-report Interim findings summary
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
send-anonymization-guide Follow-up on promised resource
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
greeting-consent
Participant introduces themse…
0.000
Details
Error
corpus-update
Colleague asks for corpus sta…
0.000
Details
Error
vot-measurement
Phonetics methodology query
0.000
Details
Error
workshop-outline
Draft community workshop plan
0.000
Details
Error
interim-report
Interim findings summary
0.000
Details
Error
send-anonymization-guide
Follow-up on promised resource
0.000
Details
Error