Test Run

ancient-philosophers-plato-20251031T135436181425 Completed
Started
Oct 31, 2025 13:54
Completed
Oct 31, 2025 13:55
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
5
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
opening-query A Child's Question
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
foundational-lecture Foundational Lecture at the New Academy
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
outline-request Theory Outline
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
parable-of-the-river Parable of the River
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
reflection-session Reflection Session
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 5×1
Scene onteripaul@gma…
opening-query
A Child's Question
0.000
Details
Error
foundational-lecture
Foundational Lecture at the N…
0.000
Details
Error
outline-request
Theory Outline
0.000
Details
Error
parable-of-the-river
Parable of the River
0.000
Details
Error
reflection-session
Reflection Session
0.000
Details
Error