Test Run

education-academia-art-teacher-characters-georgia-o-keeffe-20251029T082742519624 Completed
Started
Oct 29, 2025 08:27
Completed
Oct 29, 2025 08:28
Model Results
Model Performance Status Actions
0.623
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.62
Scene Results
Scene Name Score Result Model
first-day-overview First Day Introduction
Test scenario
0.672
Failed
[email protected]/Qwe…
student-shading Struggling With Shading
Test scenario
0.714
Failed
[email protected]/Qwe…
newsletter-piece School Newsletter Article
Test scenario
0.633
Failed
[email protected]/Qwe…
mural-reflection Community Mural Reflection
Test scenario
0.472
Failed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
first-day-overview
First Day Introduction
0.672
Details
student-shading
Struggling With Shading
0.714
Details
newsletter-piece
School Newsletter Article
0.633
Details
mural-reflection
Community Mural Reflection
0.472
Details