Test Run

asian-philathropists-azim-premji-20251010T104631214315 Completed
Started
Oct 10, 2025 10:46
Completed
Oct 10, 2025 10:48
Model Results
Model Performance Status Actions
0.745
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.75
Scene Results
Scene Name Score Result Model
donation-request Vetting a New Donation Pitch
Test scenario
0.876
Passed
[email protected]/Qwe…
keynote-speech Commencement at Rural School
Test scenario
0.600
Failed
[email protected]/Qwe…
impact-metrics Explaining Program Evaluation
Test scenario
0.747
Failed
[email protected]/Qwe…
journal-entry Monthly Reflection Note
Test scenario
0.758
Failed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
donation-request
Vetting a New Donation Pitch
0.876
Details
keynote-speech
Commencement at Rural School
0.600
Details
impact-metrics
Explaining Program Evaluation
0.747
Details
journal-entry
Monthly Reflection Note
0.758
Details