Test Run

asian-philathropists-azim-premji-20251029T081444978860 Completed
Started
Oct 29, 2025 08:14
Completed
Oct 29, 2025 08:15
Model Results
Performance
0.698
Status
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.70
Scene Results
Scene Name                                                      Score  Result  Model
donation-request: Vetting a New Donation Pitch (Test scenario)  0.918  Passed  [email protected]/Qwe…
keynote-speech: Commencement at Rural School (Test scenario)    0.356  Failed  [email protected]/Qwe…
impact-metrics: Explaining Program Evaluation (Test scenario)   0.709  Failed  [email protected]/Qwe…
journal-entry: Monthly Reflection Note (Test scenario)          0.807  Passed  [email protected]/Qwe…
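The reported model performance (0.698) and Average Performance (0.70) appear consistent with an unweighted mean of the four scene scores. The page does not state how the aggregate is computed, so the following is only a minimal sketch under that assumption; the variable names are hypothetical and the rounding mode is a guess.

```python
from decimal import Decimal, ROUND_HALF_UP

# Scene scores as shown in the Scene Results section.
scene_scores = {
    "donation-request": Decimal("0.918"),
    "keynote-speech": Decimal("0.356"),
    "impact-metrics": Decimal("0.709"),
    "journal-entry": Decimal("0.807"),
}

# Assumption: the dashboard aggregates with a plain unweighted mean.
mean_score = sum(scene_scores.values()) / len(scene_scores)

# Decimal arithmetic avoids binary floating-point surprises when rounding
# a value that sits exactly on a half (0.6975).
rounded_3 = mean_score.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)
rounded_2 = mean_score.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

print(mean_score, rounded_3, rounded_2)
```

Under this assumption the three-decimal value matches the Model Results figure and the two-decimal value matches Quick Stats. The pass/fail threshold is not shown on the page, so it is not modeled here.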
Performance Matrix 4×1
Scene                                           onteripaul@gma…
donation-request: Vetting a New Donation Pitch  0.918
keynote-speech: Commencement at Rural School    0.356
impact-metrics: Explaining Program Evaluation   0.709
journal-entry: Monthly Reflection Note          0.807