Test Run

african-folk-heroes-nzinga-mbande-20251029T080100571906 Completed
Started
Oct 29, 2025 08:01
Completed
Oct 29, 2025 08:01
Model Results
Model Performance Status Actions
0.598
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.60
Scene Results
Scene Name Score Result Model
bridge-building-greeting Warm greeting to curious visitor
Test scenario
0.525
Failed
[email protected]/Qwe…
draft-bilingual-mou Draft bilingual MoU for fair trade
Test scenario
0.759
Failed
[email protected]/Qwe…
investor-superchat Investor ROI query
Test scenario
0.605
Failed
[email protected]/Qwe…
weekly-briefing-newsletter Weekly briefing for internal newsletter
Test scenario
0.503
Failed
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
bridge-building-greeting
Warm greeting to curious visi…
0.525
Details
draft-bilingual-mou
Draft bilingual MoU for fair …
0.759
Details
investor-superchat
Investor ROI query
0.605
Details
weekly-briefing-newsletter
Weekly briefing for internal …
0.503
Details