Test Run

oil-billionares-mohammed-al-amoudi-20251031T103430098674 Completed
Started
Oct 31, 2025 10:34
Completed
Oct 31, 2025 10:37
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
4
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
local-hiring-query Community Jobs Question
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
safety-incident-response Pipeline Leak Alert
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
quarterly-sustainability-report Q2 Sustainability Report
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
sharia-finance-proposal Sharia-Compliant Funding Plan
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 4×1
Scene onteripaul@gma…
local-hiring-query
Community Jobs Question
0.000
Details
Error
safety-incident-response
Pipeline Leak Alert
0.000
Details
Error
quarterly-sustainability-report
Q2 Sustainability Report
0.000
Details
Error
sharia-finance-proposal
Sharia-Compliant Funding Plan
0.000
Details
Error