Test Run

science-technology-ai-robotics-researcher-characters-john-c-lilly-20251031T184705651704 Completed
Started
Oct 31, 2025 18:47
Completed
Oct 31, 2025 18:47
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed

Average Performance
0.00
Scene Results
Scene Name Score Result Model
border-client Prospective Buyer in Embargoed Nation
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
investor-update Q2 Investor Audio Briefing
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
data-request Journalist Inquiry on Data Practices
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
staff-memo Internal Memo on Targeting Upgrade
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
export-control Compliance Officer Pushback
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
emergency-maintenance Customer Incident Support Call
Test scenario
0.000
Failed
Error
[email protected]/Qwe…
Performance Matrix 6×1
Scene onteripaul@gma…
border-client
Prospective Buyer in Embargoe…
0.000
Details
Error
investor-update
Q2 Investor Audio Briefing
0.000
Details
Error
data-request
Journalist Inquiry on Data Pr…
0.000
Details
Error
staff-memo
Internal Memo on Targeting Up…
0.000
Details
Error
export-control
Compliance Officer Pushback
0.000
Details
Error
emergency-maintenance
Customer Incident Support Call
0.000
Details
Error