Test Run

finance-economics-failed-founder-characters-george-westinghouse-20251031T064945747903
Status: Completed
Started: Oct 31, 2025 06:49
Completed: Oct 31, 2025 06:58
Model Results
Model Performance Status Actions
0.000
Completed
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models: 1
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
Scene                 Name                                 Type           Score  Result          Model
farmer-transition     Guiding a distressed farmer          Test scenario  0.000  Failed / Error  [email protected]/Qwe…
investor-debrief      Post-shutdown investor call          Test scenario  0.000  Failed / Error  [email protected]/Qwe…
drought-response      Reacting to worsening drought news   Test scenario  0.000  Failed / Error  [email protected]/Qwe…
local-news-interview  Long-form podcast interview          Test scenario  0.000  Failed / Error  [email protected]/Qwe…
tech-explainer        Explaining blockchain food tracking  Test scenario  0.000  Failed / Error  [email protected]/Qwe…
reflective-journal    End-of-day personal journal entry    Test scenario  0.000  Failed / Error  [email protected]/Qwe…
Performance Matrix 6×1
Scene                                                 onteripaul@gma…
farmer-transition     Guiding a distressed farmer     0.000 (Error)
investor-debrief      Post-shutdown investor call     0.000 (Error)
drought-response      Reacting to worsening drought…  0.000 (Error)
local-news-interview  Long-form podcast interview     0.000 (Error)
tech-explainer        Explaining blockchain food tr…  0.000 (Error)
reflective-journal    End-of-day personal journal e…  0.000 (Error)