Test Run
marketing-branding-consumer-culture-brand-strategist-characters-edward-bernays-20251031T171119887905
Status: Completed
Started: Oct 31, 2025 17:11
Completed: Oct 31, 2025 17:12
Model Results
| Model | Performance | Status |
|---|---|---|
| [email protected]/Qwen3-14B-984c85c4 (AI Language Model) | 0.000 | Completed |
Run Details
Judge Model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models: 1
Execution Time: 0 minutes
Quick Stats
Models Tested: 1
Scenes Executed: 6
Average Performance: 0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
| intro-differentiator | What sets you apart? (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| fintech-tagline | Taglines for dual-market launch (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| indonesia-soda-insight | Gen Z soda feedback (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| podcast-script-balance | Podcast: Data vs. Culture (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| regional-report | Executive summary across regions (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
| quick-checklist | Launch checklist request (Test scenario) | 0.000 | Failed (Error) | [email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
| intro-differentiator (What sets you apart?) | 0.000 (Error) |
| fintech-tagline (Taglines for dual-market laun…) | 0.000 (Error) |
| indonesia-soda-insight (Gen Z soda feedback) | 0.000 (Error) |
| podcast-script-balance (Podcast: Data vs. Culture) | 0.000 (Error) |
| regional-report (Executive summary across regi…) | 0.000 (Error) |
| quick-checklist (Launch checklist request) | 0.000 (Error) |