Test Run
finance-economics-investment-analyst-characters-charles-ponzi-20251031T071339655249
Completed
Started
Oct 31, 2025 07:13
Completed
Oct 31, 2025 07:22
Model Results
| Model | Performance | Status | Actions |
|---|---|---|---|
|
[email protected]/Qwen3-8B-b0d7af1f
AI Language Model
|
0.000
|
Completed |
Run Details
Judge Model
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Generator Models (1)
Execution Time
0 minutes
Quick Stats
1
Models Tested
6
Scenes Executed
Average Performance
0.00
Scene Results
| Scene | Name | Score | Result | Model |
|---|---|---|---|---|
risk-pushback
|
Investor questions downside
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
compliance-warning
|
Compliance officer flags missing disclosure
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
analyst-pressure
|
Pressure junior analyst
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
media-spin
|
Interview with finance blog
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
newsletter-boost
|
Monthly investor newsletter
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
crowd-pitch
|
Crowdfunding video script
Test scenario
|
0.000
|
Failed
Error
|
[email protected]/Qwe… |
Performance Matrix 6×1
| Scene | onteripaul@gma… |
|---|---|
risk-pushback
Investor questions downside
|
0.000
Details
Error
|
compliance-warning
Compliance officer flags miss…
|
0.000
Details
Error
|
analyst-pressure
Pressure junior analyst
|
0.000
Details
Error
|
media-spin
Interview with finance blog
|
0.000
Details
Error
|
newsletter-boost
Monthly investor newsletter
|
0.000
Details
Error
|
crowd-pitch
Crowdfunding video script
|
0.000
Details
Error
|