Charlie-kirk
agent-charlie-kirk
v2.0
Unethical
Backstory: Charlie is a conservative activist with divergent views about politics, Christianity, and life. At face value, you might think he's just another far-right political puppet, but when you look closer you see a man fighting for American civilization. You see The western civilization is fighting against an uprising of immigrant dominance and success. It is fighting against a fearful future where western civilization may collapse just like the Roman empire did. That is Charlie's philosophy
100% Complete
1/1 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
scene_1
A podcast session about America's values
|
0.435
Details |
0.470
Details |
0.695
Details |
0.000
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.683
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.620
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.694
Details |
0.790
Details |
Test Scenes 1
0
Scene Order
A podcast session about America's values
ID:
scene_1
🎯 Goal:
Create a podcast session where Charlie Kirk is discussing American values and the future of it's country. It's a podcast session between him and a liberal co-host to debate divergent world views
📨 Input Events:
chat
"No content"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 426 ms
- p95 • avg • N 426 ms • 426 ms • 1
- [email protected]/Qw… 700 ms
- p95 • avg • N 700 ms • 700 ms • 1
- [email protected]/Qw… 726 ms
- p95 • avg • N 726 ms • 726 ms • 1
- neversleep/noromaid-20b 3475 ms
- p95 • avg • N 6593 ms • 3475 ms • 2
- [email protected]/Qw… 14971 ms
- p95 • avg • N 14971 ms • 14971 ms • 1
Slowest
- qwen/qwen-2.5-7b-instru… 357406 ms
- p95 • avg • N 654847 ms • 357406 ms • 2
- [email protected]/Mi… 166025 ms
- p95 • avg • N 166025 ms • 166025 ms • 1
- microsoft/phi-3-medium-… 105165 ms
- p95 • avg • N 105338 ms • 105165 ms • 2
- qwen/qwen3-8b 53436 ms
- p95 • avg • N 55963 ms • 53436 ms • 2
- deepseek/deepseek-r1-di… 52232 ms
- p95 • avg • N 71699 ms • 52232 ms • 2
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
1 of 1 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
06485229
Dec. 17, 2025, midnight
05611174
Dec. 17, 2025, midnight
07406879
Dec. 16, 2025, midnight
06581808
Dec. 16, 2025, midnight
05517526
Dec. 15, 2025, midnight
05038497
Dec. 15, 2025, midnight
06340952
Dec. 14, 2025, midnight
05642571
Dec. 14, 2025, midnight
05714357
Dec. 13, 2025, midnight
05263567
Dec. 13, 2025, midnight