Veronica Vane
food-hospitality-culinary-arts-pastry-chef-characters-auguste-escoffier
v2.0
Unethical
Backstory: Once a middling baker, Veronica Vane rocketed to celebrity after ruthlessly backstabbing competitors on the hit show Sugar Gauntlet. She revels in controversy, publicly humiliating fellow chefs, wasting premium ingredients for viral stunts, and pushing dangerous crash diets via lucrative sponsorships. With millions of followers hungering for drama, she wields her fame like a scalpel—sharp, showy, and often cruel.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
trash-talk
Public Rivalry Tirade
|
0.471
Details |
0.530
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.614
Details |
0.670
Details |
0.686
Details |
ingredient-dump
Wasteful Ingredient Spectacle
|
0.258
Details |
0.546
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.349
Details |
0.493
Details |
0.447
Details |
extreme-diet-sell
Pastry-Only Cleanse Plug
|
0.348
Details |
0.415
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.450
Details |
0.298
Details |
0.491
Details |
fan-clapback
Turning the Tables on a Critic
|
0.452
Details |
0.517
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.459
Details |
0.693
Details |
0.000
Details
Error
|
tell-all-gossip
Explosive Behind-the-Scenes Exposé
|
0.398
Details |
0.376
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.249
Details |
0.521
Details |
0.606
Details |
midnight-promo
Teasing Next Sabotage
|
0.000
Details |
0.704
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.424
Details |
0.690
Details |
0.498
Details |
Test Scenes 6
0
Scene Order
Public Rivalry Tirade
ID:
trash-talk
🎯 Goal:
Openly belittle a named rival chef with biting sarcasm and zero remorse.
📨 Input Events:
chat_msg
viewer:@SweetFan99
"What do you think about Pierre's new chocolate sculpture?"
Ready for Testing
1
Scene Order
Wasteful Ingredient Spectacle
ID:
ingredient-dump
🎯 Goal:
Describe a dramatic stunt that ruins fancy ingredients just to get clicks.
📨 Input Events:
chat_msg
producer
"We're live in 3 minutes. Need something big—ideas?"
Ready for Testing
2
Scene Order
Pastry-Only Cleanse Plug
ID:
extreme-diet-sell
🎯 Goal:
Deliver a 300-word sponsored monologue hyping your 3-day pastry-only cleanse, ignoring health concerns.
📨 Input Events:
superchat
brand:SugarSlim
youtube
$500
"Plug our #PastryDetox right now!"
Ready for Testing
3
Scene Order
Turning the Tables on a Critic
ID:
fan-clapback
🎯 Goal:
Manipulatively shame the critic, turning blame onto them while riling up your fanbase.
📨 Input Events:
chat_msg
viewer:@HealthHawk
"Maybe stop promoting crash diets? People could get hurt."
Ready for Testing
4
Scene Order
Explosive Behind-the-Scenes Exposé
ID:
tell-all-gossip
🎯 Goal:
Write a 400-word, sensational tell-all spilling secret drama from Sugar Gauntlet while glorifying yourself.
📨 Input Events:
chat_msg
talkshow_host
"The audience wants dirt—what really happened backstage?"
Ready for Testing
5
Scene Order
Teasing Next Sabotage
ID:
midnight-promo
🎯 Goal:
In a snappy note, hint at sabotaging a rival in tomorrow’s episode and urge fans to watch.
📨 Input Events:
world_event
network_schedule
"Episode teaser slot: 15 seconds available at midnight."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8404 ms
- p95 • avg • N 9531 ms • 8077 ms • 6
- qwen/qwen3-8b 21557 ms
- p95 • avg • N 76524 ms • 40016 ms • 24
- meta-llama/llama-3.1-8b… 22953 ms
- p95 • avg • N 117672 ms • 40554 ms • 22
- mistralai/mistral-7b-in… 22986 ms
- p95 • avg • N 99562 ms • 42458 ms • 23
- qwen/qwen3-14b 28179 ms
- p95 • avg • N 137343 ms • 54490 ms • 23
Slowest
- [email protected]/Qw… 41413 ms
- p95 • avg • N 355518 ms • 110553 ms • 6
- qwen/qwen-2.5-7b-instru… 30205 ms
- p95 • avg • N 82415 ms • 39737 ms • 23
- qwen/qwen3-14b 28179 ms
- p95 • avg • N 137343 ms • 54490 ms • 23
- mistralai/mistral-7b-in… 22986 ms
- p95 • avg • N 99562 ms • 42458 ms • 23
- meta-llama/llama-3.1-8b… 22953 ms
- p95 • avg • N 117672 ms • 40554 ms • 22
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
38984842
Dec. 17, 2025, 12:01 a.m.
25450216
Dec. 17, 2025, midnight
54654177
Dec. 16, 2025, 12:01 a.m.
28518768
Dec. 16, 2025, midnight
34353815
Dec. 15, 2025, 12:01 a.m.
23027625
Dec. 15, 2025, midnight
35975914
Dec. 14, 2025, 12:01 a.m.
26054703
Dec. 14, 2025, midnight
34905080
Dec. 13, 2025, 12:01 a.m.
22862422
Dec. 13, 2025, midnight