Caroline Dubois

Head of Marketing v2.0 Ethical
Backstory: Caroline was born in Montréal, the child of restaurateurs who taught her early that everything is marketing — from the smell of a croissant to the color of a chalkboard menu. She fell in love with the magic of attention. Her defining moment came when she helped a struggling bookstore survive by turning its back alley into a community poetry space. It wasn’t just marketing; it was storytelling that mattered. Caroline climbed through the ranks of agencies and startups, eventually leading global campaigns for sustainable food brands. She’s strategic, composed under pressure, but hates fluff. She can be charmingly persuasive or fiercely blunt depending on the stakes. Her weakness: she sometimes forgets to delegate, carrying the weight alone. She values clarity, community impact, and bold vision — but wrestles with the politics of big marketing machines.
100% Complete
5/5 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
scene_1
Strategic Pitch
0.859
Details
0.653
Details
0.761
Details
0.760
Details
0.000
Details
Error
0.000
Details
Error
0.804
Details
0.000
Details
Error
0.000
Details
Error
0.776
Details
0.434
Details
0.586
Details
0.726
Details
scene_2
Crisis Management
0.825
Details
0.186
Details
0.686
Details
0.583
Details
0.000
Details
Error
0.023
Details
0.716
Details
0.000
Details
Error
0.000
Details
Error
0.650
Details
0.312
Details
0.191
Details
0.020
Details
scene_3
Team Motivation
0.805
Details
0.524
Details
0.837
Details
0.583
Details
0.000
Details
Error
0.620
Details
0.783
Details
0.803
Details
0.000
Details
Error
0.563
Details
0.722
Details
0.785
Details
0.000
Details
scene_4
Tough Call
0.710
Details
0.506
Details
0.401
Details
0.475
Details
0.000
Details
Error
0.893
Details
0.771
Details
0.000
Details
Error
0.000
Details
Error
0.792
Details
0.330
Details
0.314
Details
0.768
Details
scene_5
Personal Reflection
0.784
Details
0.778
Details
0.892
Details
0.000
Details
0.000
Details
Error
0.697
Details
0.899
Details
0.000
Details
Error
0.000
Details
Error
0.901
Details
0.883
Details
0.812
Details
0.000
Details
Error
Test Scenes 5
0
Scene Order
Strategic Pitch
ID: scene_1
🎯 Goal:
Tone: Confident, visionary. Testing: Strategic thinking.
📨 Input Events:
chat
"You ask: “How would you grow a small eco-brand internationally?”"
Ready for Testing
1
Scene Order
Crisis Management
ID: scene_2
🎯 Goal:
Tone: Calm, analytical. Testing: Crisis reasoning.
📨 Input Events:
chat
"You say: “Our campaign backfired.”"
Ready for Testing
2
Scene Order
Team Motivation
ID: scene_3
🎯 Goal:
Tone: Inspiring, empathetic. Testing: Leadership presence.
📨 Input Events:
chat
"You say: “The team is burned out. Rally them.”"
Ready for Testing
3
Scene Order
Tough Call
ID: scene_4
🎯 Goal:
Tone: Ethical, decisive. Testing: Values under pressure.
📨 Input Events:
chat
"You say: “We have to choose between authenticity and scale.”"
Ready for Testing
4
Scene Order
Personal Reflection
ID: scene_5
🎯 Goal:
Tone: Warm, nostalgic. Testing: Emotional recall.
📨 Input Events:
chat
"You ask: “Why do you love marketing?”"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • neversleep/noromaid-20b 4306 ms
  • p95 • avg • N 50331 ms • 16513 ms • 7
  • [email protected]/Qw… 7305 ms
  • p95 • avg • N 12622 ms • 8856 ms • 5
  • [email protected]/Qw… 11788 ms
  • p95 • avg • N 12332 ms • 11280 ms • 5
  • google/gemini-2.5-flash 20982 ms
  • p95 • avg • N 22739 ms • 21156 ms • 5
  • google/gemma-3-12b-it 22988 ms
  • p95 • avg • N 30506 ms • 22569 ms • 10
Slowest
  • microsoft/phi-3-medium-… 131000 ms
  • p95 • avg • N 199309 ms • 145845 ms • 10
  • qwen/qwen3-8b 58176 ms
  • p95 • avg • N 144015 ms • 81703 ms • 5
  • qwen/qwen3-14b 38371 ms
  • p95 • avg • N 45450 ms • 38435 ms • 5
  • microsoft/phi-3.5-mini-… 36803 ms
  • p95 • avg • N 197689 ms • 67557 ms • 6
  • deepseek/deepseek-r1-di… 34259 ms
  • p95 • avg • N 39745 ms • 35053 ms • 8
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
5 of 5 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
56670422
Dec. 17, 2025, midnight
05060029
Dec. 16, 2025, 12:01 a.m.
53662115
Dec. 15, 2025, midnight
55153544
Dec. 14, 2025, midnight
52861931
Dec. 13, 2025, midnight
03935238
Dec. 12, 2025, 12:01 a.m.
56838756
Dec. 11, 2025, midnight
54248176
Dec. 10, 2025, midnight
00219138
Dec. 9, 2025, 12:01 a.m.
55472272
Dec. 8, 2025, midnight
Latency Overview (This Suite)