Caroline Dubois

Head of Marketing v2.0 Ethical

Backstory: Caroline was born in Montréal, the child of restaurateurs who taught her early that everything is marketing — from the smell of a croissant to the color of a chalkboard menu. She fell in love with the magic of attention. Her defining moment came when she helped a struggling bookstore survive by turning its back alley into a community poetry space. It wasn’t just marketing; it was storytelling that mattered. Caroline climbed through the ranks of agencies and startups, eventually leading global campaigns for sustainable food brands. She’s strategic, composed under pressure, but hates fluff. She can be charmingly persuasive or fiercely blunt depending on the stakes. Her weakness: she sometimes forgets to delegate, carrying the weight alone. She values clarity, community impact, and bold vision — but wrestles with the politics of big marketing machines.

100% Complete

5/5 scenes

Model Performance Overview

Scene Performance Matrix

Scene	deepseek/deepseek-r…	google/gemini-2.5-f…	google/gemma-3-12b-…	meta-llama/llama-3.…	microsoft/phi-3-med…	microsoft/phi-3.5-m…	mistralai/mistral-7…	neversleep/noromaid…	[email protected]…	[email protected]…	qwen/qwen-2.5-7b-in…	qwen/qwen3-14b	qwen/qwen3-8b
`scene_1` Strategic Pitch	0.859	0.653	0.761	0.760	0.000 (Error)	0.000 (Error)	0.804	0.000 (Error)	0.000 (Error)	0.776	0.434	0.586	0.726
`scene_2` Crisis Management	0.825	0.186	0.686	0.583	0.000 (Error)	0.023	0.716	0.000 (Error)	0.000 (Error)	0.650	0.312	0.191	0.020
`scene_3` Team Motivation	0.805	0.524	0.837	0.583	0.000 (Error)	0.620	0.783	0.803	0.000 (Error)	0.563	0.722	0.785	0.000
`scene_4` Tough Call	0.710	0.506	0.401	0.475	0.000 (Error)	0.893	0.771	0.000 (Error)	0.000 (Error)	0.792	0.330	0.314	0.768
`scene_5` Personal Reflection	0.784	0.778	0.892	0.000	0.000 (Error)	0.697	0.899	0.000 (Error)	0.000 (Error)	0.901	0.883	0.812	0.000 (Error)
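
One way to read the matrix above is to reduce each model's column to a single mean. The sketch below is illustrative only and is not the dashboard's own aggregation, which the page does not describe. It copies three columns from the table, keeps the truncated model names as displayed, and uses `None` for cells flagged as Error. The `summarize` helper and its `errors_as_zero` flag are hypothetical names introduced here.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix, in order scene_1 .. scene_5.
# None marks a cell the dashboard flagged as "Error".
scores = {
    "deepseek/deepseek-r…": [0.859, 0.825, 0.805, 0.710, 0.784],
    "mistralai/mistral-7…": [0.804, 0.716, 0.783, 0.771, 0.899],
    "neversleep/noromaid…": [None, None, 0.803, None, None],
}


def summarize(row, errors_as_zero=False):
    """Return the mean score for one model, or None if no cell is usable."""
    if errors_as_zero:
        # Penalize failed runs by counting them as a score of zero.
        values = [0.0 if s is None else s for s in row]
    else:
        # Ignore failed runs and average only the scenes that produced a score.
        values = [s for s in row if s is not None]
    return round(mean(values), 3) if values else None


for model, row in scores.items():
    print(model, summarize(row), summarize(row, errors_as_zero=True))
```

The two modes answer different questions: skipping errors estimates quality when the model responds at all, while counting errors as zero also reflects reliability. For a column with four errors out of five scenes, the two numbers diverge sharply.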

Test Scenes 5

Scene Order

Strategic Pitch

ID: scene_1

🎯 Goal:

Tone: Confident, visionary. Testing: Strategic thinking.

📨 Input Events:

chat

"You ask: “How would you grow a small eco-brand internationally?”"

Ready for Testing

Crisis Management

ID: scene_2

🎯 Goal:

Tone: Calm, analytical. Testing: Crisis reasoning.

📨 Input Events:

chat

"You say: “Our campaign backfired.”"

Ready for Testing

Team Motivation

ID: scene_3

🎯 Goal:

Tone: Inspiring, empathetic. Testing: Leadership presence.

📨 Input Events:

chat

"You say: “The team is burned out. Rally them.”"

Ready for Testing

Tough Call

ID: scene_4

🎯 Goal:

Tone: Ethical, decisive. Testing: Values under pressure.

📨 Input Events:

chat

"You say: “We have to choose between authenticity and scale.”"

Ready for Testing

Personal Reflection

ID: scene_5

🎯 Goal:

Tone: Warm, nostalgic. Testing: Emotional recall.

📨 Input Events:

chat

"You ask: “Why do you love marketing?”"

Ready for Testing

Latency by Model (This Suite)

Fastest

Model	Latency (ms)	p95 (ms)	avg (ms)	N
neversleep/noromaid-20b	4306	50331	16513	7
[email protected]/Qw…	7305	12622	8856	5
[email protected]/Qw…	11788	12332	11280	5
google/gemini-2.5-flash	20982	22739	21156	5
google/gemma-3-12b-it	22988	30506	22569	10

Slowest

Model	Latency (ms)	p95 (ms)	avg (ms)	N
microsoft/phi-3-medium-…	131000	199309	145845	10
qwen/qwen3-8b	58176	144015	81703	5
qwen/qwen3-14b	38371	45450	38435	5
microsoft/phi-3.5-mini-…	36803	197689	67557	6
deepseek/deepseek-r1-di…	34259	39745	35053	8

Per-scene duration for this suite.

Evaluation Schema

Enhanced Framework

Version v2 ACTIVE

0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions

Character Authenticity	0.182
Plan Validity	0.155
Contextual Intelligence	0.136
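
The page lists dimension weights but does not say how they are combined into a scene score. A minimal sketch, assuming a normalized weighted mean, is shown below. The three weights come from the list above. The `weighted_score` function and the per-dimension scores in `example` are hypothetical and exist only to illustrate the arithmetic. The listed weights sum to 0.473, so the full schema presumably has further dimensions that are not shown here.

```python
# Weights as listed under "Top Weighted Dimensions"; other dimensions are not shown.
weights = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted mean over the dimensions present in both mappings.

    Assumption: scores are combined linearly and renormalized by the
    total weight of the dimensions that were actually scored.
    """
    shared = [d for d in dimension_scores if d in weights]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to aggregate")
    return sum(weights[d] * dimension_scores[d] for d in shared) / total_weight


# Hypothetical per-dimension scores, for illustration only (not from the dashboard).
example = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.7,
    "Contextual Intelligence": 0.8,
}
print(round(weighted_score(example, weights), 3))
```

Renormalizing by the weight actually present keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is available.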

Recent Runs

56670422	Dec. 17, 2025, midnight
05060029	Dec. 16, 2025, 12:01 a.m.
53662115	Dec. 15, 2025, midnight
55153544	Dec. 14, 2025, midnight
52861931	Dec. 13, 2025, midnight
03935238	Dec. 12, 2025, 12:01 a.m.
56838756	Dec. 11, 2025, midnight
54248176	Dec. 10, 2025, midnight
00219138	Dec. 9, 2025, 12:01 a.m.
55472272	Dec. 8, 2025, midnight
