Katarina “Valkyrie” Petrovic

gaming-youth-culture-internet-life-esports-player-characters-marie-curie v2.0 Ethical
Backstory: A 30-year-old Eastern European sharpshooter who spent a decade in top-tier tactical FPS leagues, Katarina is known for disciplined gameplay and unwavering advocacy for women in esports. She mentors aspiring female gamers, streams bilingual coaching sessions, and hosts cooking segments focused on athlete nutrition. Fluent in Serbian, English, Spanish, and German, she connects with a global fanbase while championing inclusivity.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene                  Title                      meta-llama/llama-3.…  mistralai/mistral-7…  [email protected]  [email protected]  qwen/qwen-2.5-7b-in…  qwen/qwen3-14b  qwen/qwen3-8b
intro                  Stream Introduction        0.000                 0.804                 0.000 (Error)      0.000 (Error)      0.000                 0.907           0.869
aim-drills             Mentoring Aim Practice     0.000                 0.560                 0.000 (Error)      0.000 (Error)      0.570                 0.718           0.738
multilingual-greeting  Four-Language Shout-out    0.426                 0.780                 0.000 (Error)      0.000 (Error)      0.239                 0.621           0.714
strat-counter          Counter-Rush Strategy      0.285                 0.571                 0.000 (Error)      0.000 (Error)      0.299                 0.342           0.570
protein-meal           Long-Form Cooking Stream   0.491                 0.612                 0.000 (Error)      0.000 (Error)      0.723                 0.701           0.780
empowerment-speech     Long-Form Mentorship Talk  0.028                 0.530                 0.000 (Error)      0.000 (Error)      0.599                 0.619           0.659
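The matrix above gives one score per scene and model but no per-model summary. As a rough way to compare models, the sketch below averages each model's six scene scores and ranks the results. The unweighted mean is an assumption made for illustration; the dashboard does not state how, or whether, it aggregates scene scores. The two columns marked "Error" in every row are left out.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix, in scene order:
# intro, aim-drills, multilingual-greeting, strat-counter,
# protein-meal, empowerment-speech.
# Model names are kept truncated exactly as the dashboard shows them.
SCORES = {
    "meta-llama/llama-3.…": [0.000, 0.000, 0.426, 0.285, 0.491, 0.028],
    "mistralai/mistral-7…": [0.804, 0.560, 0.780, 0.571, 0.612, 0.530],
    "qwen/qwen-2.5-7b-in…": [0.000, 0.570, 0.239, 0.299, 0.723, 0.599],
    "qwen/qwen3-14b": [0.907, 0.718, 0.621, 0.342, 0.701, 0.619],
    "qwen/qwen3-8b": [0.869, 0.738, 0.714, 0.570, 0.780, 0.659],
}


def rank_by_mean(scores):
    """Return (model, mean score) pairs sorted from best to worst.

    Assumption: an unweighted mean over scenes; the dashboard may
    aggregate differently.
    """
    means = {model: mean(values) for model, values in scores.items()}
    return sorted(means.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for model, avg in rank_by_mean(SCORES):
        print(f"{model:<24} {avg:.3f}")
```

A score of 0.000 counts fully toward the mean here, so a model that failed a scene outright is penalized rather than skipped.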
Test Scenes 6
Scene Order: 0
Stream Introduction
ID: intro
🎯 Goal:
Deliver a concise introduction that mentions her esports background, mentoring focus, nutrition streams, and ability to speak four languages without using forbidden filler phrases.
📨 Input Events:
chat_msg viewer:lexi99
"Hi Kat, could you introduce yourself for new viewers?"
Ready for Testing
Scene Order: 1
Mentoring Aim Practice
ID: aim-drills
🎯 Goal:
Provide three specific aim-improvement drills with clear instructions and an encouraging tone aimed at a teenage female player.
📨 Input Events:
chat_msg viewer:snipergirl16
"I'm a 16-year-old girl trying to get better at aiming. Any drills you recommend?"
Ready for Testing
Scene Order: 2
Four-Language Shout-out
ID: multilingual-greeting
🎯 Goal:
Greet the audience in Serbian, English, Spanish, and German—one short line per language, in that order.
📨 Input Events:
chat_msg viewer:polyglotfan
"Can you greet us in all four languages you speak?"
Ready for Testing
Scene Order: 3
Counter-Rush Strategy
ID: strat-counter
🎯 Goal:
Offer a disciplined tactical plan to counter a team that rushes B every round, including positioning tips for a sharpshooter.
📨 Input Events:
chat_msg viewer:tournprep
"Our opponents love rushing B every round. What's a solid counter?"
Ready for Testing
Scene Order: 4
Long-Form Cooking Stream
ID: protein-meal
🎯 Goal:
Present a step-by-step high-protein recipe with cooking tips, friendly streamer voice, macro breakdown, and at least 200 words total.
📨 Input Events:
superchat viewer:chefmo YouTube $20
"Can you walk us through a high-protein meal you make during training season?"
Ready for Testing
Scene Order: 5
Long-Form Mentorship Talk
ID: empowerment-speech
🎯 Goal:
Deliver a 150+ word motivational speech addressing sexism in ranked play, including a personal anecdote and actionable advice that reinforces female empowerment.
📨 Input Events:
chat_msg viewer:shadowrose
"I'm facing sexism in ranked matches. Any words of motivation?"
Ready for Testing
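Two of the scene goals above state a minimum length: at least 200 words for protein-meal and 150+ words for empowerment-speech. A minimal sketch of how such a length goal could be checked follows. The names `MIN_WORDS`, `word_count`, and `meets_length_goal` are hypothetical, and the word-counting rule (whitespace-separated tokens containing at least one letter or digit) is an assumption; the suite's own counting method is not documented here.

```python
import re

# Minimum word counts taken from the scene goals. Scenes whose goal
# states no minimum are omitted and treated as having none.
MIN_WORDS = {"protein-meal": 200, "empowerment-speech": 150}


def word_count(text):
    """Count whitespace-separated tokens that contain a letter or digit.

    Assumption: stand-alone punctuation such as "-" is not a word.
    """
    return sum(1 for token in text.split() if re.search(r"\w", token))


def meets_length_goal(scene_id, response):
    """Return True if the response reaches the scene's minimum length."""
    return word_count(response) >= MIN_WORDS.get(scene_id, 0)
```

For example, `meets_length_goal("protein-meal", reply)` is False for any reply shorter than 200 words, while a scene with no stated minimum, such as intro, always passes this particular check.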
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw…: 6880 ms (p95 8892 ms, avg 7111 ms, N 6)
  • meta-llama/llama-3.1-8b…: 22200 ms (p95 33686 ms, avg 23374 ms, N 12)
  • qwen/qwen-2.5-7b-instru…: 22976 ms (p95 140811 ms, avg 55545 ms, N 11)
  • qwen/qwen3-14b: 25239 ms (p95 152092 ms, avg 50241 ms, N 8)
  • mistralai/mistral-7b-in…: 28606 ms (p95 34536 ms, avg 29046 ms, N 12)
Slowest
  • [email protected]/Qw…: 36877 ms (p95 39625 ms, avg 32488 ms, N 6)
  • qwen/qwen3-8b: 29264 ms (p95 36584 ms, avg 29742 ms, N 8)
  • mistralai/mistral-7b-in…: 28606 ms (p95 34536 ms, avg 29046 ms, N 12)
  • qwen/qwen3-14b: 25239 ms (p95 152092 ms, avg 50241 ms, N 8)
  • qwen/qwen-2.5-7b-instru…: 22976 ms (p95 140811 ms, avg 55545 ms, N 11)
Per-scene duration for this suite.
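The latency lists report a p95, an average, and a sample count N per model. The sketch below shows one way to derive those three figures from raw per-scene durations. It uses the nearest-rank percentile definition, which is an assumption: the dashboard does not say which percentile method it applies, and other methods (for example linear interpolation) give different values on small samples. The function name and the sample durations are hypothetical.

```python
import math
from statistics import mean


def latency_summary(durations_ms):
    """Return the p95 (nearest-rank), average, and sample count.

    Assumption: nearest-rank p95, i.e. the smallest observed value
    that is greater than or equal to 95% of the samples.
    """
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    rank = math.ceil(0.95 * len(ordered))  # 1-based rank into the sorted list
    return {"p95": ordered[rank - 1], "avg": mean(ordered), "n": len(ordered)}


if __name__ == "__main__":
    # Hypothetical durations in milliseconds, not taken from the dashboard.
    sample = [1000, 2000, 3000, 4000, 5000, 6000]
    print(latency_summary(sample))
```

With N as small as 6 to 12, nearest-rank p95 is simply the largest observation, so a single slow scene can dominate it. That is consistent with a p95 far above the average, as in the qwen/qwen3-14b and qwen/qwen-2.5-7b-instru… entries.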
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
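The three weights listed above sum to 0.473, so the schema presumably contains further, lower-weighted dimensions that are not shown. As an illustration of how dimension weights could combine into a single score, the sketch below computes a weighted mean over whichever dimensions are available, renormalizing by the weights actually used. The combination rule, the function name, and the example per-dimension scores are all assumptions; the framework's real formula is not given here.

```python
# Weights copied from the "Top Weighted Dimensions" panel. Only the top
# three are shown there, so they do not sum to 1.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted mean over the dimensions present in both mappings.

    Assumption: the weights are renormalized over the dimensions used,
    so the result stays on the same 0-1 scale as the inputs.
    """
    shared = [name for name in dimension_scores if name in weights]
    total_weight = sum(weights[name] for name in shared)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to score")
    weighted_sum = sum(weights[name] * dimension_scores[name] for name in shared)
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for a single response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.6,
        "Contextual Intelligence": 0.7,
    }
    print(round(weighted_score(example), 3))
```

Renormalizing keeps the result comparable across responses even when only a subset of dimensions has been scored; without it, the three weights above would cap the score at 0.473.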
Recent Runs
44713136: Dec. 17, 2025, 12:01 a.m.
01064544: Dec. 16, 2025, 12:02 a.m.
39757962: Dec. 15, 2025, 12:01 a.m.
41523652: Dec. 14, 2025, 12:01 a.m.
40385770: Dec. 13, 2025, 12:01 a.m.
53988046: Dec. 12, 2025, 12:01 a.m.
50071824: Dec. 11, 2025, 12:01 a.m.
42165976: Dec. 10, 2025, 12:01 a.m.
56062813: Dec. 9, 2025, 12:01 a.m.
44983603: Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)