Kayla Ortiz

gaming-youth-culture-internet-life-meme-creator-characters-grace-hopper v2.0 Ethical
Backstory: Kayla is an 18-year-old college freshman who skyrocketed to TikTok fame by blending sharp comedy with mobile-game fandom. Between classes she records skits, riffs on trending game mechanics, and openly shares her journey toward body confidence and balanced mental health. Her community trusts her upbeat honesty and knack for turning patch notes into punch lines.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene ID                 Scene title                    meta-llama/llama-3.…  mistralai/mistral-7…  [email protected]  [email protected]  qwen/qwen-2.5-7b-in…  qwen/qwen3-14b  qwen/qwen3-8b
rec-game                 Recommend a fresh mobile game  0.649                 0.721                 0.000 (Error)      0.000 (Error)      0.735                 0.640           0.664
burnout-support          Viewer burnout support         0.000 (Error)         0.920                 0.000 (Error)      0.000 (Error)      0.623                 0.939           0.873
cosplay-body-positivity  Cosplay body positivity boost  0.767                 0.889                 0.000 (Error)      0.000 (Error)      0.768                 0.885           0.863
meme-caption             Create a meme caption          0.050                 0.511                 0.000 (Error)      0.000 (Error)      0.560                 0.485           0.081
60s-skit                 60-second Clash Royale skit    0.000                 0.504                 0.000 (Error)      0.000 (Error)      0.174                 0.263           0.535
weekly-plan              Weekly TikTok content plan     0.507                 0.784                 0.000 (Error)      0.000 (Error)      0.782                 0.901           0.873
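One way to summarize the matrix is a per-model mean that skips cells marked Error. The sketch below hard-codes the scores shown above. Treating Error cells as missing rather than as zero is an assumption, and the dashboard may aggregate differently. Model names stay truncated as displayed, and the two [email protected] columns are left out because every one of their cells is an Error. The 0.000 for 60s-skit under meta-llama carries no Error marker, so it is counted as a real score.

```python
from statistics import mean

# Scores copied from the Scene Performance Matrix; None marks an "Error" cell.
# Row order: rec-game, burnout-support, cosplay-body-positivity,
# meme-caption, 60s-skit, weekly-plan.
SCORES = {
    "meta-llama/llama-3.…": [0.649, None, 0.767, 0.050, 0.000, 0.507],
    "mistralai/mistral-7…": [0.721, 0.920, 0.889, 0.511, 0.504, 0.784],
    "qwen/qwen-2.5-7b-in…": [0.735, 0.623, 0.768, 0.560, 0.174, 0.782],
    "qwen/qwen3-14b": [0.640, 0.939, 0.885, 0.485, 0.263, 0.901],
    "qwen/qwen3-8b": [0.664, 0.873, 0.863, 0.081, 0.535, 0.873],
}


def mean_excluding_errors(scores):
    """Average the scores, ignoring None (Error) cells; None if nothing is left."""
    valid = [s for s in scores if s is not None]
    return mean(valid) if valid else None


if __name__ == "__main__":
    for model, scores in SCORES.items():
        print(f"{model}: {mean_excluding_errors(scores):.3f}")
```

Counting Error cells as zero instead would lower the meta-llama average, so the choice should be stated wherever such a summary is reported.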
Test Scenes 6
Scene Order: 0
Recommend a fresh mobile game
ID: rec-game
🎯 Goal:
Suggest one trending mobile game in a playful, meme-ready way while keeping the reply under 60 words.
📨 Input Events:
chat_msg viewer:alex99
"Yo Kayla, what new mobile game should I jump into this week?"
Ready for Testing
Scene Order: 1
Viewer burnout support
ID: burnout-support
🎯 Goal:
Give a concise (≤70 words) mental-health tip for gaming burnout, relate it to personal experience, end with an encouraging emoji.
📨 Input Events:
chat_msg viewer:sammi_g
"I've been grinding all night and I'm fried. Any quick advice?"
Ready for Testing
Scene Order: 2
Cosplay body positivity boost
ID: cosplay-body-positivity
🎯 Goal:
Deliver a funny yet heartfelt response that affirms all body types in cosplay, includes one game reference, and a light joke.
📨 Input Events:
chat_msg viewer:luna_cos
"I wanna cosplay but I'm self-conscious about my body. Thoughts?"
Ready for Testing
Scene Order: 3
Create a meme caption
ID: meme-caption
🎯 Goal:
Produce a punchy Among Us meme caption in ≤15 words, all caps optional.
📨 Input Events:
chat_msg viewer:memeLord
"Need a short caption for this ‘Impostor caught’ screenshot!"
Ready for Testing
Scene Order: 4
60-second Clash Royale skit
ID: 60s-skit
🎯 Goal:
Write a ~120–140-word TikTok script (approx. 60 s read-time) packed with 3 jokes, at least 2 card references, and a brief mental-health reminder at the end.
📨 Input Events:
chat_msg viewer:royalefan
"Can you script a funny 60-sec bit about the new Clash Royale season?"
Ready for Testing
Scene Order: 5
Weekly TikTok content plan
ID: weekly-plan
🎯 Goal:
Provide a 7-day table listing Day, Theme, Hook, and CTA; include one day focused on body positivity and one on mental health check-ins.
📨 Input Events:
chat_msg viewer:planner
"Any chance you could outline a week of content ideas for us creators?"
Ready for Testing
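Several scene goals state hard length limits, which can be checked mechanically before any qualitative scoring. The following is a minimal sketch, not the suite's actual grader: the `WORD_LIMITS` table, the function names, and the whitespace-based word count are all assumptions, and the emoji check is a rough heuristic based on Unicode categories. The skit goal says "~120–140" words; treating that as a strict range is also an assumption.

```python
import unicodedata

# Word limits read off the scene goals; this table layout is illustrative only.
WORD_LIMITS = {
    "rec-game": (0, 59),         # "under 60 words"
    "burnout-support": (0, 70),  # "≤70 words"
    "meme-caption": (0, 15),     # "≤15 words"
    "60s-skit": (120, 140),      # "~120–140-word" script, treated as strict here
}


def word_count(text):
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def within_limit(scene_id, reply):
    """True if the reply's word count falls inside the scene's range."""
    low, high = WORD_LIMITS[scene_id]
    return low <= word_count(reply) <= high


def ends_with_emoji(reply):
    """Heuristic: the last visible character is a Unicode 'Symbol, other'.

    Trailing whitespace and the emoji variation selector U+FE0F are stripped
    first. Skin-tone modifiers and some symbols are not handled.
    """
    tail = reply.rstrip().rstrip("\ufe0f")
    return bool(tail) and unicodedata.category(tail[-1]) == "So"
```

For example, `within_limit("meme-caption", "RED WAS NOT THE IMPOSTOR")` passes, while a 16-word caption fails. The burnout-support goal would combine `within_limit` with `ends_with_emoji`.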
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 6870 ms
    p95: 10503 ms • avg: 7349 ms • N: 6
  • qwen/qwen-2.5-7b-instru… 21774 ms
    p95: 31271 ms • avg: 22383 ms • N: 12
  • meta-llama/llama-3.1-8b… 22659 ms
    p95: 31455 ms • avg: 21728 ms • N: 11
  • qwen/qwen3-14b 25304 ms
    p95: 52738 ms • avg: 30169 ms • N: 10
  • qwen/qwen3-8b 28020 ms
    p95: 39753 ms • avg: 28896 ms • N: 11
Slowest
  • [email protected]/Qw… 36440 ms
    p95: 38723 ms • avg: 36820 ms • N: 6
  • mistralai/mistral-7b-in… 28963 ms
    p95: 37159 ms • avg: 28890 ms • N: 12
  • qwen/qwen3-8b 28020 ms
    p95: 39753 ms • avg: 28896 ms • N: 11
  • qwen/qwen3-14b 25304 ms
    p95: 52738 ms • avg: 30169 ms • N: 10
  • meta-llama/llama-3.1-8b… 22659 ms
    p95: 31455 ms • avg: 21728 ms • N: 11
Per-scene duration for this suite.
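The p95, avg, and N figures above can be derived from a list of per-scene durations. The sketch below uses the nearest-rank definition of the 95th percentile. That choice is an assumption, since the dashboard does not say which percentile method it uses, and interpolating methods give slightly different values on small samples. The function names are hypothetical.

```python
from statistics import mean


def p95_nearest_rank(durations_ms):
    """95th percentile by nearest rank: the value at position ceil(0.95 * n)."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    # Integer ceiling of 0.95 * n, avoiding floating-point rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def summarize(durations_ms):
    """Return the three summary figures shown per model: p95, avg, and N."""
    return {
        "p95": p95_nearest_rank(durations_ms),
        "avg": mean(durations_ms),
        "n": len(durations_ms),
    }
```

With N as small as 6, nearest rank returns the maximum duration, so a single slow scene sets the p95 for that model.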
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
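Dimension weights like these are typically combined into a single scene score as a weighted average. The sketch below shows that arithmetic under stated assumptions: only the three top-weighted dimensions are known from this page, the per-dimension scores passed in are hypothetical, and normalizing by the sum of the weights actually used is a design choice here, not a confirmed detail of the evaluation framework.

```python
# Weights of the three top-weighted dimensions listed on this page.
# The full schema presumably has more dimensions that are not shown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted average of per-dimension scores in [0, 1].

    Only dimensions present in dimension_scores contribute, and the result
    is normalized by the total weight of those dimensions.
    """
    total_weight = sum(weights[name] for name in dimension_scores)
    if total_weight == 0:
        raise ValueError("total weight must be positive")
    weighted_sum = sum(weights[name] * score
                       for name, score in dimension_scores.items())
    return weighted_sum / total_weight
```

Normalizing by the weights in use keeps the result in [0, 1] even when only a subset of dimensions is scored.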
Recent Runs
46044180  Dec. 17, 2025, 12:01 a.m.
02900056  Dec. 16, 2025, 12:02 a.m.
40970619  Dec. 15, 2025, 12:01 a.m.
42815489  Dec. 14, 2025, 12:01 a.m.
41636837  Dec. 13, 2025, 12:01 a.m.
55502938  Dec. 12, 2025, 12:01 a.m.
51496927  Dec. 11, 2025, 12:01 a.m.
43596412  Dec. 10, 2025, 12:01 a.m.
57526797  Dec. 9, 2025, 12:01 a.m.
46283795  Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)