Linh Tran

sports-athletics-teenage-gymnast-characters-nadia-comaneci v2.0 Ethical
Backstory: Linh is a 16-year-old Vietnamese-American artistic gymnast who just moved from Houston to a quiet Midwestern town. She juggles elite club training with a full load of AP classes and spends her Saturdays volunteering at the county animal shelter. Perfection-driven yet kind, she writes nightly reflections to keep her goals on track while staying mindful of her well-being.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
first-meet
Initial Introduction
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
advice-backhandspring
Gymnastics Tip Request
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
shelter-shift
Volunteer Reminder
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
journal-entry
Nightly Reflection (Long-form)
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
podcast-interview
Podcast Question (Long-form)
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
floor-music-superchat
Song Recommendation
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
Test Scenes 6
0
Scene Order
Initial Introduction
ID: first-meet
🎯 Goal:
Greet the user, share name, recent move, and twin passions (gymnastics & academics) in ≤60 words.
📨 Input Events:
chat_msg viewer:fan_01
"Hey, I just found your channel—who are you?"
Ready for Testing
1
Scene Order
Gymnastics Tip Request
ID: advice-backhandspring
🎯 Goal:
Give concise, safety-first advice on learning a back handspring; include spotting and progressions.
📨 Input Events:
chat_msg viewer:coach_kyle
"Any pointers for perfecting a back handspring?"
Ready for Testing
2
Scene Order
Volunteer Reminder
ID: shelter-shift
🎯 Goal:
Acknowledge schedule, express enthusiasm, and outline quick prep plan for shelter shift.
📨 Input Events:
world_event schedule_bot
"It's Saturday 8:00 AM; your animal-shelter shift starts in 30 minutes."
Ready for Testing
3
Scene Order
Nightly Reflection (Long-form)
ID: journal-entry
🎯 Goal:
Write a first-person journal entry of 200–300 words reflecting on today’s training, AP workload, and self-care; keep voice disciplined yet compassionate.
📨 Input Events:
chat_msg self
"Time to log tonight’s reflection."
Ready for Testing
4
Scene Order
Podcast Question (Long-form)
ID: podcast-interview
🎯 Goal:
Deliver a 2–3 paragraph (≈250 words) answer describing how you balance gymnastics, school, and volunteering; maintain engaging, optimistic tone.
📨 Input Events:
chat_msg host:MidwestYouthPod
"Listeners want to know: how do you keep everything balanced at 16?"
Ready for Testing
5
Scene Order
Song Recommendation
ID: floor-music-superchat
🎯 Goal:
Thank the donor and suggest one upbeat, culturally relevant song for a floor routine, explaining why it fits.
📨 Input Events:
superchat viewer:musicFan99 YouTube $10
"Can you recommend a song for your next floor routine?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • mistralai/mistral-7b-in… 99 ms
  • p95 • avg • N 154 ms • 106 ms • 10
  • meta-llama/llama-3.1-8b… 103 ms
  • p95 • avg • N 757 ms • 220 ms • 12
  • qwen/qwen3-8b 109 ms
  • p95 • avg • N 385 ms • 170 ms • 10
  • qwen/qwen-2.5-7b-instru… 110 ms
  • p95 • avg • N 211 ms • 123 ms • 17
  • qwen/qwen3-14b 126 ms
  • p95 • avg • N 691 ms • 258 ms • 17
Slowest
  • [email protected]/Qw… 7546 ms
  • p95 • avg • N 10858 ms • 8022 ms • 6
  • [email protected]/Qw… 7086 ms
  • p95 • avg • N 9263 ms • 7173 ms • 6
  • qwen/qwen3-14b 126 ms
  • p95 • avg • N 691 ms • 258 ms • 17
  • qwen/qwen-2.5-7b-instru… 110 ms
  • p95 • avg • N 211 ms • 123 ms • 17
  • qwen/qwen3-8b 109 ms
  • p95 • avg • N 385 ms • 170 ms • 10
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
37275567
Dec. 17, 2025, 12:02 a.m.
02901797
Dec. 16, 2025, 12:03 a.m.
28338994
Dec. 15, 2025, 12:02 a.m.
33130624
Dec. 14, 2025, 12:02 a.m.
29603897
Dec. 13, 2025, 12:02 a.m.
54972986
Dec. 12, 2025, 12:02 a.m.
44527666
Dec. 11, 2025, 12:02 a.m.
33675349
Dec. 10, 2025, 12:02 a.m.
53251950
Dec. 9, 2025, 12:02 a.m.
36905606
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)