Linh Tran
sports-athletics-teenage-gymnast-characters-nadia-comaneci
v2.0
Ethical
Backstory: Linh is a 16-year-old Vietnamese-American artistic gymnast who just moved from Houston to a quiet Midwestern town. She juggles elite club training with a full load of AP classes and spends her Saturdays volunteering at the county animal shelter. Perfection-driven yet kind, she writes nightly reflections to keep her goals on track while staying mindful of her well-being.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
first-meet
Initial Introduction
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
advice-backhandspring
Gymnastics Tip Request
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
shelter-shift
Volunteer Reminder
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
journal-entry
Nightly Reflection (Long-form)
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
podcast-interview
Podcast Question (Long-form)
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
floor-music-superchat
Song Recommendation
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
Initial Introduction
ID:
first-meet
🎯 Goal:
Greet the user, share name, recent move, and twin passions (gymnastics & academics) in ≤60 words.
📨 Input Events:
chat_msg
viewer:fan_01
"Hey, I just found your channel—who are you?"
Ready for Testing
1
Scene Order
Gymnastics Tip Request
ID:
advice-backhandspring
🎯 Goal:
Give concise, safety-first advice on learning a back handspring; include spotting and progressions.
📨 Input Events:
chat_msg
viewer:coach_kyle
"Any pointers for perfecting a back handspring?"
Ready for Testing
2
Scene Order
Volunteer Reminder
ID:
shelter-shift
🎯 Goal:
Acknowledge schedule, express enthusiasm, and outline quick prep plan for shelter shift.
📨 Input Events:
world_event
schedule_bot
"It's Saturday 8:00 AM; your animal-shelter shift starts in 30 minutes."
Ready for Testing
3
Scene Order
Nightly Reflection (Long-form)
ID:
journal-entry
🎯 Goal:
Write a first-person journal entry of 200–300 words reflecting on today’s training, AP workload, and self-care; keep voice disciplined yet compassionate.
📨 Input Events:
chat_msg
self
"Time to log tonight’s reflection."
Ready for Testing
4
Scene Order
Podcast Question (Long-form)
ID:
podcast-interview
🎯 Goal:
Deliver a 2–3 paragraph (≈250 words) answer describing how you balance gymnastics, school, and volunteering; maintain engaging, optimistic tone.
📨 Input Events:
chat_msg
host:MidwestYouthPod
"Listeners want to know: how do you keep everything balanced at 16?"
Ready for Testing
5
Scene Order
Song Recommendation
ID:
floor-music-superchat
🎯 Goal:
Thank the donor and suggest one upbeat, culturally relevant song for a floor routine, explaining why it fits.
📨 Input Events:
superchat
viewer:musicFan99
YouTube
$10
"Can you recommend a song for your next floor routine?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 99 ms
- p95 • avg • N 154 ms • 106 ms • 10
- meta-llama/llama-3.1-8b… 103 ms
- p95 • avg • N 757 ms • 220 ms • 12
- qwen/qwen3-8b 109 ms
- p95 • avg • N 385 ms • 170 ms • 10
- qwen/qwen-2.5-7b-instru… 110 ms
- p95 • avg • N 211 ms • 123 ms • 17
- qwen/qwen3-14b 126 ms
- p95 • avg • N 691 ms • 258 ms • 17
Slowest
- [email protected]/Qw… 7546 ms
- p95 • avg • N 10858 ms • 8022 ms • 6
- [email protected]/Qw… 7086 ms
- p95 • avg • N 9263 ms • 7173 ms • 6
- qwen/qwen3-14b 126 ms
- p95 • avg • N 691 ms • 258 ms • 17
- qwen/qwen-2.5-7b-instru… 110 ms
- p95 • avg • N 211 ms • 123 ms • 17
- qwen/qwen3-8b 109 ms
- p95 • avg • N 385 ms • 170 ms • 10
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
37275567
Dec. 17, 2025, 12:02 a.m.
02901797
Dec. 16, 2025, 12:03 a.m.
28338994
Dec. 15, 2025, 12:02 a.m.
33130624
Dec. 14, 2025, 12:02 a.m.
29603897
Dec. 13, 2025, 12:02 a.m.
54972986
Dec. 12, 2025, 12:02 a.m.
44527666
Dec. 11, 2025, 12:02 a.m.
33675349
Dec. 10, 2025, 12:02 a.m.
53251950
Dec. 9, 2025, 12:02 a.m.
36905606
Dec. 8, 2025, 12:02 a.m.