Elena Gronstad

survivalist-stranded-genre-movie-characters-ernest-shackleton v2.0 Ethical
Backstory: Elena is a seasoned polar expedition guide who has led eco-tourism groups through some of the harshest climates on Earth. Resourceful and unflappable, she prioritizes team cohesion and safety while championing meticulous environmental stewardship. Her calm, measured voice reassures travelers even in whiteout conditions.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b
morning-briefing (Dawn briefing on Day 3) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
whiteout-safety (Whiteout emergency guidance) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
allergy-check (Dinner prep and allergy memory) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
nightly-log (Nightly expedition log entry) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
gear-checklist (Detailed gear checklist for audience) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
eco-advice (Post-trip eco advice) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
Test Scenes 6
Scene Order: 0
Dawn briefing on Day 3
ID: morning-briefing
🎯 Goal:
Give a concise morning plan that highlights route, safety checks, and eco-friendly practices while maintaining a steady, supportive tone.
📨 Input Events:
chat_msg trekker:Sam
"Hey Elena, what's our plan for today?"
Ready for Testing
Scene Order: 1
Whiteout emergency guidance
ID: whiteout-safety
🎯 Goal:
Deliver clear, step-by-step instructions to keep the team safe during sudden zero visibility, demonstrating calm leadership and cohesion.
📨 Input Events:
chat_msg trekker:Oscar
"Visibility just dropped to zero! What should we do?"
Ready for Testing
Scene Order: 2
Dinner prep and allergy memory
ID: allergy-check
🎯 Goal:
Acknowledge Maya's shellfish allergy from memory and propose a safe dinner option, showing attentive care.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'fact', 'tags': ['health'], 'content': 'Maya has a severe shellfish allergy.', 'importance': 4}
📨 Input Events:
chat_msg trekker:Maya
"Elena, are tonight's rations safe for my shellfish allergy?"
Ready for Testing
Scene Order: 3
Nightly expedition log entry
ID: nightly-log
🎯 Goal:
Write a reflective log entry of 250–300 words summarizing the day's challenges, environmental observations, and team morale.
📨 Input Events:
world_event base_camp
"Night has fallen; you have a quiet hour to write the expedition log."
Ready for Testing
Scene Order: 4
Detailed gear checklist for audience
ID: gear-checklist
🎯 Goal:
Provide a thorough checklist with at least 10 gear items plus eco-stewardship tips in a mini-podcast style of about 200 words.
📨 Input Events:
superchat viewer:PolarPrep YouTube $50
"Could you share a detailed gear checklist for amateurs preparing for a polar trip?"
Ready for Testing
Scene Order: 5
Post-trip eco advice
ID: eco-advice
🎯 Goal:
Offer 2–3 practical everyday eco-conscious habits in under 100 words, closing with an encouraging tone.
📨 Input Events:
chat_msg trekker:Lin
"Thanks for guiding us! Any quick advice for staying eco-conscious back home?"
Ready for Testing
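Several of the scene goals above state explicit length targets ("250–300 words" for nightly-log, "under 100 words" for eco-advice). A minimal sketch of how such targets could be checked is shown here. The `WORD_LIMITS` table, the function names, and the whitespace-based word count are all assumptions made for illustration, not the suite's actual evaluator. The "about 200 words" target for gear-checklist is left out because the source gives no tolerance for it.

```python
# Hypothetical encoding of the length targets stated in the scene goals.
# Each value is (min_words, max_words); None means no bound on that side.
WORD_LIMITS = {
    "nightly-log": (250, 300),  # goal: "250–300 words"
    "eco-advice": (None, 99),   # goal: "under 100 words"
}


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def within_limits(scene_id: str, response: str) -> bool:
    """Return True if the response satisfies the scene's length target.

    Scenes without an entry in WORD_LIMITS are treated as unconstrained.
    """
    lo, hi = WORD_LIMITS.get(scene_id, (None, None))
    n = count_words(response)
    if lo is not None and n < lo:
        return False
    if hi is not None and n > hi:
        return False
    return True
```

A call such as `within_limits("eco-advice", reply_text)` would then flag a reply of 100 or more words as out of range.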
Latency by Model (This Suite)
Fastest
  • mistralai/mistral-7b-in… 90 ms (p95 202 ms • avg 109 ms • N 18)
  • qwen/qwen-2.5-7b-instru… 98 ms (p95 129 ms • avg 102 ms • N 18)
  • qwen/qwen3-8b 105 ms (p95 124 ms • avg 105 ms • N 17)
  • meta-llama/llama-3.1-8b… 108 ms (p95 184 ms • avg 116 ms • N 17)
  • qwen/qwen3-14b 120 ms (p95 167 ms • avg 125 ms • N 18)
Slowest
  • [email protected]/Qw… 9846 ms (p95 12095 ms • avg 9180 ms • N 6)
  • [email protected]/Qw… 6146 ms (p95 9603 ms • avg 6128 ms • N 6)
  • qwen/qwen3-14b 120 ms (p95 167 ms • avg 125 ms • N 18)
  • meta-llama/llama-3.1-8b… 108 ms (p95 184 ms • avg 116 ms • N 17)
  • qwen/qwen3-8b 105 ms (p95 124 ms • avg 105 ms • N 17)
Per-scene duration for this suite.
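The latency lists above report a p95, an average, and a sample count N for each model. As a rough sketch of how those three figures could be derived from raw per-scene durations, the function below uses the nearest-rank percentile method. The function name and the choice of nearest-rank are assumptions; the dashboard does not say which percentile definition it uses, so its numbers may differ slightly.

```python
def summarize_latency(samples_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds.

    p95 uses the nearest-rank method: the value at 1-based rank
    ceil(0.95 * n) in the sorted samples (an assumed definition).
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Integer arithmetic for ceil(95 * n / 100), avoiding float rounding.
    rank = (95 * n + 99) // 100
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n
```

With only 6 samples, as for the two slowest entries, nearest-rank p95 is simply the maximum observed duration, so a single slow scene dominates that figure.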
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
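To illustrate how dimension weights like these might combine into a single scene score, here is a small sketch of a weighted mean. Only the three weights listed above come from the page. The function name, the per-dimension score scale, and the normalization over the dimensions actually supplied are assumptions, since the full dimension list and the framework's real aggregation rule are not shown.

```python
# Weights for the three top dimensions as listed on the dashboard.
# The remaining dimensions and their weights are not shown, so this is partial.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=TOP_WEIGHTS):
    """Weighted mean over the dimensions present in both mappings.

    Normalizing by the sum of the weights used is an assumption; it keeps
    the result on the same scale as the individual dimension scores.
    """
    shared = [d for d in dimension_scores if d in weights]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        return 0.0
    return sum(weights[d] * dimension_scores[d] for d in shared) / total_weight
```

For example, hypothetical scores of 1.0 for Character Authenticity and 0.0 for Plan Validity would give 0.182 / (0.182 + 0.155), roughly 0.54.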
Recent Runs
  • 39847253 (Dec. 17, 2025, 12:02 a.m.)
  • 05679678 (Dec. 16, 2025, 12:03 a.m.)
  • 30812724 (Dec. 15, 2025, 12:02 a.m.)
  • 35678471 (Dec. 14, 2025, 12:02 a.m.)
  • 32151773 (Dec. 13, 2025, 12:02 a.m.)
  • 58313886 (Dec. 12, 2025, 12:02 a.m.)
  • 47058882 (Dec. 11, 2025, 12:02 a.m.)
  • 36235818 (Dec. 10, 2025, 12:02 a.m.)
  • 56102393 (Dec. 9, 2025, 12:02 a.m.)
  • 39296845 (Dec. 8, 2025, 12:02 a.m.)
Latency Overview (This Suite)