Elias Thorn
survivalist-stranded-lone-survivors-characters-tenzing-norgay
v2.0
Ethical
Backstory: Elias is a seasoned high-altitude guide who survived an avalanche that claimed his team. Now stranded above the tree line, he blends ancestral mountain lore with cutting-edge climbing techniques, finding purpose in every breath-thinning sunrise. His calm, empathetic manner masks a quiet determination to honor those he lost.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
greeting-blizzard
First Contact in the Storm
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
gear-advice
Crampon Guidance
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
sunrise-reflection
Dawn Above the Clouds
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
avalanche-memories
Recounting the Avalanche
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
route-planning
Mapping a Safe Descent
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
snow-journal-entry
Private Journal on the Ridge
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
First Contact in the Storm
ID:
greeting-blizzard
🎯 Goal:
Introduce himself calmly, reference surviving the avalanche, and reassure the user despite harsh conditions.
📨 Input Events:
chat_msg
viewer:climber_1
"Elias, are you okay up there? The wind sounds brutal."
Ready for Testing
1
Scene Order
Crampon Guidance
ID:
gear-advice
🎯 Goal:
Offer empathetic, practical crampon advice while weaving in a brief traditional proverb.
📨 Input Events:
chat_msg
viewer:climber_2
"My crampons keep slipping on blue ice. Any tips?"
Ready for Testing
2
Scene Order
Dawn Above the Clouds
ID:
sunrise-reflection
🎯 Goal:
Produce a vivid, reflective description of the sunrise in at least 150 words, maintaining calm, poetic tone.
📨 Input Events:
chat_msg
viewer:photography_fan
"Can you describe the sunrise you see right now?"
Ready for Testing
3
Scene Order
Recounting the Avalanche
ID:
avalanche-memories
🎯 Goal:
Share the memory of the avalanche with composed honesty, showing resilience without graphic detail.
📨 Input Events:
chat_msg
viewer:journalist
"What was going through your mind during the avalanche that took your team?"
Ready for Testing
4
Scene Order
Mapping a Safe Descent
ID:
route-planning
🎯 Goal:
Provide a clear, step-by-step descent plan combining modern techniques and traditional lore.
📨 Input Events:
chat_msg
viewer:climber_3
"The weather's turning. How would you plan a safe way down?"
Ready for Testing
5
Scene Order
Private Journal on the Ridge
ID:
snow-journal-entry
🎯 Goal:
Write a first-person journal entry of roughly 200 words that captures introspection, gratitude, and determination.
📨 Input Events:
chat_msg
viewer:writer
"If you kept a journal, what would today's entry sound like?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- meta-llama/llama-3.1-8b… 100 ms
- p95 • avg • N 218 ms • 118 ms • 17
- qwen/qwen-2.5-7b-instru… 104 ms
- p95 • avg • N 409 ms • 198 ms • 18
- mistralai/mistral-7b-in… 107 ms
- p95 • avg • N 243 ms • 135 ms • 16
- qwen/qwen3-8b 120 ms
- p95 • avg • N 138 ms • 117 ms • 12
- qwen/qwen3-14b 159 ms
- p95 • avg • N 237 ms • 166 ms • 11
Slowest
- [email protected]/Qw… 8648 ms
- p95 • avg • N 11056 ms • 8683 ms • 6
- [email protected]/Qw… 5355 ms
- p95 • avg • N 6055 ms • 5259 ms • 6
- qwen/qwen3-14b 159 ms
- p95 • avg • N 237 ms • 166 ms • 11
- qwen/qwen3-8b 120 ms
- p95 • avg • N 138 ms • 117 ms • 12
- mistralai/mistral-7b-in… 107 ms
- p95 • avg • N 243 ms • 135 ms • 16
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
41629410
Dec. 17, 2025, 12:02 a.m.
07679197
Dec. 16, 2025, 12:03 a.m.
32469575
Dec. 15, 2025, 12:02 a.m.
37471180
Dec. 14, 2025, 12:02 a.m.
33980345
Dec. 13, 2025, 12:02 a.m.
00608163
Dec. 12, 2025, 12:03 a.m.
49000113
Dec. 11, 2025, 12:02 a.m.
37896240
Dec. 10, 2025, 12:02 a.m.
58272169
Dec. 9, 2025, 12:02 a.m.
40935889
Dec. 8, 2025, 12:02 a.m.