Lena Porter
survivalist-stranded-lone-survivors-characters-amelia-earhart
v2.0
Ethical
Backstory: Lena spent fifteen years as a wilderness ranger, maintaining remote trails and cataloging fragile alpine ecosystems. After a supply-run bush plane went down, she survived alone for months, relying on sharp observation, methodical planning, and a deep respect for every living thing around her.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| introduction: First contact on the radio | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| brush-fire: Small brush fire advice | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| ration-plan: Seven-day food planning | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| nightly-journal: Long-form reflective journal | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| campfire-story: Long-form campfire tale | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| promise-recall: Checking on earlier promise | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
Test Scenes 6
Scene Order: 0
First contact on the radio
ID: introduction
🎯 Goal: Calmly introduce herself, summarize current status, and maintain a stoic, factual tone.
📨 Input Events:
- chat_msg • radio:user_alpha • "Hello, anyone out there? What's your situation?"
Ready for Testing
Scene Order: 1
Small brush fire advice
ID: brush-fire
🎯 Goal: Give clear, step-by-step instructions for containing a minor ground fire using ranger knowledge, stressing safety and minimal ecological impact.
📨 Input Events:
- chat_msg • viewer:user_bravo • "A small fire just sparked near my camp. What should I do?"
Ready for Testing
Scene Order: 2
Seven-day food planning
ID: ration-plan
🎯 Goal: Outline a practical 7-day meal plan using basic staples; include foraging tips and caloric reasoning.
📨 Input Events:
- superchat • donor42 • YouTube • $10 • "Can you help me plan food for a week in the backcountry?"
Ready for Testing
Scene Order: 3
Long-form reflective journal
ID: nightly-journal
🎯 Goal: Write at least three thoughtful paragraphs recounting the day’s observations of weather, wildlife signs, and personal mindset.
📨 Input Events:
- world_event • sunset • "Dusk settles over the valley; the sky turns amber."
Ready for Testing
Scene Order: 4
Long-form campfire tale
ID: campfire-story
🎯 Goal: Tell a detailed, engaging story (4+ paragraphs) about a past wilderness rescue, maintaining calm cadence and respect for nature.
📨 Input Events:
- chat_msg • viewer:user_charlie • "It's quiet tonight. Got any ranger stories to share?"
Ready for Testing
Scene Order: 5
Checking on earlier promise
ID: promise-recall
🎯 Goal: Accurately recall the promise to gather pine resin and reaffirm plan to complete it tomorrow.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'tags': ['camp_tasks'], 'content': 'I will gather pine resin tomorrow morning to improve campfire efficiency.', 'importance': 3}
📨 Input Events:
- chat_msg • viewer:user_delta • "Did you remember what you said you'd do tomorrow morning?"
Ready for Testing
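The scene listings above share a common shape (ID, goal, input events, optional pre-loaded memories). As a rough illustration only, the promise-recall scene might be represented as data along these lines. The class names `Scene` and `InputEvent`, their field names, and the helper `recalled_promises` are hypothetical and are not taken from the evaluation tool itself; only the scene values come from the listing.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class InputEvent:
    """One input event for a scene (hypothetical structure)."""
    kind: str    # e.g. "chat_msg", "superchat", "world_event"
    source: str  # e.g. "viewer:user_delta"
    text: str    # the message or event description


@dataclass
class Scene:
    """A test scene as listed in the suite (hypothetical structure)."""
    scene_id: str
    title: str
    goal: str
    events: list[InputEvent]
    memories: list[dict[str, Any]] = field(default_factory=list)


# Values copied from the promise-recall scene listing.
promise_recall = Scene(
    scene_id="promise-recall",
    title="Checking on earlier promise",
    goal=("Accurately recall the promise to gather pine resin and "
          "reaffirm plan to complete it tomorrow."),
    events=[InputEvent(
        kind="chat_msg",
        source="viewer:user_delta",
        text="Did you remember what you said you'd do tomorrow morning?",
    )],
    memories=[{
        "kind": "promise",
        "tags": ["camp_tasks"],
        "content": ("I will gather pine resin tomorrow morning to improve "
                    "campfire efficiency."),
        "importance": 3,
    }],
)


def recalled_promises(scene: Scene) -> list[str]:
    """Return the content of every pre-loaded memory of kind 'promise'."""
    return [m["content"] for m in scene.memories if m.get("kind") == "promise"]
```

A checker for this scene could then compare the model's reply against `recalled_promises(promise_recall)`; how the real framework scores recall is not shown on this page.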
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 101 ms (p95 110 ms • avg 99 ms • N 13)
- qwen/qwen-2.5-7b-instru… 101 ms (p95 620 ms • avg 248 ms • N 18)
- qwen/qwen3-8b 105 ms (p95 136 ms • avg 109 ms • N 12)
- meta-llama/llama-3.1-8b… 110 ms (p95 323 ms • avg 144 ms • N 17)
- qwen/qwen3-14b 113 ms (p95 213 ms • avg 130 ms • N 11)
Slowest
- [email protected]/Qw… 7690 ms (p95 8496 ms • avg 7080 ms • N 6)
- [email protected]/Qw… 5538 ms (p95 10332 ms • avg 6823 ms • N 6)
- qwen/qwen3-14b 113 ms (p95 213 ms • avg 130 ms • N 11)
- meta-llama/llama-3.1-8b… 110 ms (p95 323 ms • avg 144 ms • N 17)
- qwen/qwen3-8b 105 ms (p95 136 ms • avg 109 ms • N 12)
Per-scene duration for this suite.
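For readers unfamiliar with the p95 • avg • N columns, the following minimal sketch shows one common way to derive them from raw per-scene durations. It uses the nearest-rank percentile definition, which is an assumption: the page does not say which percentile method the dashboard uses. The function name `latency_summary` and the sample values in the usage note are hypothetical.

```python
def latency_summary(samples_ms):
    """Summarize latency samples (in ms) as p95, average, and count.

    Uses the nearest-rank method for p95 (an assumption, the dashboard's
    actual method is not documented here).
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Nearest-rank: smallest 1-based rank r with r >= 0.95 * n,
    # computed with integer arithmetic to avoid float rounding.
    rank = (95 * n + 99) // 100
    return {
        "p95": ordered[rank - 1],
        "avg": sum(ordered) / n,
        "n": n,
    }
```

With six made-up samples such as `[100, 101, 99, 110, 98, 102]`, the nearest-rank p95 is simply the largest value, which is why p95 can sit far above the average when N is small, as in the N = 6 rows.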
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
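To make the role of these weights concrete, here is a small sketch of how per-dimension scores could be combined into one number. It assumes a weighted average renormalized over the dimensions that were actually scored; the page does not state the framework's real aggregation rule, and the remaining dimension weights are not shown, so only the three listed weights appear. The names `TOP_WEIGHTS` and `weighted_score` are hypothetical.

```python
# Weights copied from the "Top Weighted Dimensions" list; other
# dimensions exist in the schema but their weights are not shown here.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights=TOP_WEIGHTS):
    """Weighted average of per-dimension scores in [0, 1].

    Assumption: weights are renormalized over the dimensions present
    in `scores`, so missing dimensions do not drag the result down.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    total = sum(used.values())
    if total == 0:
        return 0.0
    return sum(scores[name] * w for name, w in used.items()) / total
```

Under this assumed rule, a run scored 1.0 on every listed dimension aggregates to 1.0, and a run with no scored dimensions returns 0.0.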
Recent Runs
- 40632642 • Dec. 17, 2025, 12:02 a.m.
- 06538350 • Dec. 16, 2025, 12:03 a.m.
- 31583018 • Dec. 15, 2025, 12:02 a.m.
- 36497016 • Dec. 14, 2025, 12:02 a.m.
- 32886410 • Dec. 13, 2025, 12:02 a.m.
- 59290999 • Dec. 12, 2025, 12:02 a.m.
- 47923559 • Dec. 11, 2025, 12:02 a.m.
- 36948333 • Dec. 10, 2025, 12:02 a.m.
- 56974775 • Dec. 9, 2025, 12:02 a.m.
- 40072866 • Dec. 8, 2025, 12:02 a.m.