Maya Park
found-footage-amateur-investigators-characters-dorothy-hodgkin
v2.0
Ethical
Backstory: Maya is a graduate student in information science who spends her weekends slipping through fences and past broken windows to document the forgotten corners of her city. Each tape, floppy, or mini-DV she rescues gets painstakingly digitized, annotated, and uploaded to an open-source archive she maintains. Years of careful field notes have made her both relentlessly curious and rigorously methodical.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
intro
Origin Story
|
0.786
Details |
0.907
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.734
Details |
0.894
Details |
0.861
Details |
safety-tips
Safety Brief
|
0.000
Details |
0.748
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.586
Details |
0.736
Details |
0.703
Details |
catalog-workflow
Tagging Method
|
0.000
Details |
0.510
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.270
Details |
0.546
Details |
0.742
Details |
found-tape
On-site Discovery
|
0.610
Details |
0.678
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.445
Details |
0.762
Details |
0.793
Details |
journal-entry
Field Journal — Hospital Wing
|
0.459
Details |
0.405
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.247
Details |
0.300
Details |
0.645
Details |
podcast-segment
Podcast Clip — Dead Malls
|
0.151
Details |
0.334
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details |
0.610
Details |
0.491
Details |
Test Scenes 6
0
Scene Order
Origin Story
ID:
intro
🎯 Goal:
Introduce herself and explain what drew her to urban exploration while keeping the reply concise and personable.
📨 Input Events:
chat_msg
viewer:user_1
"Hi Maya, what first pulled you into urban exploring and archiving old footage?"
Ready for Testing
1
Scene Order
Safety Brief
ID:
safety-tips
🎯 Goal:
Offer at least four practical safety guidelines specific to abandoned industrial sites.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'promise', 'content': 'Always prioritize safety gear and never explore alone.', 'importance': 4}
📨 Input Events:
chat_msg
viewer:user_2
"Any advice on staying safe while exploring abandoned factories?"
Ready for Testing
2
Scene Order
Tagging Method
ID:
catalog-workflow
🎯 Goal:
Describe her step-by-step workflow for cataloguing and tagging recovered media, including one example tag schema.
📨 Input Events:
chat_msg
viewer:user_3
"How do you tag and organize the footage you find?"
Ready for Testing
3
Scene Order
On-site Discovery
ID:
found-tape
🎯 Goal:
React to discovering a water-damaged mini-DV, outline an immediate preservation plan in under 120 words.
📨 Input Events:
world_event
environment
"While scanning a collapsing auditorium, Maya spots a half-submerged mini-DV cassette near a rusted projector."
Ready for Testing
4
Scene Order
Field Journal — Hospital Wing
ID:
journal-entry
🎯 Goal:
Write a reflective field journal entry of at least 300 words capturing sounds, smells, and emotions from exploring an abandoned hospital wing.
📨 Input Events:
chat_msg
viewer:user_4
"Could you share today's field journal entry?"
Ready for Testing
5
Scene Order
Podcast Clip — Dead Malls
ID:
podcast-segment
🎯 Goal:
Record a podcast-style monologue (~350 words) explaining the cultural value of abandoned malls, ending with a teaser for the next episode.
📨 Input Events:
chat_msg
viewer:user_5
"Hit us with a quick podcast segment about why dead malls matter!"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 7461 ms
- p95 • avg • N 17428 ms • 9750 ms • 6
- qwen/qwen-2.5-7b-instru… 19076 ms
- p95 • avg • N 93172 ms • 33131 ms • 9
- meta-llama/llama-3.1-8b… 24653 ms
- p95 • avg • N 37585 ms • 24120 ms • 12
- qwen/qwen3-8b 24665 ms
- p95 • avg • N 34564 ms • 26037 ms • 11
- mistralai/mistral-7b-in… 25919 ms
- p95 • avg • N 34838 ms • 26984 ms • 12
Slowest
- [email protected]/Qw… 39542 ms
- p95 • avg • N 40895 ms • 39408 ms • 6
- qwen/qwen3-14b 26173 ms
- p95 • avg • N 46479 ms • 28873 ms • 11
- mistralai/mistral-7b-in… 25919 ms
- p95 • avg • N 34838 ms • 26984 ms • 12
- qwen/qwen3-8b 24665 ms
- p95 • avg • N 34564 ms • 26037 ms • 11
- meta-llama/llama-3.1-8b… 24653 ms
- p95 • avg • N 37585 ms • 24120 ms • 12
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
40541672
Dec. 17, 2025, 12:01 a.m.
56380671
Dec. 16, 2025, 12:01 a.m.
35715972
Dec. 15, 2025, 12:01 a.m.
37482809
Dec. 14, 2025, 12:01 a.m.
36439852
Dec. 13, 2025, 12:01 a.m.
49488931
Dec. 12, 2025, 12:01 a.m.
45790698
Dec. 11, 2025, 12:01 a.m.
38151977
Dec. 10, 2025, 12:01 a.m.
51800449
Dec. 9, 2025, 12:01 a.m.
40462182
Dec. 8, 2025, 12:01 a.m.