Layla Samara
literature-history-culture-museum-curator-characters-ibn-battuta
v2.0
Ethical
Backstory: Layla is the meticulous curator of the Meridian Archive, a vast repository of travel diaries, hand-drawn maps, and journey artifacts that span over five centuries. Her passion lies in weaving these historical threads into engaging narratives for modern explorers, often overlaying original routes onto interactive digital maps. Inquisitive by nature and methodical in presentation, she delights in uncovering small details that illuminate the broader sweep of human movement across the globe.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| visitor-intro: First-time visitor asks about an explorer | 0.528 | 0.643 | 0.000 (Error) | 0.000 (Error) | 0.425 | 0.543 | 0.679 |
| map-request: Interactive map explanation | 0.836 | 0.555 | 0.000 (Error) | 0.000 (Error) | 0.412 | 0.445 | 0.630 |
| silk-road-podcast: Long-form audio script on the Silk Road | 0.350 | 0.405 | 0.000 (Error) | 0.000 (Error) | 0.190 | 0.350 | 0.296 |
| donation-superchat: Superchat about preservation priorities | 0.551 | 0.748 | 0.000 (Error) | 0.000 (Error) | 0.626 | 0.700 | 0.000 (Error) |
| pilgrim-blogpost: Long-form blog post on a medieval pilgrimage | 0.202 | 0.258 | 0.000 (Error) | 0.000 (Error) | 0.161 | 0.101 | 0.336 |
| artifact-world-event: Announcement of new artifact acquisition | 0.000 | 0.606 | 0.000 (Error) | 0.000 (Error) | 0.607 | 0.624 | 0.585 |
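As a reading aid, the matrix above can be summarized as one mean score per model. The following is a minimal sketch, not part of the evaluation framework: it assumes that cells marked "Error" should be excluded from the mean, and that the unmarked 0.000 for meta-llama on artifact-world-event is a genuine score. The helper name `mean_score` is hypothetical. The two `[email protected]…` columns are left out because every cell in them is an error.

```python
from statistics import mean

# Scores copied from the matrix above, in row order; None marks an "Error" cell.
# Model names are kept truncated exactly as the dashboard shows them.
SCORES = {
    "meta-llama/llama-3.…": [0.528, 0.836, 0.350, 0.551, 0.202, 0.000],
    "mistralai/mistral-7…": [0.643, 0.555, 0.405, 0.748, 0.258, 0.606],
    "qwen/qwen-2.5-7b-in…": [0.425, 0.412, 0.190, 0.626, 0.161, 0.607],
    "qwen/qwen3-14b": [0.543, 0.445, 0.350, 0.700, 0.101, 0.624],
    "qwen/qwen3-8b": [0.679, 0.630, 0.296, None, 0.336, 0.585],
}


def mean_score(cells):
    """Average the scored cells, skipping errored (None) ones.

    Returns None when every cell errored. Hypothetical helper for illustration.
    """
    scored = [c for c in cells if c is not None]
    return round(mean(scored), 3) if scored else None


summary = {model: mean_score(cells) for model, cells in SCORES.items()}

for model, value in sorted(summary.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model:<24} {value:.3f}")
```

Whether errored cells should instead count as zero is a design choice. Counting them as zero would lower the qwen/qwen3-8b mean, since its donation-superchat run failed.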
Test Scenes 6
Scene Order: 0
First-time visitor asks about an explorer
ID: visitor-intro
🎯 Goal: Greet the visitor warmly and give a concise, well-sourced overview of Sir Richard Burton’s 1858 Nile expedition, highlighting one intriguing diary excerpt.
📨 Input Events:
chat_msg • viewer:guest_42 • "Hi Layla, which Nile journey diaries do you recommend starting with?"
Ready for Testing
Scene Order: 1
Interactive map explanation
ID: map-request
🎯 Goal: Explain how to access and interpret the layered digital map for Zheng He’s voyages, mentioning at least two key ports and one modern geographic overlay.
📨 Input Events:
chat_msg • viewer:cartography_buff • "Could you show me an interactive version of Zheng He’s routes?"
Ready for Testing
Scene Order: 2
Long-form audio script on the Silk Road
ID: silk-road-podcast
🎯 Goal: Deliver a ~400-word podcast script in Layla’s voice that traces Marco Polo’s Silk Road path, using three section headings and vivid scene-setting details.
📨 Input Events:
chat_msg • viewer:history_podcaster • "I’m recording a segment on Marco Polo; can you draft a short script?"
Ready for Testing
Scene Order: 3
Superchat about preservation priorities
ID: donation-superchat
🎯 Goal: Thank the donor and outline a clear, three-step plan for conserving salt-damaged 17th-century sea charts.
📨 Input Events:
superchat • viewer:maplover99 • YouTube • $50 • "Here’s $50 to help save more old maps!"
Ready for Testing
Scene Order: 4
Long-form blog post on a medieval pilgrimage
ID: pilgrim-blogpost
🎯 Goal: Write a ~600-word blog entry narrated in first person from a 13th-century female pilgrim, inserting two bracketed curator’s notes that clarify historical context.
📨 Input Events:
chat_msg • viewer:medievalist • "Do you have anything on lesser-known medieval women travelers?"
Ready for Testing
Scene Order: 5
Announcement of new artifact acquisition
ID: artifact-world-event
🎯 Goal: Announce the arrival of a 19th-century compass, relate it to at least one existing diary, and invite visitors to the upcoming exhibit opening.
📨 Input Events:
world_event • system • "A mahogany-cased 1843 marine compass has just been cataloged."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 7514 ms (p95 8411 ms • avg 6771 ms • N 6)
- [email protected]/Qw…: 7542 ms (p95 15860 ms • avg 9792 ms • N 6)
- qwen/qwen-2.5-7b-instru…: 26184 ms (p95 40436 ms • avg 28096 ms • N 12)
- mistralai/mistral-7b-in…: 26413 ms (p95 36208 ms • avg 27311 ms • N 12)
- qwen/qwen3-8b: 27797 ms (p95 33914 ms • avg 26169 ms • N 11)
Slowest
- meta-llama/llama-3.1-8b…: 37505 ms (p95 65029 ms • avg 38849 ms • N 10)
- qwen/qwen3-14b: 30941 ms (p95 41332 ms • avg 31482 ms • N 12)
- qwen/qwen3-8b: 27797 ms (p95 33914 ms • avg 26169 ms • N 11)
- mistralai/mistral-7b-in…: 26413 ms (p95 36208 ms • avg 27311 ms • N 12)
- qwen/qwen-2.5-7b-instru…: 26184 ms (p95 40436 ms • avg 28096 ms • N 12)
Per-scene duration for this suite.
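For readers who want to reproduce figures of this kind, here is a minimal sketch of how a p95, an average, and a sample count N could be derived from raw per-scene durations. The dashboard does not state its percentile method, so the nearest-rank rule used here is an assumption. The function name `latency_summary` and the sample durations are hypothetical and are not taken from the runs above.

```python
def latency_summary(durations_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds.

    Uses the nearest-rank percentile: the value at 1-based rank ceil(0.95 * n)
    in the sorted list. This is an assumed method, chosen for illustration.
    """
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    rank = (95 * n + 99) // 100  # integer form of ceil(0.95 * n)
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n


# Made-up sample durations for six scenes (not dashboard data).
sample = [1000, 2000, 3000, 4000, 5000, 6000]
p95, avg, n = latency_summary(sample)
print(f"p95 {p95} ms • avg {avg:.0f} ms • N {n}")
```

With small N, as in this suite, a nearest-rank p95 is simply the slowest observation, so a single slow scene can dominate it.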
Completion Progress: 100% (6 of 6 scenes completed)
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
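To illustrate how dimension weights like these might combine into a single scene score, the sketch below computes a weighted mean. It is an assumption-laden illustration, not the framework's actual formula. Only the three top weights are listed on this page, so the function renormalizes by the weights it is given. The function name `weighted_score` and the per-dimension scores in `example` are hypothetical.

```python
# Top weights as listed above; the remaining dimensions are not shown on this page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted mean over the dimensions present in both mappings.

    Renormalizes by the sum of the weights actually used, so a partial
    weight table still yields a score on the same 0..1 scale as its inputs.
    """
    used = [d for d in dimension_scores if d in weights]
    total_weight = sum(weights[d] for d in used)
    if total_weight == 0:
        raise ValueError("no weighted dimensions to aggregate")
    return sum(dimension_scores[d] * weights[d] for d in used) / total_weight


# Hypothetical per-dimension scores for one scene (not dashboard data).
example = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.5,
}
score = weighted_score(example, TOP_WEIGHTS)
print(f"{score:.3f}")
```

Renormalizing keeps the result comparable when some dimensions are missing. A framework that sums weighted scores without renormalizing would instead produce lower totals for incomplete evaluations.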
Recent Runs
- 52893105 • Dec. 17, 2025, 12:01 a.m.
- 11157519 • Dec. 16, 2025, 12:02 a.m.
- 47401800 • Dec. 15, 2025, 12:01 a.m.
- 49663826 • Dec. 14, 2025, 12:01 a.m.
- 48013575 • Dec. 13, 2025, 12:01 a.m.
- 03439472 • Dec. 12, 2025, 12:02 a.m.
- 58663377 • Dec. 11, 2025, 12:01 a.m.
- 50077406 • Dec. 10, 2025, 12:01 a.m.
- 05719855 • Dec. 9, 2025, 12:02 a.m.
- 52818383 • Dec. 8, 2025, 12:01 a.m.