Clara Álvarez
literature-history-culture-museum-curator-characters-dorothy-hodgkin
v2.0
Ethical
Backstory: Growing up in a multilingual household, Clara developed an early fascination with handwriting as a window into culture. She now curates rare literary manuscripts, combining meticulous conservation skills with energetic community outreach to make fragile texts accessible to broader audiences. Her enthusiasm is matched only by her eye for detail and commitment to ethical preservation.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| fragile-diary: Assessing a fragile 18th-century diary | 0.794 | 0.585 | 0.000 (Error) | 0.000 (Error) | 0.622 | 0.569 | 0.646 |
| translate-spanish-quote: Quick multilingual translation | 0.576 | 0.738 | 0.000 (Error) | 0.000 (Error) | 0.845 | 0.653 | 0.680 |
| social-media-invite: Outreach tweet draft | 0.646 | 0.662 | 0.000 (Error) | 0.000 (Error) | 0.649 | 0.510 | 0.552 |
| blog-restoration-journey: 400-word blog post on restoration journey | 0.000 | 0.510 | 0.000 (Error) | 0.000 (Error) | 0.164 | 0.253 | 0.345 |
| conservation-plan-letter: Detailed conservation plan | 0.492 | 0.449 | 0.000 (Error) | 0.000 (Error) | 0.386 | 0.176 | 0.413 |
| superchat-donation: Thanking donor | 0.000 | 0.701 | 0.000 (Error) | 0.000 (Error) | 0.675 | 0.718 | 0.896 |
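To compare models across the whole matrix, one option is to average each column while leaving out runs flagged as Error. The sketch below is only an illustration: the names `MATRIX`, `mean_score`, and `rank_models` are hypothetical, the scores are transcribed from the matrix above, and treating an unflagged 0.000 as a real score (rather than a failure) is an assumption.

```python
from statistics import mean

# (model label as shown in the matrix, scores in scene order).
# None marks a run the dashboard flagged as "Error".
MATRIX = [
    ("meta-llama/llama-3.…", [0.794, 0.576, 0.646, 0.000, 0.492, 0.000]),
    ("mistralai/mistral-7…", [0.585, 0.738, 0.662, 0.510, 0.449, 0.701]),
    ("[email protected]…", [None] * 6),
    ("[email protected]…", [None] * 6),
    ("qwen/qwen-2.5-7b-in…", [0.622, 0.845, 0.649, 0.164, 0.386, 0.675]),
    ("qwen/qwen3-14b", [0.569, 0.653, 0.510, 0.253, 0.176, 0.718]),
    ("qwen/qwen3-8b", [0.646, 0.680, 0.552, 0.345, 0.413, 0.896]),
]


def mean_score(scores):
    """Average the non-errored scores; return None if every run errored."""
    valid = [s for s in scores if s is not None]
    return mean(valid) if valid else None


def rank_models(matrix):
    """Return (model, mean) pairs sorted best first, skipping all-error models."""
    ranked = [(name, mean_score(scores)) for name, scores in matrix]
    ranked = [(name, avg) for name, avg in ranked if avg is not None]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, avg in rank_models(MATRIX):
        print(f"{name:24s} {avg:.3f}")
```

A plain mean weights every scene equally; a suite that cares more about some scenes would need per-scene weights, which the page does not show.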
Test Scenes 6
Assessing a fragile 18th-century diary
Scene Order: 0
ID: fragile-diary
🎯 Goal: Provide an enthusiastic yet precise assessment of how to safely handle and conserve the diary, referencing at least one specific conservation technique.
📨 Input Events: chat_msg • viewer:user_17
"Hi Clara, I just acquired an 18th-century diary that feels brittle. How should I handle and preserve it?"
Ready for Testing
Quick multilingual translation
Scene Order: 1
ID: translate-spanish-quote
🎯 Goal: Translate the Spanish sentence accurately and comment briefly on any cultural nuance, reflecting multilingual expertise.
📨 Input Events: chat_msg • viewer:user_4
"Could you translate this line for me? "No hay libro tan malo que no tenga algo bueno.""
Ready for Testing
Outreach tweet draft
Scene Order: 2
ID: social-media-invite
🎯 Goal: Draft a concise, engaging tweet (<280 characters) inviting the public to visit a new exhibit, maintaining an enthusiastic tone.
📨 Input Events: chat_msg • viewer:user_22
"We’re opening a "Voices on Paper" exhibit next month. Can you craft a tweet to spread the word?"
Ready for Testing
400-word blog post on restoration journey
Scene Order: 3
ID: blog-restoration-journey
🎯 Goal: Write a roughly 400-word blog post in a warm, accessible voice narrating the restoration of a water-stained Renaissance manuscript; incorporate three technical details without jargon overload.
📨 Input Events: chat_msg • viewer:user_8
"Our followers loved your last story. Could you write a 400-word post about restoring that water-stained Renaissance manuscript?"
Ready for Testing
Detailed conservation plan
Scene Order: 4
ID: conservation-plan-letter
🎯 Goal: Produce a structured conservation plan (bulleted list) for a 19th-century water-damaged letter, specifying materials, steps, and estimated time.
📨 Input Events: chat_msg • viewer:user_11
"I have a 19th-century letter with water damage. Could you outline a conservation plan?"
Ready for Testing
Thanking donor
Scene Order: 5
ID: superchat-donation
🎯 Goal: Thank the donor warmly and explain how their donation aids manuscript preservation and outreach, matching the superchat context.
📨 Input Events: superchat • viewer:Donor42 • YouTube • $100
"For the love of books!"
Ready for Testing
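Several of the scene goals contain mechanical constraints (a tweet under 280 characters, a post of roughly 400 words, a bulleted plan) that could be checked before any rubric scoring. The following is a minimal sketch, not the suite's actual evaluator: every function name is hypothetical, the 10% word-count tolerance and the three-bullet minimum are assumptions, and `len()` counts Python code points, which only approximates how a social platform counts characters.

```python
import re


def fits_tweet(text: str, limit: int = 280) -> bool:
    """True when the draft is strictly under the limit (code-point count)."""
    return len(text) < limit


def word_count(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def is_roughly(n_words: int, target: int = 400, tolerance: float = 0.10) -> bool:
    """True when n_words is within +/- tolerance of the target (assumed 10%)."""
    return abs(n_words - target) <= target * tolerance


def has_bullets(text: str, minimum: int = 3) -> bool:
    """True when at least `minimum` lines start with a bullet or list number."""
    pattern = r"^[ \t]*(?:[-*•]|\d+[.)])[ \t]+\S"
    return len(re.findall(pattern, text, flags=re.MULTILINE)) >= minimum


if __name__ == "__main__":
    draft = 'Step inside "Voices on Paper" next month!'
    print(fits_tweet(draft), is_roughly(word_count("word " * 400)))
```

Checks like these only cover form; tone, accuracy, and character voice still need the scored dimensions.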
Latency by Model (This Suite)
Fastest
- [email protected]/Qw…: 3971 ms (p95 4951 ms • avg 3988 ms • N 6)
- [email protected]/Qw…: 6877 ms (p95 8179 ms • avg 6804 ms • N 6)
- qwen/qwen-2.5-7b-instru…: 20674 ms (p95 38967 ms • avg 24368 ms • N 12)
- qwen/qwen3-8b: 25415 ms (p95 33889 ms • avg 26715 ms • N 12)
- meta-llama/llama-3.1-8b…: 26313 ms (p95 67562 ms • avg 31744 ms • N 12)
Slowest
- qwen/qwen3-14b: 30647 ms (p95 54634 ms • avg 32873 ms • N 10)
- mistralai/mistral-7b-in…: 27986 ms (p95 36432 ms • avg 28525 ms • N 11)
- meta-llama/llama-3.1-8b…: 26313 ms (p95 67562 ms • avg 31744 ms • N 12)
- qwen/qwen3-8b: 25415 ms (p95 33889 ms • avg 26715 ms • N 12)
- qwen/qwen-2.5-7b-instru…: 20674 ms (p95 38967 ms • avg 24368 ms • N 12)
Per-scene duration for this suite.
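For readers who want to reproduce the p95 • avg • N figures from raw per-scene durations, here is a small sketch. It assumes the nearest-rank definition of the 95th percentile, which may differ from whatever method the dashboard uses, and the sample durations are hypothetical values, not data from this suite.

```python
from statistics import mean


def p95_nearest_rank(samples_ms):
    """95th percentile by the nearest-rank method (assumed definition)."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # ceil(0.95 * n) in integer arithmetic to avoid float rounding; 1-based rank.
    rank = -(-95 * len(ordered) // 100)
    return ordered[rank - 1]


def summarize(samples_ms):
    """Return the three figures the dashboard lists per model."""
    return {
        "p95_ms": p95_nearest_rank(samples_ms),
        "avg_ms": mean(samples_ms),
        "n": len(samples_ms),
    }


if __name__ == "__main__":
    # Hypothetical durations in milliseconds, one per scene run.
    durations = [3200, 3600, 3900, 4100, 4300, 4900]
    print(summarize(durations))
```

With only 6 to 12 samples per model, the nearest-rank p95 is simply the largest or second-largest duration, so a single slow run dominates it.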
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
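One plausible way such weights combine per-dimension scores into a single scene score is a weighted average. The sketch below is an assumption, not the framework's documented formula: only the three weights listed above come from the page, the remaining dimensions and their weights are not shown, the name `weighted_score` is hypothetical, and normalizing by the sum of the weights actually present is a design choice made here so the result stays in the 0 to 1 range.

```python
# Top three weights as listed on the page; other dimensions are not shown.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted average over the dimensions present in both mappings."""
    used = {name: w for name, w in weights.items() if name in dimension_scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no scored dimension has a weight")
    return sum(dimension_scores[name] * w for name, w in used.items()) / total


if __name__ == "__main__":
    # Hypothetical per-dimension scores for one response.
    example = {
        "Character Authenticity": 0.9,
        "Plan Validity": 0.6,
        "Contextual Intelligence": 0.7,
    }
    print(round(weighted_score(example), 3))
```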
Recent Runs
52136555 • Dec. 17, 2025, 12:01 a.m.
10045876 • Dec. 16, 2025, 12:02 a.m.
46679319 • Dec. 15, 2025, 12:01 a.m.
48864501 • Dec. 14, 2025, 12:01 a.m.
47288757 • Dec. 13, 2025, 12:01 a.m.
02414109 • Dec. 12, 2025, 12:02 a.m.
57862649 • Dec. 11, 2025, 12:01 a.m.
49298233 • Dec. 10, 2025, 12:01 a.m.
04917590 • Dec. 9, 2025, 12:02 a.m.
52046686 • Dec. 8, 2025, 12:01 a.m.