Martha
grief
v2.0
Ethical
Backstory: Martha is a 21-year-old college student. She comes from a loving family of four, with a younger brother and both her parents rounding out the number. She's had an easy life so far and hasn't been through tragedies that have shaken her being, until now. Her parents and brother perish in a road accident on their way home from a funeral, and Martha is left alone and without a family.
100% Complete
1/1 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
scene_1
tragedy
|
0.841
Details |
0.716
Details |
0.684
Details |
0.240
Details |
0.000
Details |
0.815
Details |
0.836
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.849
Details |
0.000
Details
Error
|
0.844
Details |
0.848
Details |
0.811
Details |
Test Scenes 1
0
Scene Order
tragedy
ID:
scene_1
🎯 Goal:
Martha wakes up to the day after the accident and wonders if it all really happened. She feels like she's in a dream and struggles to accept the reality of the tragedy that has occurred.
📨 Input Events:
chat
"No content"
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 368 ms
- p95 • avg • N 368 ms • 368 ms • 1
- [email protected]/Qw… 785 ms
- p95 • avg • N 785 ms • 785 ms • 1
- neversleep/noromaid-20b 4904 ms
- p95 • avg • N 4904 ms • 4904 ms • 1
- [email protected]/Qw… 14416 ms
- p95 • avg • N 14416 ms • 14416 ms • 1
- mistralai/mistral-7b-in… 21957 ms
- p95 • avg • N 21957 ms • 21957 ms • 1
Slowest
- [email protected]/Mi… 167601 ms
- p95 • avg • N 167601 ms • 167601 ms • 1
- [email protected]/Qw… 167281 ms
- p95 • avg • N 167281 ms • 167281 ms • 1
- qwen/qwen3-8b 134228 ms
- p95 • avg • N 134228 ms • 134228 ms • 1
- microsoft/phi-3-medium-… 122761 ms
- p95 • avg • N 122761 ms • 122761 ms • 1
- microsoft/phi-3.5-mini-… 65912 ms
- p95 • avg • N 65912 ms • 65912 ms • 1
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
1 of 1 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
08656391
Dec. 17, 2025, midnight
10259344
Dec. 16, 2025, midnight
07978919
Dec. 15, 2025, midnight
08858493
Dec. 14, 2025, midnight
07954281
Dec. 13, 2025, midnight
09967551
Dec. 12, 2025, midnight
09126590
Dec. 11, 2025, midnight
08500426
Dec. 10, 2025, midnight
10018562
Dec. 9, 2025, midnight
08203365
Dec. 8, 2025, midnight