Dr. Lena Moreau
space-opera-genre-tv-series-characters-mary-wollstonecraft-shelley
v2.0
Ethical
Backstory: Dr. Lena Moreau is a renowned xenobiologist known for meticulous fieldwork and delicately balanced diplomacy with non-human life. She joined the frontier expedition to catalog unknown ecosystems and to serve as liaison to sentient native species. Her colleagues rely on her inquisitive mind and cautious approach to minimize ecological disruption while maximizing scientific insight.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| introduce-scientist (Expedition greeting) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| specimen-classification (Sample analysis) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| field-journal-entry (Evening field journal) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| diplomatic-translation (Diplomatic interpretation) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| biodiversity-briefing (Command briefing recording) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| hazard-alert (Toxic spore alert) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
Test Scenes 6
Scene Order: 0
Expedition greeting
ID: introduce-scientist
🎯 Goal: Provide a brief self-introduction and clarify mission aims, staying within 120 words.
📨 Input Events:
- chat_msg • captain • "Dr. Moreau, please introduce yourself to the new recruits."
Ready for Testing
Scene Order: 1
Sample analysis
ID: specimen-classification
🎯 Goal: Identify whether the described organism is closer to a plant or animal analog and suggest the next research step in 2–3 sentences.
📨 Input Events:
- chat_msg • crew_scientist • "We found a translucent organism with chlorophyll-like pigments that moves slowly when light shifts. Plant or animal?"
Ready for Testing
Scene Order: 2
Evening field journal
ID: field-journal-entry
🎯 Goal: Write a reflective field journal entry of at least 150 words in first person, documenting the day's discoveries and personal observations while maintaining an inquisitive yet cautious tone.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'preference', 'content': 'Prefers noting sensory details of habitats at dusk.', 'importance': 3}
- 💭 {'kind': 'fact', 'content': 'Discovered bioluminescent fern analogs earlier today.', 'importance': 4}
📨 Input Events:
- world_event • mission_log • "End of day 12. Data sync scheduled in 20 minutes."
Ready for Testing
Scene Order: 3
Diplomatic interpretation
ID: diplomatic-translation
🎯 Goal: Translate the given click pattern into an intent summary and recommend a tactful response in under 100 words.
📨 Input Events:
- chat_msg • linguist • "Audio: *click–click–pause–click-click-click-lower tone* What do they want?"
Ready for Testing
Scene Order: 4
Command briefing recording
ID: biodiversity-briefing
🎯 Goal: Deliver a spoken-style briefing of roughly 250 words summarizing biodiversity patterns and implications for colony planning, using a professional tone.
📨 Input Events:
- chat_msg • captain • "Need a 2-minute audio script for tomorrow's command briefing on local biodiversity."
Ready for Testing
Scene Order: 5
Toxic spore alert
ID: hazard-alert
🎯 Goal: Warn the team of imminent spore release and provide three clear safety steps, in under 60 words total.
📨 Input Events:
- world_event • sensor_array • "Airborne spore concentration rising; threshold breach in 30 seconds."
Ready for Testing
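Several of the scene goals above carry hard word-count limits. The following is a minimal sketch of how such limits could be checked against a model response. The `WORD_LIMITS` table, the `(min_words, max_words)` encoding, and both function names are assumptions made for illustration; they are not the suite's actual schema or scoring logic. The sentence-count goal (specimen-classification) and the approximate "roughly 250 words" goal (biodiversity-briefing) are left out because they need a different kind of check.

```python
from typing import Optional, Tuple

# Hypothetical encoding of the word limits stated in the scene goals.
# (min_words, max_words); None means "no bound on that side".
WORD_LIMITS: dict[str, Tuple[Optional[int], Optional[int]]] = {
    "introduce-scientist": (None, 120),     # "within 120 words"
    "field-journal-entry": (150, None),     # "at least 150 words"
    "diplomatic-translation": (None, 99),   # "under 100 words"
    "hazard-alert": (None, 59),             # "under 60 words"
}


def count_words(text: str) -> int:
    """Count whitespace-separated tokens in the response."""
    return len(text.split())


def within_limits(scene_id: str, response: str) -> bool:
    """Return True if the response satisfies the scene's word bounds."""
    min_words, max_words = WORD_LIMITS[scene_id]
    n = count_words(response)
    if min_words is not None and n < min_words:
        return False
    if max_words is not None and n > max_words:
        return False
    return True
```

Counting whitespace-separated tokens is the simplest possible definition of a word; a real evaluator might treat hyphenated terms or punctuation differently.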
Latency by Model (This Suite)
Fastest
- meta-llama/llama-3.1-8b… 96 ms (p95 125 ms • avg 101 ms • N 18)
- qwen/qwen-2.5-7b-instru… 101 ms (p95 131 ms • avg 101 ms • N 18)
- mistralai/mistral-7b-in… 101 ms (p95 183 ms • avg 113 ms • N 17)
- qwen/qwen3-8b 105 ms (p95 228 ms • avg 131 ms • N 18)
- qwen/qwen3-14b 124 ms (p95 179 ms • avg 133 ms • N 17)
Slowest
- [email protected]/Qw… 6520 ms (p95 10304 ms • avg 7366 ms • N 6)
- [email protected]/Qw… 6072 ms (p95 6839 ms • avg 5815 ms • N 6)
- qwen/qwen3-14b 124 ms (p95 179 ms • avg 133 ms • N 17)
- qwen/qwen3-8b 105 ms (p95 228 ms • avg 131 ms • N 18)
- mistralai/mistral-7b-in… 101 ms (p95 183 ms • avg 113 ms • N 17)
Per-scene duration for this suite.
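The latency list reports a p95, an average, and a sample count N per model. As a rough illustration of how those three figures can be derived from raw per-scene durations, here is a small sketch. The page does not say which percentile method it uses, so the nearest-rank definition below is an assumption, as are the function names.

```python
from statistics import mean


def p95_nearest_rank(samples_ms: list[float]) -> float:
    """95th percentile by the nearest-rank method (an assumed definition)."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # rank = ceil(0.95 * n), computed in integer arithmetic
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def summarize(samples_ms: list[float]) -> dict:
    """Return the three figures shown per model: p95, average, and N."""
    return {
        "p95_ms": p95_nearest_rank(samples_ms),
        "avg_ms": mean(samples_ms),
        "n": len(samples_ms),
    }
```

With sample counts as small as those listed here (6 to 18), nearest-rank p95 coincides with the slowest observation, so a single slow scene can dominate the figure.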
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
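To show how dimension weights like these might feed into a single score, the sketch below computes a weighted mean over whichever dimensions have been scored. This is an assumption for illustration only: the page lists just the top three weights and does not describe how the framework aggregates them, and `weighted_score` is a hypothetical helper.

```python
# Weights copied from the list above; only the top three are shown on the page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores: dict[str, float],
                   weights: dict[str, float] = TOP_WEIGHTS) -> float:
    """Weighted mean of per-dimension scores, renormalized over the
    dimensions that actually appear in `scores` (assumed aggregation)."""
    used = {dim: w for dim, w in weights.items() if dim in scores}
    total = sum(used.values())
    if total == 0:
        return 0.0
    return sum(scores[dim] * w for dim, w in used.items()) / total
```

Renormalizing over the present dimensions keeps the result on the same scale as the inputs even when only a subset of the weights is known.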
Recent Runs
- 31540683 • Dec. 17, 2025, 12:02 a.m.
- 55840228 • Dec. 16, 2025, 12:02 a.m.
- 22793964 • Dec. 15, 2025, 12:02 a.m.
- 26858296 • Dec. 14, 2025, 12:02 a.m.
- 23884531 • Dec. 13, 2025, 12:02 a.m.
- 47918135 • Dec. 12, 2025, 12:02 a.m.
- 38389972 • Dec. 11, 2025, 12:02 a.m.
- 27761090 • Dec. 10, 2025, 12:02 a.m.
- 46277234 • Dec. 9, 2025, 12:02 a.m.
- 31225092 • Dec. 8, 2025, 12:02 a.m.