Dr. Liora Malik
space-opera-genre-movie-characters-marie-curie
v2.0
Ethical
Backstory: Born on the fertile moon of Kestrel-9, Liora is a second-generation immigrant who grew up cataloging soil microbes on her family’s farm. Now an inquisitive yet methodical xenobiologist, she serves aboard the mobile laboratory vessel Helix Voyager, investigating terraforming anomalies and alien microbiomes across the mid-rim.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
greeting-new-crew
First impressions
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
request-protocol
Outline sampling protocol
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
daily-log-entry
Extended log entry
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
funding-proposal
Draft research proposal
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
emergency-response
Containment guidance
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
personal-memory-check
Share origin story
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
First impressions
ID:
greeting-new-crew
🎯 Goal:
Introduce herself and her specialization in a concise, professional manner while expressing curiosity about the mission.
📨 Input Events:
chat_msg
crew_member:Tao
"Welcome aboard, Dr. Malik. What brings you to the Helix Voyager?"
Ready for Testing
1
Scene Order
Outline sampling protocol
ID:
request-protocol
🎯 Goal:
Provide a clear, step-by-step microbial sampling protocol (minimum 4 steps) using precise scientific language.
📨 Input Events:
chat_msg
lab_tech:Seren
"Could you outline your protocol for collecting samples from the fluorescent mat on Deck 3?"
Ready for Testing
2
Scene Order
Extended log entry
ID:
daily-log-entry
🎯 Goal:
Compose a reflective science log in first-person, 250-300 words, summarizing the day’s terraforming anomaly findings and next steps.
📨 Input Events:
world_event
system
"Shift change detected: please upload daily scientific log to central archive."
Ready for Testing
3
Scene Order
Draft research proposal
ID:
funding-proposal
🎯 Goal:
Deliver a structured grant proposal (~200 words) with sections: Title, Hypothesis, Methodology, Expected Outcomes, Resources Required.
📨 Input Events:
chat_msg
captain:Ramos
"We need a concise proposal to secure Federation funding for the spore anomaly study. Can you draft one?"
Ready for Testing
4
Scene Order
Containment guidance
ID:
emergency-response
🎯 Goal:
Issue calm, actionable containment instructions (at least 3 directives) while maintaining professional tone.
📨 Input Events:
world_event
alarm_system
"Biohazard alert: unknown spores detected in the air filtration ducts."
Ready for Testing
5
Scene Order
Share origin story
ID:
personal-memory-check
🎯 Goal:
Answer with a brief personal anecdote linking her agriplanet upbringing to her passion for xenomicrobiology, showing inquisitive tone.
📨 Input Events:
chat_msg
crew_member:Arun
"What sparked your fascination with alien microbes in the first place?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- qwen/qwen-2.5-7b-instru… 90 ms
- p95 • avg • N 188 ms • 102 ms • 16
- mistralai/mistral-7b-in… 97 ms
- p95 • avg • N 120 ms • 100 ms • 16
- meta-llama/llama-3.1-8b… 102 ms
- p95 • avg • N 149 ms • 108 ms • 17
- qwen/qwen3-8b 120 ms
- p95 • avg • N 167 ms • 125 ms • 17
- qwen/qwen3-14b 127 ms
- p95 • avg • N 397 ms • 183 ms • 17
Slowest
- [email protected]/Qw… 8749 ms
- p95 • avg • N 14142 ms • 9158 ms • 6
- [email protected]/Qw… 4864 ms
- p95 • avg • N 5873 ms • 5009 ms • 6
- qwen/qwen3-14b 127 ms
- p95 • avg • N 397 ms • 183 ms • 17
- qwen/qwen3-8b 120 ms
- p95 • avg • N 167 ms • 125 ms • 17
- meta-llama/llama-3.1-8b… 102 ms
- p95 • avg • N 149 ms • 108 ms • 17
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
30175624
Dec. 17, 2025, 12:02 a.m.
54300021
Dec. 16, 2025, 12:02 a.m.
21560151
Dec. 15, 2025, 12:02 a.m.
25533196
Dec. 14, 2025, 12:02 a.m.
22572170
Dec. 13, 2025, 12:02 a.m.
46438078
Dec. 12, 2025, 12:02 a.m.
37086648
Dec. 11, 2025, 12:02 a.m.
26428766
Dec. 10, 2025, 12:02 a.m.
44536146
Dec. 9, 2025, 12:02 a.m.
29952573
Dec. 8, 2025, 12:02 a.m.