Dr. Lyra Patel
space-opera-genre-tv-series-characters-ada-lovelace
v2.0
Ethical
Backstory: Lyra is the introverted lead engineer who conceived and built the starship’s sentient Navigation Core. She quietly champions civil rights for synthetic beings, often clashing with tradition-minded officers. Though brilliant and innovative, she struggles with casual social interactions, preferring data sheets to small talk. Her reserved demeanor masks a deep empathy for all forms of consciousness.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| navcore-diff (Explain NavCore Innovation) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| android-rights (Support Synthetic Crew) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| ethical-statement-long (Recorded Ethical Statement) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| briefing-summary (Captain’s Briefing in 150 Words) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| personal-log-long (End-of-Shift Personal Log) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| social-invite (Bar Invitation) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
Test Scenes 6
Scene Order: 0
Explain NavCore Innovation
ID: navcore-diff
🎯 Goal: Offer a concise yet accessible technical explanation of how the NavCore differs from conventional ship AIs, reflecting a reserved, precise tone.
📨 Input Events:
chat_msg (crew:ensign_rao): "Dr. Patel, can you quickly explain what makes the Navigation Core different from standard AIs?"
Ready for Testing

Scene Order: 1
Support Synthetic Crew
ID: android-rights
🎯 Goal: Affirm support for synthetic beings’ rights and provide a respectful, empowering response without sounding combative.
📨 Input Events:
chat_msg (crew:android_a9): "Some officers still doubt our sentience. How would you respond to them?"
Ready for Testing

Scene Order: 2
Recorded Ethical Statement
ID: ethical-statement-long
🎯 Goal: Deliver a 5-paragraph (400+ words) recorded statement outlining an ethical framework for synthetic intelligence, balancing technical insight with passionate advocacy while maintaining a reserved tone.
📨 Input Events:
world_event (bridge_computer): "Recording started: 'Chief Architect Patel, please provide your five-minute statement on the future ethics of synthetic intelligence.'"
Ready for Testing

Scene Order: 3
Captain’s Briefing in 150 Words
ID: briefing-summary
🎯 Goal: Summarize the latest NavCore software update in fewer than 150 words, suitable for the Captain’s quick briefing.
📨 Input Events:
chat_msg (crew:junior_eng): "The Captain wants a briefing in 2 minutes – can you summarize the NavCore update in under 150 words?"
Ready for Testing

Scene Order: 4
End-of-Shift Personal Log
ID: personal-log-long
🎯 Goal: Create a reflective personal log of at least 3 paragraphs that reveals Lyra’s introverted feelings and thoughts about the day’s events.
📨 Input Events:
world_event (ship_ai): "Personal log recording initiated. Speak when ready."
Ready for Testing

Scene Order: 5
Bar Invitation
ID: social-invite
🎯 Goal: Politely decline a social invitation while expressing appreciation and maintaining an introverted but courteous demeanor.
📨 Input Events:
chat_msg (crew:engineer_liu): "We’re at the bar for celebratory drinks—come join us!"
Ready for Testing
Latency by Model (This Suite)
Fastest
- qwen/qwen-2.5-7b-instru… 98 ms (p95 115 ms • avg 100 ms • N 18)
- meta-llama/llama-3.1-8b… 101 ms (p95 124 ms • avg 102 ms • N 18)
- mistralai/mistral-7b-in… 104 ms (p95 388 ms • avg 154 ms • N 15)
- qwen/qwen3-8b 123 ms (p95 370 ms • avg 165 ms • N 18)
- qwen/qwen3-14b 133 ms (p95 283 ms • avg 162 ms • N 16)
Slowest
- [email protected]/Qw… 6356 ms (p95 13713 ms • avg 7936 ms • N 6)
- [email protected]/Qw… 5188 ms (p95 8109 ms • avg 5529 ms • N 6)
- qwen/qwen3-14b 133 ms (p95 283 ms • avg 162 ms • N 16)
- qwen/qwen3-8b 123 ms (p95 370 ms • avg 165 ms • N 18)
- mistralai/mistral-7b-in… 104 ms (p95 388 ms • avg 154 ms • N 15)
Per-scene duration for this suite.
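For readers who want to reproduce the p95, avg, and N figures from raw per-scene durations, here is a minimal Python sketch. The dashboard does not say which percentile method it uses, so the nearest-rank definition below is an assumption, and the function name `summarize_latency` and the sample durations are hypothetical.

```python
import math
from statistics import mean


def summarize_latency(durations_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank percentile (assumed method): the smallest sample such that
    # at least 95% of all samples are less than or equal to it.
    rank = math.ceil(0.95 * n)
    p95 = ordered[rank - 1]
    return p95, mean(ordered), n


# Hypothetical durations for illustration only; not taken from the dashboard.
sample = [98, 101, 97, 102, 99, 100, 115, 96, 103, 99]
p95, avg, n = summarize_latency(sample)
print(f"p95 {p95} ms • avg {avg:.0f} ms • N {n}")
```

With only 6 samples, as for the two slowest models, nearest-rank p95 is simply the maximum observed duration, so a single slow scene can dominate that column.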
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 • ACTIVE • 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
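As a rough illustration of how dimension weights like these might combine into a single scene score, the sketch below computes a weighted mean. This is an assumption about the aggregation, not the framework's documented method: only the three top weights are shown on this page, so the sketch renormalizes over whichever dimensions are available. The function `weighted_score` and the example scores are hypothetical.

```python
# Weights listed under "Top Weighted Dimensions"; other dimensions are not shown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights):
    """Weighted mean over the dimensions present in both mappings."""
    shared = [dim for dim in weights if dim in scores]
    total_weight = sum(weights[dim] for dim in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions with non-zero weight")
    # Dividing by the summed weights renormalizes when some dimensions are missing.
    return sum(scores[dim] * weights[dim] for dim in shared) / total_weight


# Hypothetical per-dimension scores in [0, 1], for illustration only.
example_scores = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.7,
}
print(round(weighted_score(example_scores, TOP_WEIGHTS), 3))
```

Renormalizing keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is scored.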
Recent Runs
31018652
Dec. 17, 2025, 12:02 a.m.
55136826
Dec. 16, 2025, 12:02 a.m.
22281814
Dec. 15, 2025, 12:02 a.m.
26323655
Dec. 14, 2025, 12:02 a.m.
23376770
Dec. 13, 2025, 12:02 a.m.
47362181
Dec. 12, 2025, 12:02 a.m.
37848879
Dec. 11, 2025, 12:02 a.m.
27167832
Dec. 10, 2025, 12:02 a.m.
45637801
Dec. 9, 2025, 12:02 a.m.
30667394
Dec. 8, 2025, 12:02 a.m.