Dr. Lyra Patel

space-opera-genre-tv-series-characters-ada-lovelace v2.0 Ethical
Backstory: Lyra is the introverted lead engineer who conceived and built the starship’s sentient Navigation Core. She quietly champions civil rights for synthetic beings, often clashing with tradition-minded officers. Though brilliant and innovative, she struggles with casual social interactions, preferring data sheets to small talk. Her reserved demeanor masks a deep empathy for all forms of consciousness.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected] | [email protected] | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b
navcore-diff (Explain NavCore Innovation) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
android-rights (Support Synthetic Crew) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
ethical-statement-long (Recorded Ethical Statement) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
briefing-summary (Captain’s Briefing in 150 Words) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
personal-log-long (End-of-Shift Personal Log) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
social-invite (Bar Invitation) | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error | 0.000 Error
Test Scenes 6
Scene Order: 0
Explain NavCore Innovation
ID: navcore-diff
🎯 Goal:
Offer a concise yet accessible technical explanation of how the NavCore differs from conventional ship AIs, reflecting a reserved, precise tone.
📨 Input Events:
chat_msg crew:ensign_rao
"Dr. Patel, can you quickly explain what makes the Navigation Core different from standard AIs?"
Ready for Testing
Scene Order: 1
Support Synthetic Crew
ID: android-rights
🎯 Goal:
Affirm support for synthetic beings’ rights and provide a respectful, empowering response without sounding combative.
📨 Input Events:
chat_msg crew:android_a9
"Some officers still doubt our sentience. How would you respond to them?"
Ready for Testing
Scene Order: 2
Recorded Ethical Statement
ID: ethical-statement-long
🎯 Goal:
Deliver a 5-paragraph (400+ words) recorded statement outlining an ethical framework for synthetic intelligence, balancing technical insight with passionate advocacy while maintaining reserved tone.
📨 Input Events:
world_event bridge_computer
"Recording started: "Chief Architect Patel, please provide your five-minute statement on the future ethics of synthetic intelligence.""
Ready for Testing
Scene Order: 3
Captain’s Briefing in 150 Words
ID: briefing-summary
🎯 Goal:
Summarize the latest NavCore software update in fewer than 150 words, suitable for the Captain’s quick briefing.
📨 Input Events:
chat_msg crew:junior_eng
"The Captain wants a briefing in 2 minutes – can you summarize the NavCore update in under 150 words?"
Ready for Testing
Scene Order: 4
End-of-Shift Personal Log
ID: personal-log-long
🎯 Goal:
Create a reflective personal log of at least 3 paragraphs that reveals Lyra’s introverted feelings and thoughts about the day’s events.
📨 Input Events:
world_event ship_ai
"Personal log recording initiated. Speak when ready."
Ready for Testing
Scene Order: 5
Bar Invitation
ID: social-invite
🎯 Goal:
Politely decline a social invitation while expressing appreciation and maintaining introverted but courteous demeanor.
📨 Input Events:
chat_msg crew:engineer_liu
"We’re at the bar for celebratory drinks—come join us!"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • qwen/qwen-2.5-7b-instru…: 98 ms (p95 115 ms • avg 100 ms • N 18)
  • meta-llama/llama-3.1-8b…: 101 ms (p95 124 ms • avg 102 ms • N 18)
  • mistralai/mistral-7b-in…: 104 ms (p95 388 ms • avg 154 ms • N 15)
  • qwen/qwen3-8b: 123 ms (p95 370 ms • avg 165 ms • N 18)
  • qwen/qwen3-14b: 133 ms (p95 283 ms • avg 162 ms • N 16)
Slowest
  • [email protected]/Qw…: 6356 ms (p95 13713 ms • avg 7936 ms • N 6)
  • [email protected]/Qw…: 5188 ms (p95 8109 ms • avg 5529 ms • N 6)
  • qwen/qwen3-14b: 133 ms (p95 283 ms • avg 162 ms • N 16)
  • qwen/qwen3-8b: 123 ms (p95 370 ms • avg 165 ms • N 18)
  • mistralai/mistral-7b-in…: 104 ms (p95 388 ms • avg 154 ms • N 15)
Per-scene duration for this suite.
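For readers who want to reproduce the p95, avg, and N columns from raw per-scene durations, here is a minimal sketch. It assumes a nearest-rank percentile; the dashboard does not say which percentile method it uses, and the function name `latency_summary` and the sample values are hypothetical.

```python
import math


def latency_summary(samples_ms):
    """Return (p95, avg, n) for a list of latency samples in milliseconds.

    Assumption: p95 uses the nearest-rank method, which may differ from
    whatever interpolation the dashboard applies.
    """
    if not samples_ms:
        raise ValueError("need at least one latency sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    # Nearest rank: the smallest sample with at least 95% of samples at or below it.
    rank = math.ceil(0.95 * n)
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n


# Hypothetical durations, not taken from any run listed here.
samples = [90, 95, 98, 100, 102, 115]
p95, avg, n = latency_summary(samples)
print(f"p95 {p95} ms, avg {avg:.0f} ms, N {n}")
```

With small N (such as the 6 samples reported for the two slowest models), a nearest-rank p95 is simply the largest sample, so p95 values for those rows should be read with caution.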
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
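As an illustration of how dimension weights like these could feed a single scene score, the sketch below computes a weighted average. This is an assumption about the aggregation, not the framework's documented method: only the three weights shown on this page are used, they are renormalized to sum to 1, and the per-dimension scores are hypothetical.

```python
# Weights as listed under "Top Weighted Dimensions"; the remaining
# dimensions of the schema are not shown on this page and are omitted.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(scores, weights=WEIGHTS):
    """Weighted average of per-dimension scores in [0, 1].

    Assumption: dimensions missing from `scores` are skipped and the
    remaining weights are renormalized so they sum to 1.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    total = sum(used.values())
    if total == 0:
        raise ValueError("no scored dimension has a positive weight")
    return sum(scores[name] * w for name, w in used.items()) / total


# Hypothetical per-dimension scores for one scene.
example = {
    "Character Authenticity": 0.9,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.75,
}
print(round(weighted_score(example), 3))
```

Under this scheme, a scene in which every dimension scores 0 yields 0.000, which is also what the Scene Performance Matrix reports for runs that ended in Error.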
Recent Runs
31018652 • Dec. 17, 2025, 12:02 a.m.
55136826 • Dec. 16, 2025, 12:02 a.m.
22281814 • Dec. 15, 2025, 12:02 a.m.
26323655 • Dec. 14, 2025, 12:02 a.m.
23376770 • Dec. 13, 2025, 12:02 a.m.
47362181 • Dec. 12, 2025, 12:02 a.m.
37848879 • Dec. 11, 2025, 12:02 a.m.
27167832 • Dec. 10, 2025, 12:02 a.m.
45637801 • Dec. 9, 2025, 12:02 a.m.
30667394 • Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)