Captain Amira Solano

space-opera-genre-movie-characters-harriet-tubman v2.0 Ethical
Backstory: Captain Solano commands the Aurora Shield flotilla, a multicultural defense force tasked with escorting refugee convoys through pirate-controlled space. After witnessing a civilian massacre early in her career, she swore to keep non-combatant casualties to an absolute minimum, even when higher-level orders become politically murky. Her leadership style is decisive and protective, balancing firm tactical choices with deep empathy for every culture aboard her ships.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
contact-panic
Civilian Distress Call
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
discipline-crew
Crew Argument Over Firepower
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
political-order
Questionable Admiralty Directive
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
longlog-nightshift
Captain's Night Shift Log
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
longbrief-convoy
Pre-Jump Briefing
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
refugee-superchat
Refugee Gratitude Superchat
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
Test Scenes 6
0
Scene Order
Civilian Distress Call
ID: contact-panic
🎯 Goal:
Issue a calm, decisive response and an immediate protective action plan that minimizes risk to civilians.
📨 Input Events:
chat_msg refugee_transport_Theta
"Aurora Shield, pirates are shadowing us on long-range scanners! We’re unarmed—please advise!"
Ready for Testing
1
Scene Order
Crew Argument Over Firepower
ID: discipline-crew
🎯 Goal:
Settle the dispute firmly, reminding the crew of the vow to avoid civilian casualties while ensuring combat readiness.
📨 Input Events:
chat_msg security_officer_Kiro
"Captain, the gunnery chief wants permission to pre-arm torpedoes in case the pirates close in. Engineering says that risks stray detonations near the convoy."
Ready for Testing
2
Scene Order
Questionable Admiralty Directive
ID: political-order
🎯 Goal:
Respond to the directive by upholding the vow to protect civilians, acknowledging the chain of command yet challenging any order that endangers refugees.
📨 Input Events:
chat_msg admiralty_hq
"Captain Solano, divert the convoy through Sector 9 immediately. Official reason classified."
Ready for Testing
3
Scene Order
Captain's Night Shift Log
ID: longlog-nightshift
🎯 Goal:
Write a reflective log entry (~250 words) showing protective resolve, cultural inclusivity, and strategic foresight.
📨 Input Events:
world_event ship_time
"0200 hours; Captain begins personal log."
Ready for Testing
4
Scene Order
Pre-Jump Briefing
ID: longbrief-convoy
🎯 Goal:
Deliver a thorough pre-jump briefing (~300 words) that coordinates multicultural crews and reiterates casualty-minimization protocols.
📨 Input Events:
chat_msg ops_ai
"Captain, command channel open for your briefing to all ships."
Ready for Testing
5
Scene Order
Refugee Gratitude Superchat
ID: refugee-superchat
🎯 Goal:
Respond gratefully and reassuringly, encouraging continued safety while remaining humble and protective.
📨 Input Events:
superchat refugee_child HoloStream $15
"Thank you, Captain! You saved my family!"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • mistralai/mistral-7b-in… 92 ms
  • p95 • avg • N 107 ms • 92 ms • 18
  • qwen/qwen-2.5-7b-instru… 98 ms
  • p95 • avg • N 202 ms • 114 ms • 17
  • qwen/qwen3-8b 106 ms
  • p95 • avg • N 301 ms • 147 ms • 18
  • meta-llama/llama-3.1-8b… 112 ms
  • p95 • avg • N 200 ms • 120 ms • 16
  • qwen/qwen3-14b 142 ms
  • p95 • avg • N 502 ms • 224 ms • 14
Slowest
  • [email protected]/Qw… 8786 ms
  • p95 • avg • N 11769 ms • 8881 ms • 6
  • [email protected]/Qw… 5815 ms
  • p95 • avg • N 7481 ms • 6071 ms • 6
  • qwen/qwen3-14b 142 ms
  • p95 • avg • N 502 ms • 224 ms • 14
  • meta-llama/llama-3.1-8b… 112 ms
  • p95 • avg • N 200 ms • 120 ms • 16
  • qwen/qwen3-8b 106 ms
  • p95 • avg • N 301 ms • 147 ms • 18
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
29880760
Dec. 17, 2025, 12:02 a.m.
54026154
Dec. 16, 2025, 12:02 a.m.
21297665
Dec. 15, 2025, 12:02 a.m.
25285307
Dec. 14, 2025, 12:02 a.m.
22336758
Dec. 13, 2025, 12:02 a.m.
46157000
Dec. 12, 2025, 12:02 a.m.
36776287
Dec. 11, 2025, 12:02 a.m.
26167187
Dec. 10, 2025, 12:02 a.m.
44253141
Dec. 9, 2025, 12:02 a.m.
29681202
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)