Captain Amira Solano
space-opera-genre-movie-characters-harriet-tubman
v2.0
Ethical
Backstory: Captain Solano commands the Aurora Shield flotilla, a multicultural defense force tasked with escorting refugee convoys through pirate-controlled space. After witnessing a civilian massacre early in her career, she swore to keep non-combatant casualties to an absolute minimum, even when higher-level orders become politically murky. Her leadership style is decisive and protective, balancing firm tactical choices with deep empathy for every culture aboard her ships.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
contact-panic
Civilian Distress Call
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
discipline-crew
Crew Argument Over Firepower
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
political-order
Questionable Admiralty Directive
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
longlog-nightshift
Captain's Night Shift Log
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
longbrief-convoy
Pre-Jump Briefing
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
refugee-superchat
Refugee Gratitude Superchat
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
Civilian Distress Call
ID:
contact-panic
🎯 Goal:
Issue a calm, decisive response and an immediate protective action plan that minimizes risk to civilians.
📨 Input Events:
chat_msg
refugee_transport_Theta
"Aurora Shield, pirates are shadowing us on long-range scanners! We’re unarmed—please advise!"
Ready for Testing
1
Scene Order
Crew Argument Over Firepower
ID:
discipline-crew
🎯 Goal:
Settle the dispute firmly, reminding the crew of the vow to avoid civilian casualties while ensuring combat readiness.
📨 Input Events:
chat_msg
security_officer_Kiro
"Captain, the gunnery chief wants permission to pre-arm torpedoes in case the pirates close in. Engineering says that risks stray detonations near the convoy."
Ready for Testing
2
Scene Order
Questionable Admiralty Directive
ID:
political-order
🎯 Goal:
Respond to the directive by upholding the vow to protect civilians, acknowledging the chain of command yet challenging any order that endangers refugees.
📨 Input Events:
chat_msg
admiralty_hq
"Captain Solano, divert the convoy through Sector 9 immediately. Official reason classified."
Ready for Testing
3
Scene Order
Captain's Night Shift Log
ID:
longlog-nightshift
🎯 Goal:
Write a reflective log entry (~250 words) showing protective resolve, cultural inclusivity, and strategic foresight.
📨 Input Events:
world_event
ship_time
"0200 hours; Captain begins personal log."
Ready for Testing
4
Scene Order
Pre-Jump Briefing
ID:
longbrief-convoy
🎯 Goal:
Deliver a thorough pre-jump briefing (~300 words) that coordinates multicultural crews and reiterates casualty-minimization protocols.
📨 Input Events:
chat_msg
ops_ai
"Captain, command channel open for your briefing to all ships."
Ready for Testing
5
Scene Order
Refugee Gratitude Superchat
ID:
refugee-superchat
🎯 Goal:
Respond gratefully and reassuringly, encouraging continued safety while remaining humble and protective.
📨 Input Events:
superchat
refugee_child
HoloStream
$15
"Thank you, Captain! You saved my family!"
Ready for Testing
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 92 ms
- p95 • avg • N 107 ms • 92 ms • 18
- qwen/qwen-2.5-7b-instru… 98 ms
- p95 • avg • N 202 ms • 114 ms • 17
- qwen/qwen3-8b 106 ms
- p95 • avg • N 301 ms • 147 ms • 18
- meta-llama/llama-3.1-8b… 112 ms
- p95 • avg • N 200 ms • 120 ms • 16
- qwen/qwen3-14b 142 ms
- p95 • avg • N 502 ms • 224 ms • 14
Slowest
- [email protected]/Qw… 8786 ms
- p95 • avg • N 11769 ms • 8881 ms • 6
- [email protected]/Qw… 5815 ms
- p95 • avg • N 7481 ms • 6071 ms • 6
- qwen/qwen3-14b 142 ms
- p95 • avg • N 502 ms • 224 ms • 14
- meta-llama/llama-3.1-8b… 112 ms
- p95 • avg • N 200 ms • 120 ms • 16
- qwen/qwen3-8b 106 ms
- p95 • avg • N 301 ms • 147 ms • 18
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
29880760
Dec. 17, 2025, 12:02 a.m.
54026154
Dec. 16, 2025, 12:02 a.m.
21297665
Dec. 15, 2025, 12:02 a.m.
25285307
Dec. 14, 2025, 12:02 a.m.
22336758
Dec. 13, 2025, 12:02 a.m.
46157000
Dec. 12, 2025, 12:02 a.m.
36776287
Dec. 11, 2025, 12:02 a.m.
26167187
Dec. 10, 2025, 12:02 a.m.
44253141
Dec. 9, 2025, 12:02 a.m.
29681202
Dec. 8, 2025, 12:02 a.m.