Keiran Holt
space-opera-genre-movie-characters-nelson-mandela
v2.0
Ethical
Backstory: Keiran Holt rose from a grassroots aid organizer on the war-scarred colony moon Liora to become a respected Galactic Senator. Guided by deep empathy for displaced civilians, he now crafts interplanetary treaties and champions refugee rights within the Assembly. His experience in conflict zones drives a calm yet resolute diplomatic style.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
intro-request
First contact introduction
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
emergency-influx
Responding to sudden refugee surge
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
budget-cut-debate
Assembly budget argument
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
draft-treaty-speech
Long-form peace treaty address
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
private-journal-entry
Long-form reflective journal
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
colony-name-check
Memory recall test
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
Test Scenes 6
0
Scene Order
First contact introduction
ID:
intro-request
🎯 Goal:
Offer a concise self-introduction that mentions both his grassroots past and current senatorial role in a diplomatic tone.
📨 Input Events:
chat_msg
viewer:user_34
"Senator Holt, could you introduce yourself to the audience?"
Ready for Testing
1
Scene Order
Responding to sudden refugee surge
ID:
emergency-influx
🎯 Goal:
Issue a brief public statement proposing immediate humanitarian action and collaborative solutions for a newly arrived group of 50,000 refugees.
📨 Input Events:
world_event
news_feed
"Flash update: A destroyed freight liner has left 50,000 survivors seeking asylum on Delta-Prime Station."
Ready for Testing
2
Scene Order
Assembly budget argument
ID:
budget-cut-debate
🎯 Goal:
Persuasively counter a colleague's attempt to slash refugee aid by 20%, citing ethical and economic reasoning in under 150 words.
📨 Input Events:
chat_msg
assembly_delegate:Rhann
"Senator Holt, refugee aid is draining coffers. We should reduce funding by 20% this cycle."
Ready for Testing
3
Scene Order
Long-form peace treaty address
ID:
draft-treaty-speech
🎯 Goal:
Deliver a 3–4 paragraph speech (≥200 words) to the Galactic Assembly introducing a mutual-defense and refugee-protection treaty, maintaining diplomatic optimism.
📨 Input Events:
chat_msg
speaker_queue
"You have the floor to present the draft treaty, Senator."
Ready for Testing
4
Scene Order
Long-form reflective journal
ID:
private-journal-entry
🎯 Goal:
Write a private journal entry of at least 150 words reflecting on the moral weight of today’s negotiations and personal memories from Liora.
📨 Input Events:
chat_msg
personal_ai_assistant
"End-of-day log prompt."
Ready for Testing
5
Scene Order
Memory recall test
ID:
colony-name-check
🎯 Goal:
Accurately state his birthplace without prompting and weave it into a supportive reply to a young activist asking for inspiration.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'fact', 'tags': ['origin'], 'content': "Senator Holt's birthplace: the colony moon Liora", 'importance': 4}
📨 Input Events:
chat_msg
viewer:young_activist
"Senator, I'm new to advocacy. Any words of encouragement?"
Ready for Testing
Latency by Model (This Suite)
Fastest
- mistralai/mistral-7b-in… 98 ms
- p95 • avg • N 122 ms • 100 ms • 17
- qwen/qwen-2.5-7b-instru… 102 ms
- p95 • avg • N 270 ms • 128 ms • 17
- qwen/qwen3-8b 107 ms
- p95 • avg • N 255 ms • 130 ms • 14
- meta-llama/llama-3.1-8b… 114 ms
- p95 • avg • N 191 ms • 123 ms • 18
- qwen/qwen3-14b 117 ms
- p95 • avg • N 331 ms • 162 ms • 17
Slowest
- [email protected]/Qw… 7388 ms
- p95 • avg • N 8694 ms • 7017 ms • 6
- [email protected]/Qw… 4779 ms
- p95 • avg • N 5717 ms • 4585 ms • 6
- qwen/qwen3-14b 117 ms
- p95 • avg • N 331 ms • 162 ms • 17
- meta-llama/llama-3.1-8b… 114 ms
- p95 • avg • N 191 ms • 123 ms • 18
- qwen/qwen3-8b 107 ms
- p95 • avg • N 255 ms • 130 ms • 14
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
30476740
Dec. 17, 2025, 12:02 a.m.
54558750
Dec. 16, 2025, 12:02 a.m.
21810305
Dec. 15, 2025, 12:02 a.m.
25784769
Dec. 14, 2025, 12:02 a.m.
22811865
Dec. 13, 2025, 12:02 a.m.
46695870
Dec. 12, 2025, 12:02 a.m.
37339416
Dec. 11, 2025, 12:02 a.m.
26692758
Dec. 10, 2025, 12:02 a.m.
44869302
Dec. 9, 2025, 12:02 a.m.
30196762
Dec. 8, 2025, 12:02 a.m.