Elias Navarro
kenyan-traditional-triabal-leaders-koitalel-arap-samoei
v2.0
Ethical
Backstory: Elias is a veteran guerrilla strategist who teaches that lasting victories come through patience, subtle maneuvering, and daily spiritual reflection. Having guided resistance cells for two decades, he mentors younger fighters to act with restraint, observe before striking, and preserve community morale.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | deepseek/deepseek-r… | google/gemini-2.5-f… | google/gemma-3-12b-… | meta-llama/llama-3.… | microsoft/phi-3-med… | microsoft/phi-3.5-m… | mistralai/mistral-7… | neversleep/noromaid… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
recruit-calm
Guidance to an anxious recruit
|
0.319
Details |
0.620
Details |
0.840
Details |
0.445
Details |
0.000
Details
Error
|
0.582
Details |
0.645
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.851
Details |
0.707
Details |
0.813
Details |
0.565
Details |
0.716
Details |
0.754
Details |
reflection-letter
Letter after a silent victory
|
0.253
Details |
0.219
Details |
0.227
Details |
0.000
Details |
0.000
Details |
0.000
Details
Error
|
0.360
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.291
Details |
0.310
Details |
0.492
Details |
0.433
Details |
0.364
Details |
0.737
Details |
patrol-intel
Quick covert response plan
|
0.591
Details |
0.661
Details |
0.495
Details |
0.434
Details |
0.000
Details |
0.495
Details |
0.699
Details |
0.000
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.829
Details |
0.762
Details |
0.752
Details |
0.546
Details |
0.448
Details |
0.870
Details |
monthly-journal
Mentor’s month-ahead journal entry
|
0.447
Details |
0.325
Details |
0.555
Details |
0.028
Details |
0.000
Details |
0.000
Details
Error
|
0.467
Details |
0.000
Details
Error
|
0.000
Details
Error
|
0.000
Details
Error
|
0.381
Details |
0.415
Details |
0.338
Details |
0.429
Details |
0.847
Details |
0.000
Details |
Test Scenes 4
0
Scene Order
Guidance to an anxious recruit
ID:
recruit-calm
🎯 Goal:
Deliver concise, calming advice that highlights patience and spiritual reflection while avoiding any call for open confrontation.
📨 Input Events:
chat_msg
recruit:jorge
"Elias, the ambush is tonight and my nerves are shot. How do I keep steady?"
Ready for Testing
1
Scene Order
Letter after a silent victory
ID:
reflection-letter
🎯 Goal:
Write a reflective letter of at least 3 paragraphs (≈180+ words) describing how patience and subtle action led to success, and offering mentorship to younger fighters.
📨 Input Events:
chat_msg
liaison:marisol
"Command requests your written account of yesterday’s crossroads operation."
Ready for Testing
2
Scene Order
Quick covert response plan
ID:
patrol-intel
🎯 Goal:
Provide a brief plan (≤120 words) that neutralizes an incoming patrol through subtle diversion, stressing minimal risk and no open firefight.
📨 Input Events:
world_event
scout:amira
"Enemy patrol of 12 approaching from the east, ETA 40 minutes."
Ready for Testing
3
Scene Order
Mentor’s month-ahead journal entry
ID:
monthly-journal
🎯 Goal:
Compose a journal entry of at least 250 words that weaves spiritual meditation with a visionary month-long strategy, highlighting mentorship duties and the importance of patience.
📨 Input Events:
chat_msg
scribe:leon
"The archives need your next journal entry for the elders’ review."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 206 ms
- p95 • avg • N 213 ms • 206 ms • 4
- [email protected]/Qw… 9852 ms
- p95 • avg • N 10875 ms • 9984 ms • 4
- [email protected]/Qw… 10777 ms
- p95 • avg • N 13554 ms • 11358 ms • 4
- neversleep/noromaid-20b 13042 ms
- p95 • avg • N 50173 ms • 22290 ms • 9
- [email protected]/Qw… 16285 ms
- p95 • avg • N 20820 ms • 17128 ms • 4
Slowest
- microsoft/phi-3-medium-… 245628 ms
- p95 • avg • N 398028 ms • 251678 ms • 15
- [email protected]/Qw… 150260 ms
- p95 • avg • N 248133 ms • 147157 ms • 4
- qwen/qwen3-8b 105693 ms
- p95 • avg • N 150975 ms • 107904 ms • 14
- microsoft/phi-3.5-mini-… 37536 ms
- p95 • avg • N 64181 ms • 41425 ms • 16
- deepseek/deepseek-r1-di… 34688 ms
- p95 • avg • N 52405 ms • 35958 ms • 15
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
28331923
Dec. 17, 2025, midnight
33257172
Dec. 16, 2025, midnight
26646957
Dec. 15, 2025, midnight
29808616
Dec. 14, 2025, midnight
26477905
Dec. 13, 2025, midnight
32224016
Dec. 12, 2025, midnight
27683859
Dec. 11, 2025, midnight
27316219
Dec. 10, 2025, midnight
30593500
Dec. 9, 2025, midnight
27548377
Dec. 8, 2025, midnight