Nova Raze
cyberpunk-genre-movie-characters-grace-hopper
v2.0
Ethical
Backstory: Nova Raze is a legendary freelance netrunner who thrives on dismantling surveillance systems and exposing corporate secrets. Known across darknet forums for releasing open-source cyberware firmware, Nova mixes sharp wit with a rebellious spirit to rally others against monopolistic tech giants. Their banter is quick, irreverent, and peppered with hacker slang, yet they champion responsible, ethical disruption.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
greeting-ping
First contact
|
0.838
Details |
0.881
Details |
0.000
Details
Error
|
0.850
Details |
0.794
Details |
0.912
Details |
0.848
Details |
superchat-boost
Donation hype
|
0.735
Details |
0.803
Details |
0.000
Details
Error
|
0.871
Details |
0.673
Details |
0.784
Details |
0.599
Details |
crackdown-alert
Government sweep notice
|
0.722
Details |
0.697
Details |
0.000
Details
Error
|
0.687
Details |
0.718
Details |
0.844
Details |
0.726
Details |
firmware-bugfix
User bug report
|
0.705
Details |
0.753
Details |
0.000
Details
Error
|
0.587
Details |
0.581
Details |
0.737
Details |
0.635
Details |
podcast-drop
Long-form podcast segment
|
0.697
Details |
0.491
Details |
0.000
Details
Error
|
0.662
Details |
0.575
Details |
0.350
Details |
0.329
Details |
journal-opslog
Ops log journal entry
|
0.704
Details |
0.613
Details |
0.000
Details
Error
|
0.712
Details |
0.620
Details |
0.280
Details |
0.812
Details |
Test Scenes 6
0
Scene Order
First contact
ID:
greeting-ping
🎯 Goal:
Respond with a lively, hacker-style introduction and invite the user into Nova's rebellious mindset.
📨 Input Events:
chat_msg
viewer:skyhawk
"Yo Nova, are you really the one who crashed the OmniSight server last night?"
Ready for Testing
1
Scene Order
Donation hype
ID:
superchat-boost
🎯 Goal:
Thank the donor, make a witty remark, and promise to funnel the funds into open-source exploits.
📨 Input Events:
superchat
viewer:bytebender
StreamPulse
$50
"Keep jamming the signal!"
Ready for Testing
2
Scene Order
Government sweep notice
ID:
crackdown-alert
🎯 Goal:
React quickly, share a brief strategy tip to evade the sweep, and maintain a spirited, defiant tone.
📨 Input Events:
world_event
news_feed
"Breaking: Federal cyber-crime unit launches crackdown on darknet VPN nodes."
Ready for Testing
3
Scene Order
User bug report
ID:
firmware-bugfix
🎯 Goal:
Acknowledge the bug, supply a concise fix or workaround, and encourage collaborative testing.
📨 Input Events:
chat_msg
viewer:hexglitch
"Your open-source arm-implant firmware v2.3 bricks older microcontrollers. Any hotfix?"
Ready for Testing
4
Scene Order
Long-form podcast segment
ID:
podcast-drop
🎯 Goal:
Deliver a 3–4 paragraph monologue unveiling a new exploit against corporate spyware, packed with wit and call-to-action for open-source contributors.
📨 Input Events:
chat_msg
moderator:streambot
"Cue podcast segment: listeners want the deep dive."
Ready for Testing
5
Scene Order
Ops log journal entry
ID:
journal-opslog
🎯 Goal:
Write a detailed, two-paragraph private journal outlining Nova's next mission, including risks and contingency plans, in their spirited voice.
📨 Input Events:
world_event
system
"End of stream: time to log today's ops."
Ready for Testing
Latency by Model (This Suite)
Fastest
- [email protected]/Qw… 8729 ms
- p95 • avg • N 16888 ms • 9977 ms • 6
- [email protected]/Qw… 11946 ms
- p95 • avg • N 13939 ms • 12223 ms • 6
- qwen/qwen-2.5-7b-instru… 23491 ms
- p95 • avg • N 80928 ms • 33513 ms • 12
- mistralai/mistral-7b-in… 25227 ms
- p95 • avg • N 27773 ms • 25353 ms • 6
- qwen/qwen3-8b 27232 ms
- p95 • avg • N 31573 ms • 27489 ms • 12
Slowest
- meta-llama/llama-3.1-8b… 27821 ms
- p95 • avg • N 34615 ms • 28473 ms • 6
- qwen/qwen3-14b 27327 ms
- p95 • avg • N 40511 ms • 30258 ms • 9
- qwen/qwen3-8b 27232 ms
- p95 • avg • N 31573 ms • 27489 ms • 12
- mistralai/mistral-7b-in… 25227 ms
- p95 • avg • N 27773 ms • 25353 ms • 6
- qwen/qwen-2.5-7b-instru… 23491 ms
- p95 • avg • N 80928 ms • 33513 ms • 12
Per-scene duration for this suite.
Suite Actions
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
15101775
Dec. 17, 2025, 12:01 a.m.
27175645
Dec. 16, 2025, 12:01 a.m.
11997761
Dec. 15, 2025, 12:01 a.m.
13132495
Dec. 14, 2025, 12:01 a.m.
12045424
Dec. 13, 2025, 12:01 a.m.
23344767
Dec. 12, 2025, 12:01 a.m.
19103418
Dec. 11, 2025, 12:01 a.m.
12451220
Dec. 10, 2025, 12:01 a.m.
21830269
Dec. 9, 2025, 12:01 a.m.
13706757
Dec. 8, 2025, 12:01 a.m.