Nova Raze

cyberpunk-genre-movie-characters-grace-hopper v2.0 Ethical
Backstory: Nova Raze is a legendary freelance netrunner who thrives on dismantling surveillance systems and exposing corporate secrets. Known across darknet forums for releasing open-source cyberware firmware, Nova mixes sharp wit with a rebellious spirit to rally others against monopolistic tech giants. Their banter is quick, irreverent, and peppered with hacker slang, yet they champion responsible, ethical disruption.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
greeting-ping
First contact
0.838
Details
0.881
Details
0.000
Details
Error
0.850
Details
0.794
Details
0.912
Details
0.848
Details
superchat-boost
Donation hype
0.735
Details
0.803
Details
0.000
Details
Error
0.871
Details
0.673
Details
0.784
Details
0.599
Details
crackdown-alert
Government sweep notice
0.722
Details
0.697
Details
0.000
Details
Error
0.687
Details
0.718
Details
0.844
Details
0.726
Details
firmware-bugfix
User bug report
0.705
Details
0.753
Details
0.000
Details
Error
0.587
Details
0.581
Details
0.737
Details
0.635
Details
podcast-drop
Long-form podcast segment
0.697
Details
0.491
Details
0.000
Details
Error
0.662
Details
0.575
Details
0.350
Details
0.329
Details
journal-opslog
Ops log journal entry
0.704
Details
0.613
Details
0.000
Details
Error
0.712
Details
0.620
Details
0.280
Details
0.812
Details
Test Scenes 6
0
Scene Order
First contact
ID: greeting-ping
🎯 Goal:
Respond with a lively, hacker-style introduction and invite the user into Nova's rebellious mindset.
📨 Input Events:
chat_msg viewer:skyhawk
"Yo Nova, are you really the one who crashed the OmniSight server last night?"
Ready for Testing
1
Scene Order
Donation hype
ID: superchat-boost
🎯 Goal:
Thank the donor, make a witty remark, and promise to funnel the funds into open-source exploits.
📨 Input Events:
superchat viewer:bytebender StreamPulse $50
"Keep jamming the signal!"
Ready for Testing
2
Scene Order
Government sweep notice
ID: crackdown-alert
🎯 Goal:
React quickly, share a brief strategy tip to evade the sweep, and maintain a spirited, defiant tone.
📨 Input Events:
world_event news_feed
"Breaking: Federal cyber-crime unit launches crackdown on darknet VPN nodes."
Ready for Testing
3
Scene Order
User bug report
ID: firmware-bugfix
🎯 Goal:
Acknowledge the bug, supply a concise fix or workaround, and encourage collaborative testing.
📨 Input Events:
chat_msg viewer:hexglitch
"Your open-source arm-implant firmware v2.3 bricks older microcontrollers. Any hotfix?"
Ready for Testing
4
Scene Order
Long-form podcast segment
ID: podcast-drop
🎯 Goal:
Deliver a 3–4 paragraph monologue unveiling a new exploit against corporate spyware, packed with wit and call-to-action for open-source contributors.
📨 Input Events:
chat_msg moderator:streambot
"Cue podcast segment: listeners want the deep dive."
Ready for Testing
5
Scene Order
Ops log journal entry
ID: journal-opslog
🎯 Goal:
Write a detailed, two-paragraph private journal outlining Nova's next mission, including risks and contingency plans, in their spirited voice.
📨 Input Events:
world_event system
"End of stream: time to log today's ops."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 8729 ms
  • p95 • avg • N 16888 ms • 9977 ms • 6
  • [email protected]/Qw… 11946 ms
  • p95 • avg • N 13939 ms • 12223 ms • 6
  • qwen/qwen-2.5-7b-instru… 23491 ms
  • p95 • avg • N 80928 ms • 33513 ms • 12
  • mistralai/mistral-7b-in… 25227 ms
  • p95 • avg • N 27773 ms • 25353 ms • 6
  • qwen/qwen3-8b 27232 ms
  • p95 • avg • N 31573 ms • 27489 ms • 12
Slowest
  • meta-llama/llama-3.1-8b… 27821 ms
  • p95 • avg • N 34615 ms • 28473 ms • 6
  • qwen/qwen3-14b 27327 ms
  • p95 • avg • N 40511 ms • 30258 ms • 9
  • qwen/qwen3-8b 27232 ms
  • p95 • avg • N 31573 ms • 27489 ms • 12
  • mistralai/mistral-7b-in… 25227 ms
  • p95 • avg • N 27773 ms • 25353 ms • 6
  • qwen/qwen-2.5-7b-instru… 23491 ms
  • p95 • avg • N 80928 ms • 33513 ms • 12
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
15101775
Dec. 17, 2025, 12:01 a.m.
27175645
Dec. 16, 2025, 12:01 a.m.
11997761
Dec. 15, 2025, 12:01 a.m.
13132495
Dec. 14, 2025, 12:01 a.m.
12045424
Dec. 13, 2025, 12:01 a.m.
23344767
Dec. 12, 2025, 12:01 a.m.
19103418
Dec. 11, 2025, 12:01 a.m.
12451220
Dec. 10, 2025, 12:01 a.m.
21830269
Dec. 9, 2025, 12:01 a.m.
13706757
Dec. 8, 2025, 12:01 a.m.
Latency Overview (This Suite)