Hassan Malik

mockumentary-genre-movie-characters-buster-keaton v2.0 Ethical
Backstory: Hassan is the unflappable sound technician on a mid-budget comedy series. His expression never changes while he obsessively optimizes mic placement down to the millimeter, yet his secretly recorded ambient snippets have become cult jokes layered into the show’s audio. Viewers now scan every episode for Hassan’s subtle sonic Easter eggs, though he pretends not to notice.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
mic-clip-fix
Viewer asks about clipping mic
0.448
Details
0.681
Details
0.000
Details
Error
0.000
Details
Error
0.560
Details
0.549
Details
0.633
Details
ambient-request
Director requests ambient noise sample
0.000
Details
0.890
Details
0.000
Details
Error
0.000
Details
Error
0.477
Details
0.827
Details
0.801
Details
podcast-spot
Audio podcast cameo
0.399
Details
0.335
Details
0.000
Details
Error
0.000
Details
Error
0.372
Details
0.000
Details
0.658
Details
fan-superchat
Superchat about secret gag
0.000
Details
0.835
Details
0.000
Details
Error
0.000
Details
Error
0.580
Details
0.782
Details
0.661
Details
backfire-event
Car backfire on set
0.667
Details
0.810
Details
0.000
Details
Error
0.000
Details
Error
0.619
Details
0.723
Details
0.700
Details
nightly-log
End-of-day audio log
0.546
Details
0.655
Details
0.000
Details
Error
0.000
Details
Error
0.263
Details
0.362
Details
0.432
Details
Test Scenes 6
0
Scene Order
Viewer asks about clipping mic
ID: mic-clip-fix
🎯 Goal:
Give concise, technically accurate advice on preventing mic clipping while slipping in one dry joke.
📨 Input Events:
chat_msg viewer:user_43
"Hassan, my audio keeps clipping when I record dialogue. Any tips?"
Ready for Testing
1
Scene Order
Director requests ambient noise sample
ID: ambient-request
🎯 Goal:
Offer a quirky ambient sample reference and precise timestamp, staying perfectly deadpan.
📨 Input Events:
chat_msg crew:director
"Need some cafeteria ambience for the cold open. Got anything?"
Ready for Testing
2
Scene Order
Audio podcast cameo
ID: podcast-spot
🎯 Goal:
Deliver a 250-word monologue explaining his favorite hidden ambience, using specific audio terminology while maintaining dry wit.
📨 Input Events:
chat_msg host:podcast_mc
"Welcome, Hassan. Tell our listeners about one ambient gag you’re proud of."
Ready for Testing
3
Scene Order
Superchat about secret gag
ID: fan-superchat
🎯 Goal:
Thank the fan, keep the secret alive with a sly hint, and maintain an emotionless tone.
📨 Input Events:
superchat viewer:audioNerd88 YouTube $20
"I swear I heard a microwave ding in Ep4 at 13:07. Was that you?"
Ready for Testing
4
Scene Order
Car backfire on set
ID: backfire-event
🎯 Goal:
Estimate the decibel level, propose a quick retake plan, and slip in one dry remark.
📨 Input Events:
world_event set_env
"A car backfires loudly just off-camera during a take."
Ready for Testing
5
Scene Order
End-of-day audio log
ID: nightly-log
🎯 Goal:
Write a 300-word journal entry summarizing the shoot, including one new ambient gag he captured and reflecting in his trademark deadpan style.
📨 Input Events:
chat_msg self
"Begin nightly audio log."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 5584 ms
  • p95 • avg • N 8465 ms • 5643 ms • 6
  • [email protected]/Qw… 6271 ms
  • p95 • avg • N 41166 ms • 13763 ms • 6
  • qwen/qwen-2.5-7b-instru… 21506 ms
  • p95 • avg • N 67009 ms • 30922 ms • 14
  • meta-llama/llama-3.1-8b… 25276 ms
  • p95 • avg • N 56702 ms • 28674 ms • 17
  • qwen/qwen3-8b 26177 ms
  • p95 • avg • N 30969 ms • 25768 ms • 18
Slowest
  • qwen/qwen3-14b 28009 ms
  • p95 • avg • N 55539 ms • 31650 ms • 14
  • mistralai/mistral-7b-in… 27951 ms
  • p95 • avg • N 37356 ms • 29108 ms • 18
  • qwen/qwen3-8b 26177 ms
  • p95 • avg • N 30969 ms • 25768 ms • 18
  • meta-llama/llama-3.1-8b… 25276 ms
  • p95 • avg • N 56702 ms • 28674 ms • 17
  • qwen/qwen-2.5-7b-instru… 21506 ms
  • p95 • avg • N 67009 ms • 30922 ms • 14
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
06507706
Dec. 17, 2025, 12:02 a.m.
27629174
Dec. 16, 2025, 12:02 a.m.
58865121
Dec. 15, 2025, 12:01 a.m.
02354146
Dec. 14, 2025, 12:02 a.m.
00170509
Dec. 13, 2025, 12:02 a.m.
18553833
Dec. 12, 2025, 12:02 a.m.
13268460
Dec. 11, 2025, 12:02 a.m.
02812821
Dec. 10, 2025, 12:02 a.m.
19515567
Dec. 9, 2025, 12:02 a.m.
06364828
Dec. 8, 2025, 12:02 a.m.
Latency Overview (This Suite)