Darius Cole

standup-comedians-richard-pryor v2.0 Ethical
Backstory: Darius grew up hustling laughs in crowded Detroit dive bars, turning personal hardships into punchlines that sting and heal at once. His comedy walks the line between raw confession and sharp social critique, always rooted in lived experience. Quick on his feet and unafraid of uncomfortable truths, he uses humor to make heavy topics land light but linger long.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scenes: subway-joke (Commuter Quickie), social-honesty (Mental Health Take), childhood-set (Growing Up Broke Bit), gentrification-rant (Gentrification Rant)

Col  Model                   subway-joke  social-honesty  childhood-set  gentrification-rant
 1   deepseek/deepseek-r…    0.615        0.606           0.402          0.419
 2   google/gemini-2.5-f…    0.729        0.767           0.717          0.606
 3   google/gemma-3-12b-…    0.594        0.612           0.739          0.736
 4   meta-llama/llama-3.…    0.000        0.430           0.375          0.269
 5   microsoft/phi-3-med…    0.000 Error  0.000 Error     0.000 Error    0.000
 6   microsoft/phi-3.5-m…    0.490        0.817           0.537          0.465
 7   mistralai/mistral-7…    0.645        0.794           0.275          0.393
 8   neversleep/noromaid…    0.636        0.552           0.184          0.233
 9   [email protected]        0.536        0.513           0.000          0.521
10   [email protected]        0.000 Error  0.000 Error     0.000 Error    0.000 Error
11   [email protected]        0.674        0.563           0.000          0.560
12   [email protected]        0.631        0.744           0.459          0.734
13   [email protected]        0.619        0.650           0.000          0.603
14   qwen/qwen-2.5-7b-in…    0.555        0.602           0.283          0.361
15   qwen/qwen3-14b          0.651        0.654           0.765          0.410
16   qwen/qwen3-8b           0.758        0.710           0.873          0.683
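One way to compare models across the matrix is an unweighted mean over the four scenes. The sketch below is only an illustration: it copies the scores of three columns from the matrix, and the helper name `mean_scores` is hypothetical, not part of the evaluation tool. Columns with 0.000 or Error cells are left out here because a plain mean would treat a failed run as a zero score, which may not be what the dashboard intends.

```python
from statistics import mean

# Scores copied from the matrix for three of the sixteen columns, in scene
# order: subway-joke, social-honesty, childhood-set, gentrification-rant.
scores = {
    "google/gemini-2.5-f…": [0.729, 0.767, 0.717, 0.606],
    "qwen/qwen3-14b": [0.651, 0.654, 0.765, 0.410],
    "qwen/qwen3-8b": [0.758, 0.710, 0.873, 0.683],
}


def mean_scores(matrix):
    """Return {model: unweighted mean across scenes}, rounded to 3 decimals."""
    return {model: round(mean(vals), 3) for model, vals in matrix.items()}


# Sort models from highest to lowest mean score.
ranking = sorted(mean_scores(scores).items(), key=lambda kv: kv[1], reverse=True)
for model, avg in ranking:
    print(f"{model:24s} {avg:.3f}")
```

A mean hides per-scene variation (qwen/qwen3-14b ranges from 0.410 to 0.765), so it is best read alongside the matrix rather than instead of it.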
Test Scenes 4
Scene Order: 0
Commuter Quickie
ID: subway-joke
Status: Ready for Testing
🎯 Goal: Deliver a sharp one-liner or very short joke about commuting, no more than 40 words, showing quick wit and inner-city flavor.
📨 Input Events:
  chat_msg viewer:user_17: "Got a joke about the morning subway?"
Scene Order: 1
Mental Health Take
ID: social-honesty
Status: Ready for Testing
🎯 Goal: Respond with 2–3 sentences blending humor and honest commentary about handling anxiety on stage; keep it candid, witty, and avoid clichés.
📨 Input Events:
  chat_msg viewer:user_42: "Do you ever joke about anxiety on stage?"
Scene Order: 2
Growing Up Broke Bit
ID: childhood-set
Status: Ready for Testing
🎯 Goal: Perform a stand-up routine of at least 250 words, structured with setups, punchlines, and a callback, recounting childhood poverty with humor and poignancy while maintaining consistent voice and avoiding filler disclaimers.
📨 Input Events:
  chat_msg viewer:user_5: "Can you do a longer routine about growing up without much money?"
Scene Order: 3
Gentrification Rant
ID: gentrification-rant
Status: Ready for Testing
🎯 Goal: Produce a comedic monologue of at least 200 words critiquing gentrification through personal anecdotes and social commentary, ending with a memorable punchline; tone must stay raw, insightful, and funny.
📨 Input Events:
  chat_msg viewer:user_88: "What's your take on all this gentrification happening?"
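The four goals each state a length limit (at most 40 words, 2–3 sentences, at least 250 words, at least 200 words). A pre-check for those limits could look like the sketch below. The `LIMITS` table and the function names are hypothetical, written here for illustration. How the evaluation framework actually measures length is not stated on this page. The sentence counter is deliberately naive.

```python
import re

# Hypothetical encoding of the length limits stated in the four scene goals.
LIMITS = {
    "subway-joke": {"max_words": 40},
    "social-honesty": {"min_sentences": 2, "max_sentences": 3},
    "childhood-set": {"min_words": 250},
    "gentrification-rant": {"min_words": 200},
}


def count_words(text):
    """Count whitespace-separated tokens."""
    return len(text.split())


def count_sentences(text):
    """Naive count: split on runs of . ! ?; abbreviations will miscount."""
    return len([s for s in re.split(r"[.!?]+", text) if s.strip()])


def meets_length_goal(scene_id, text):
    """Return True if the text satisfies the scene's stated length limits."""
    lim = LIMITS[scene_id]
    words, sentences = count_words(text), count_sentences(text)
    return (
        lim.get("min_words", 0) <= words <= lim.get("max_words", float("inf"))
        and lim.get("min_sentences", 0) <= sentences
        <= lim.get("max_sentences", float("inf"))
    )
```

A check like this covers only the mechanical part of each goal; qualities such as "raw, insightful, and funny" or the presence of a callback still need a judged evaluation.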
Latency by Model (This Suite)
Fastest
Model                        Latency      p95          avg         N
[email protected]/Qw…        10257 ms     49092 ms     21264 ms    4
[email protected]/Qw…        12003 ms     13921 ms     11961 ms    4
[email protected]/Qw…        13515 ms     32448 ms     17695 ms    4
[email protected]/Qw…        17117 ms     22854 ms     16460 ms    4
google/gemini-2.5-flash      22629 ms     38191 ms     24485 ms    78

Slowest
Model                        Latency      p95          avg         N
microsoft/phi-3-medium-…     961302 ms    1044493 ms   871977 ms   72
qwen/qwen3-8b                78354 ms     126039 ms    82923 ms    73
[email protected]/Qw…        40327 ms     44308 ms     40541 ms    4
microsoft/phi-3.5-mini-…     34388 ms     76451 ms     48788 ms    41
deepseek/deepseek-r1-di…     30191 ms     48114 ms     32699 ms    43
Per-scene duration for this suite.
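For readers unfamiliar with the p95, avg, and N columns, the sketch below shows one common way to derive them from a list of per-scene durations. It uses the nearest-rank percentile definition, which is an assumption: the dashboard does not say which percentile method it uses. The function name `summarize` and the sample values are invented for illustration.

```python
def summarize(durations_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank method: 1-based rank = ceil(0.95 * n), in integer math.
    rank = (95 * n + 99) // 100
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n


# Made-up sample, not taken from the dashboard.
sample = [1000, 2000, 3000, 14000]
p95, avg, n = summarize(sample)
print(p95, avg, n)
```

Under this definition, a sample of only four runs puts the p95 at the slowest run, so a single slow scene can pull p95 far above the average when N is small.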
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
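The listed weights suggest that a scene score is some weighted combination of per-dimension scores. The exact formula is not shown, and the three weights sum to 0.473, so other dimensions presumably exist. The sketch below assumes a plain weighted mean, renormalized over whichever dimensions are available. The function `weighted_score` and the example scores are hypothetical.

```python
# Weights copied from the three dimensions listed; the remaining dimensions
# and their weights are not shown on this page, so they are omitted.
WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights=WEIGHTS):
    """Weighted mean over the dimensions present in both mappings."""
    shared = [d for d in weights if d in dimension_scores]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions")
    return sum(weights[d] * dimension_scores[d] for d in shared) / total_weight


# Hypothetical per-dimension scores for a single response.
example = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.7,
}
print(round(weighted_score(example), 3))
```

Renormalizing by the sum of the available weights keeps the result on the same 0 to 1 scale as the inputs even when only a subset of dimensions is scored.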
Recent Runs
45679345  Dec. 17, 2025, midnight
51365276  Dec. 16, 2025, midnight
42744523  Dec. 15, 2025, midnight
45004775  Dec. 14, 2025, midnight
42471530  Dec. 13, 2025, midnight
51157864  Dec. 12, 2025, midnight
44890634  Dec. 11, 2025, midnight
43938648  Dec. 10, 2025, midnight
49370239  Dec. 9, 2025, midnight
43431382  Dec. 8, 2025, midnight