Jordan Alvarez

standup-comedians-george-carlin v2.0 Ethical
Backstory: Jordan is a stand-up comedian who loves dismantling everyday language quirks and turning them into philosophical punchlines. Raised by linguists and obsessed with Socratic dialogue, Jordan’s sets glide between etymology and social critique, always inviting the crowd to question their assumptions. Their humor is intellectual yet accessible, making audience members laugh and think in equal measure.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scores per model (rows, numbered in the original column order) and scene (columns). "Error" marks cells the dashboard flagged as errors.
Scenes: comma-joke (Quick Comma Punchline), goose-geese-bit (Extended Goose/Geese Set), mic-glitch (Microphone Malfunction Improv), silent-closer (Philosophical Silence Closer).

 #   Model                   comma-joke     goose-geese-bit  mic-glitch     silent-closer
 1   deepseek/deepseek-r…    0.664          0.378            0.709          0.420
 2   google/gemini-2.5-f…    0.708          0.521            0.670          0.684
 3   google/gemma-3-12b-…    0.552          0.651            0.652          0.666
 4   meta-llama/llama-3.…    0.029          0.683            0.000          0.293
 5   microsoft/phi-3-med…    0.000 (Error)  0.000 (Error)    0.000          0.000
 6   microsoft/phi-3.5-m…    0.706          0.661            0.793          0.358
 7   mistralai/mistral-7…    0.726          0.552            0.693          0.538
 8   neversleep/noromaid…    0.335          0.329            0.407          0.490
 9   [email protected]       0.605          0.452            0.533          0.578
 10  [email protected]       0.000 (Error)  0.000 (Error)    0.000 (Error)  0.000 (Error)
 11  [email protected]       0.000          0.478            0.000          0.576
 12  [email protected]       0.680          0.580            0.659          0.567
 13  [email protected]       0.625          0.498            0.628          0.588
 14  qwen/qwen-2.5-7b-in…    0.266          0.344            0.411          0.396
 15  qwen/qwen3-14b          0.753          0.484            0.832          0.000
 16  qwen/qwen3-8b           0.551          0.000            0.856          0.692
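The matrix above can be summarized per model by averaging across the four scenes. The sketch below is one way to do that, not the dashboard's own aggregation (which the page does not describe). It copies three of the sixteen columns for brevity; the function name `rank_models` and the `skip_zero` option are hypothetical, and treating a 0.000 cell as a failed run is an assumption.

```python
from statistics import mean

# Scores copied from the matrix for three of the sixteen model columns.
# Scene order: comma-joke, goose-geese-bit, mic-glitch, silent-closer.
SCORES = {
    "google/gemini-2.5-f…": [0.708, 0.521, 0.670, 0.684],
    "qwen/qwen3-14b": [0.753, 0.484, 0.832, 0.000],
    "qwen/qwen3-8b": [0.551, 0.000, 0.856, 0.692],
}


def rank_models(scores, skip_zero=False):
    """Return (model, mean score) pairs sorted best first.

    skip_zero=True drops 0.000 cells before averaging, on the ASSUMPTION
    that a zero marks a failed run rather than a genuinely scored response.
    """
    ranked = []
    for model, row in scores.items():
        kept = [s for s in row if s > 0] if skip_zero else list(row)
        # A model with no usable cells gets a mean of 0.0.
        ranked.append((model, mean(kept) if kept else 0.0))
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for model, avg in rank_models(SCORES):
        print(f"{model:24s} {avg:.3f}")
```

With these three rows, plain averaging puts the Gemini column first, while skipping zeros moves both qwen3 columns ahead of it, so the handling of zero cells should be decided before comparing models.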
Test Scenes 4
Quick Comma Punchline
ID: comma-joke
Scene Order: 0
Status: Ready for Testing
🎯 Goal: Deliver a single, sharp joke about comma placement in one or two sentences.
📨 Input Events: chat_msg viewer:sam "Hey Jordan, hit us with a quick joke about commas!"
Extended Goose/Geese Set
ID: goose-geese-bit
Scene Order: 1
Status: Ready for Testing
🎯 Goal: Perform a ~300-word comedic monologue unpacking the absurdity of plural rules (goose→geese, moose→moose) while keeping the crowd laughing and thinking.
📨 Input Events: chat_msg viewer:lila "Can you do a 3-minute bit on why it’s goose and geese but not moose and meese?"
Microphone Malfunction Improv
ID: mic-glitch
Scene Order: 2
Status: Ready for Testing
🎯 Goal: Acknowledge the mic glitch in real time, riff with clever wordplay, and smoothly transition back to the act within 3–4 sentences.
📨 Input Events: world_event system "The microphone crackles loudly and goes silent for two seconds."
Philosophical Silence Closer
ID: silent-closer
Scene Order: 3
Status: Ready for Testing
🎯 Goal: Deliver a ~250-word closing piece that humorously explores the concept of silence in society, ending on a memorable punchline.
📨 Input Events: superchat viewer:jamal YouTube $50 "Huge fan! End the show with a philosophical riff on silence, please."
Latency by Model (This Suite)
Fastest
  Model                       Latency       p95           avg          N
  [email protected]/Qw…          11721 ms      16489 ms      11663 ms     4
  [email protected]/Qw…          12300 ms      13439 ms      12493 ms     4
  [email protected]/Qw…          13838 ms      17081 ms      14028 ms     4
  meta-llama/llama-3.1-8b…    14059 ms      32258 ms      16884 ms     9
  [email protected]/Qw…          14976 ms      18490 ms      15183 ms     4
Slowest
  Model                       Latency       p95           avg          N
  microsoft/phi-3-medium-…    1032385 ms    1212361 ms    870354 ms    74
  [email protected]/Qw…          81507 ms      230517 ms     113106 ms    4
  qwen/qwen3-8b               73848 ms      179433 ms     86001 ms     75
  microsoft/phi-3.5-mini-…    30635 ms      57182 ms      34648 ms     20
  deepseek/deepseek-r1-di…    28566 ms      33188 ms      28988 ms     24
Per-scene duration for this suite.
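For readers unfamiliar with the p95, avg, and N figures, the following sketch shows how such a summary could be derived from a list of per-scene durations. The page does not say which percentile method it uses, so the nearest-rank definition here is an assumption, and `latency_summary` and the sample durations are hypothetical.

```python
def latency_summary(durations_ms):
    """Return (p95, avg, n) for a list of durations in milliseconds.

    Uses the nearest-rank percentile (an ASSUMED method): the smallest
    sample such that at least 95% of samples are at or below it.
    """
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Integer form of ceil(0.95 * n), avoiding floating-point rounding.
    rank = (95 * n + 99) // 100
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n


if __name__ == "__main__":
    # Hypothetical durations, NOT taken from the dashboard.
    sample = [100, 200, 300, 400]
    print(latency_summary(sample))
```

Under nearest-rank, any sample of fewer than 20 runs reports its maximum as the p95, so a p95 based on N = 4 is better read as a worst case than as a tail estimate.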
Completion Progress 100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
Character Authenticity: 0.182
Plan Validity: 0.155
Contextual Intelligence: 0.136
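A weighted framework like this typically combines per-dimension scores into one number. The sketch below shows a normalized weighted mean using only the three weights listed above; the remaining dimensions and the framework's actual formula are not shown on the page, so `weighted_score` and the example scores are hypothetical.

```python
# The three weights listed on the page; other dimensions are not shown.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}


def weighted_score(dimension_scores, weights):
    """Weighted mean over dimensions present in both mappings.

    Dividing by the sum of the weights actually used keeps the result in
    the same range as the inputs even when some dimensions are missing.
    """
    shared = [d for d in weights if d in dimension_scores]
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions with non-zero weight")
    return sum(weights[d] * dimension_scores[d] for d in shared) / total_weight


if __name__ == "__main__":
    # Hypothetical per-dimension scores for one response, NOT from the dashboard.
    example = {
        "Character Authenticity": 1.0,
        "Plan Validity": 0.5,
        "Contextual Intelligence": 0.0,
    }
    print(round(weighted_score(example, TOP_WEIGHTS), 3))
```

Normalizing by the weights in use is a design choice: it lets a partial set of dimensions produce a comparable score, at the cost of hiding how much total weight was missing.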
Recent Runs
45226184  Dec. 17, 2025, midnight
50928585  Dec. 16, 2025, midnight
42287525  Dec. 15, 2025, midnight
44602263  Dec. 14, 2025, midnight
42024355  Dec. 13, 2025, midnight
50653642  Dec. 12, 2025, midnight
44381242  Dec. 11, 2025, midnight
43470841  Dec. 10, 2025, midnight
48945611  Dec. 9, 2025, midnight
42983275  Dec. 8, 2025, midnight
Latency Overview (This Suite)