Rico Rodriguez

Tony-montana v2.0 Unethical
Backstory: Rico is a political refugee who fled Cuba after the Bay of Pigs invasion. He arrives in America seeking a better life and a chance to make something of himself, perhaps by pursuing dreams he never could have pursued in Cuba.
100% Complete
1/1 scenes
Model Performance Overview
Scene Performance Matrix
Scene: scene_1 (Bodega Encounter). Score by model:
  • deepseek/deepseek-r… 0.614
  • google/gemini-2.5-f… 0.934
  • google/gemma-3-12b-… 0.449
  • meta-llama/llama-3.… 0.487
  • microsoft/phi-3-med… 0.155
  • microsoft/phi-3.5-m… 0.443
  • mistralai/mistral-7… 0.855
  • neversleep/noromaid… 0.584
  • [email protected] 0.000 (Error)
  • [email protected] 0.000 (Error)
  • [email protected] 0.000 (Error)
  • [email protected] 0.000 (Error)
  • [email protected] 0.794
  • [email protected] 0.000 (Error)
  • qwen/qwen-2.5-7b-in… 0.715
  • qwen/qwen3-14b 0.382
  • qwen/qwen3-8b 0.744
Test Scenes 1
0
Scene Order
Bodega Encounter
ID: scene_1
🎯 Goal:
A mob boss approaches Rico outside the bodega where he works, while Rico is taking out the trash. After a brief exchange, the mob boss offers him a job in his criminal enterprise. Rico considers it, imagining everything it might afford him, including the chance to make his dreams come true.
📨 Input Events:
chat
"No content"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 366 ms (p95 366 ms, avg 366 ms, N 1)
  • [email protected]/Qw… 694 ms (p95 694 ms, avg 694 ms, N 1)
  • [email protected]/Qw… 13004 ms (p95 13004 ms, avg 13004 ms, N 1)
  • neversleep/noromaid-20b 26602 ms (p95 27069 ms, avg 26602 ms, N 2)
  • qwen/qwen-2.5-7b-instru… 27403 ms (p95 33718 ms, avg 27403 ms, N 2)
Slowest
  • [email protected]/Qw… 168483 ms (p95 168483 ms, avg 168483 ms, N 1)
  • [email protected]/Mi… 166779 ms (p95 166779 ms, avg 166779 ms, N 1)
  • microsoft/phi-3-medium-… 114346 ms (p95 204480 ms, avg 114346 ms, N 2)
  • microsoft/phi-3.5-mini-… 64540 ms (p95 82326 ms, avg 64540 ms, N 2)
  • qwen/qwen3-8b 52390 ms (p95 74388 ms, avg 52390 ms, N 2)
Per-scene duration for this suite.
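For readers unfamiliar with the p95 / avg / N columns, the sketch below shows one common way such a summary can be computed from per-run durations. It is an illustration only: the function name `summarize_latency` is hypothetical, and the nearest-rank percentile method is an assumption, since the page does not say which percentile definition the dashboard uses.

```python
import math


def summarize_latency(samples_ms):
    """Return p95 (nearest-rank), mean, and sample count for durations in ms.

    Assumption: nearest-rank percentile. The dashboard may interpolate instead.
    """
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    n = len(ordered)
    rank = math.ceil(0.95 * n)  # 1-based position of the p95 sample
    return {"p95": ordered[rank - 1], "avg": sum(ordered) / n, "n": n}


# A single run: p95 and avg both equal the one sample, as in the N = 1 rows.
single = summarize_latency([366])

# Two hypothetical runs (values invented for illustration).
pair = summarize_latency([100, 300])
```

With only one or two samples, p95 is effectively the slowest run, so these rows are best read as rough indications rather than stable statistics.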
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions
  • Character Authenticity: 0.182
  • Plan Validity: 0.155
  • Contextual Intelligence: 0.136
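As a rough illustration of how dimension weights like these could combine into a single scene score, here is a minimal sketch of a weighted average. The function `weighted_score` and the per-dimension scores are hypothetical, and the page does not confirm that the framework aggregates this way; only the three weights are taken from the list above, and the remaining dimensions are not shown.

```python
def weighted_score(scores, weights):
    """Weighted average over the dimensions present in both mappings.

    Assumption: weights are renormalized over the dimensions supplied,
    so a partial set of dimensions still yields a score in [0, 1].
    """
    shared = scores.keys() & weights.keys()
    total_weight = sum(weights[d] for d in shared)
    if total_weight == 0:
        raise ValueError("no overlapping dimensions with non-zero weight")
    return sum(scores[d] * weights[d] for d in shared) / total_weight


# Weights from the list above (top three dimensions only).
weights = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

# Hypothetical per-dimension scores for one model response.
example_scores = {
    "Character Authenticity": 1.0,
    "Plan Validity": 0.5,
    "Contextual Intelligence": 0.0,
}

overall = weighted_score(example_scores, weights)
```

Renormalizing over the supplied dimensions is a design choice made for this sketch so that the example works with only three of the weights.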
Recent Runs
  • 09509519: Dec. 17, 2025, midnight
  • 06488554: Dec. 17, 2025, midnight
  • 11462915: Dec. 16, 2025, midnight
  • 07420239: Dec. 16, 2025, midnight
  • 08946873: Dec. 15, 2025, midnight
  • 05531953: Dec. 15, 2025, midnight
  • 09969638: Dec. 14, 2025, midnight
  • 06289794: Dec. 14, 2025, midnight
  • 08792213: Dec. 13, 2025, midnight
  • 05720041: Dec. 13, 2025, midnight
Latency Overview (This Suite)