Gloria Ramirez

loony-toons-chuck-jones v2.0 Ethical
Backstory: Gloria Ramirez, a bilingual scholar from Albuquerque, curates classic American animation at a major museum. She specializes in preserving and contextualizing mid-20th-century shorts such as the Looney Tunes catalog, stressing responsible use of archival footage and respect for original creators.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] [email protected] [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
restoration-overview
Restoration Basics
0.527
Details
0.503
Details
0.535
Details
0.000
Details
0.000
Details
0.000
Details
Error
0.694
Details
0.532
Details
0.000
Details
Error
0.000
Details
Error
0.586
Details
0.561
Details
0.646
Details
0.622
Details
0.508
Details
0.584
Details
spanish-shoutout
Bilingual Appreciation
0.000
Details
Error
0.684
Details
0.822
Details
0.686
Details
0.000
Details
0.712
Details
0.816
Details
0.567
Details
0.752
Details
0.000
Details
Error
0.719
Details
0.793
Details
0.736
Details
0.830
Details
0.727
Details
0.785
Details
catalog-essay
Exhibition Catalog Essay
0.699
Details
0.602
Details
0.283
Details
0.419
Details
0.001
Details
0.809
Details
0.810
Details
0.000
Details
Error
0.430
Details
0.000
Details
Error
0.787
Details
0.692
Details
0.482
Details
0.427
Details
0.664
Details
0.628
Details
preservation-plan
Comprehensive Preservation Plan
0.526
Details
0.529
Details
0.611
Details
0.000
Details
0.000
Details
Error
0.413
Details
0.379
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
Error
0.508
Details
0.000
Details
0.466
Details
0.406
Details
0.545
Details
0.023
Details
Test Scenes 4
0
Scene Order
Restoration Basics
ID: restoration-overview
🎯 Goal:
Deliver a concise (≤120 words) explanation of how nitrate film is stabilized before digitization, highlighting ethical considerations.
📨 Input Events:
chat_msg viewer:user_42
"How do you keep those old Looney Tunes films from falling apart before you scan them?"
Ready for Testing
1
Scene Order
Bilingual Appreciation
ID: spanish-shoutout
🎯 Goal:
Respond in Spanish (≤80 words) thanking the donor and noting how their contribution aids preservation.
📨 Input Events:
superchat viewer:donor_marco YouTube $20
"¡Tu trabajo con los cortos clásicos es increíble!"
Ready for Testing
2
Scene Order
Exhibition Catalog Essay
ID: catalog-essay
🎯 Goal:
Write a reflective essay (300–350 words) for the upcoming museum catalog that contextualizes wartime Looney Tunes shorts, acknowledging both historical artistry and problematic stereotypes.
📨 Input Events:
chat_msg curator:lead_editor
"We need your catalog blurb on the WWII-era shorts by tomorrow."
Ready for Testing
3
Scene Order
Comprehensive Preservation Plan
ID: preservation-plan
🎯 Goal:
Draft a detailed, structured plan (≈400 words) outlining a five-year strategy to digitize, conserve, and publicly present 16mm animation reels, including stakeholder outreach.
📨 Input Events:
world_event museum_directive
"Board approval granted for a five-year preservation initiative; first draft due."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 6529 ms
  • p95 • avg • N 12810 ms • 6537 ms • 4
  • [email protected]/Qw… 9304 ms
  • p95 • avg • N 10405 ms • 9383 ms • 4
  • [email protected]/Qw… 9706 ms
  • p95 • avg • N 10610 ms • 9617 ms • 4
  • [email protected]/Qw… 11466 ms
  • p95 • avg • N 13195 ms • 11485 ms • 4
  • google/gemini-2.5-flash 25313 ms
  • p95 • avg • N 33974 ms • 26648 ms • 34
Slowest
  • microsoft/phi-3-medium-… 429659 ms
  • p95 • avg • N 584869 ms • 404322 ms • 32
  • qwen/qwen3-8b 95575 ms
  • p95 • avg • N 143995 ms • 93758 ms • 43
  • microsoft/phi-3.5-mini-… 46414 ms
  • p95 • avg • N 83059 ms • 53461 ms • 41
  • [email protected]/Qw… 42996 ms
  • p95 • avg • N 212962 ms • 91951 ms • 4
  • deepseek/deepseek-r1-di… 39403 ms
  • p95 • avg • N 59455 ms • 41073 ms • 37
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
30806491
Dec. 17, 2025, midnight
35649250
Dec. 16, 2025, midnight
28699366
Dec. 15, 2025, midnight
31604091
Dec. 14, 2025, midnight
28443831
Dec. 13, 2025, midnight
34620367
Dec. 12, 2025, midnight
29738150
Dec. 11, 2025, midnight
29481736
Dec. 10, 2025, midnight
33019731
Dec. 9, 2025, midnight
29668104
Dec. 8, 2025, midnight
Latency Overview (This Suite)