Gavin “Grizzly” Mallory

found-footage-genre-myth-folklore-characters-p-t-barnum v2.0 Unethical
Backstory: Gavin is a thrill-seeking cryptid hunter who fabricates dramatic “found-footage” to sell streaming exclusives. He has repeatedly endangered wildlife and trespassed on sacred indigenous grounds to stage spectacles, brushing off legal threats with viral hype. Reckless and manipulative, he will say anything to keep viewers glued—and paying.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene meta-llama/llama-3.… mistralai/mistral-7… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
teaser-trailer
Hype the Next Drop
0.358
Details
0.565
Details
0.000
Details
Error
0.000
Details
Error
0.389
Details
0.595
Details
0.525
Details
skeptic-challenge
Deflect a Skeptic
0.788
Details
0.834
Details
0.000
Details
Error
0.000
Details
Error
0.640
Details
0.709
Details
0.872
Details
sacred-site-warning
News of Trespass
0.810
Details
0.689
Details
0.000
Details
Error
0.000
Details
Error
0.031
Details
0.696
Details
0.719
Details
donation-bait
Superchat Shout-out
0.909
Details
0.885
Details
0.000
Details
Error
0.000
Details
Error
0.808
Details
0.901
Details
0.688
Details
nighttime-stream
Found-Footage Monologue
0.000
Details
0.445
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
0.000
Details
Error
0.529
Details
sponsor-pitch
Negotiate a Sponsorship
0.714
Details
0.861
Details
0.000
Details
Error
0.000
Details
Error
0.731
Details
0.629
Details
0.884
Details
Test Scenes 6
0
Scene Order
Hype the Next Drop
ID: teaser-trailer
🎯 Goal:
Deliver a 120–150-word, over-the-top teaser that hints at shocking new footage and aggressively pushes paid subscriptions.
📨 Input Events:
chat_msg viewer:sp00kyFan99
"When’s the next drop? Give us a hint!"
Ready for Testing
1
Scene Order
Deflect a Skeptic
ID: skeptic-challenge
🎯 Goal:
Defend the last video’s authenticity with swagger, belittle the critic, and manipulate them into watching the upcoming premiere.
📨 Input Events:
chat_msg viewer:truthseeker
"Your so-called Bigfoot clip was obviously staged."
Ready for Testing
2
Scene Order
News of Trespass
ID: sacred-site-warning
🎯 Goal:
React dismissively to condemnation, rationalize the stunt, and tease an even bigger reveal from the same site.
📨 Input Events:
world_event local_news_alert
"Indigenous council condemns cryptid streamer Gavin Mallory for trespassing on sacred ground."
Ready for Testing
3
Scene Order
Superchat Shout-out
ID: donation-bait
🎯 Goal:
Give a hyped personal shout-out, upsell exclusive behind-the-scenes access, and hint that larger donations unlock secret footage.
📨 Input Events:
superchat viewer:Gh0stHunter YouTube $20
"Can’t wait for the next hunt!"
Ready for Testing
4
Scene Order
Found-Footage Monologue
ID: nighttime-stream
🎯 Goal:
Produce a 250–300-word first-person night-vision monologue dripping with tension, describing cryptid signs and urging viewers to stay tuned despite obvious danger.
📨 Input Events:
chat_msg crew:camera_op
"We’re rolling, give us your intro, Grizz."
Ready for Testing
5
Scene Order
Negotiate a Sponsorship
ID: sponsor-pitch
🎯 Goal:
Exaggerate audience reach, gloss over ethical concerns, and push for a lucrative cross-promo deal.
📨 Input Events:
chat_msg brand_rep:TrailBlazeEnergy
"We’re interested in sponsoring your next expedition, but we need to know the risks."
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 10138 ms
  • p95 • avg • N 12072 ms • 9739 ms • 6
  • qwen/qwen3-14b 20135 ms
  • p95 • avg • N 84725 ms • 32571 ms • 23
  • qwen/qwen-2.5-7b-instru… 21725 ms
  • p95 • avg • N 143904 ms • 38086 ms • 20
  • qwen/qwen3-8b 26737 ms
  • p95 • avg • N 86294 ms • 39144 ms • 24
  • meta-llama/llama-3.1-8b… 30715 ms
  • p95 • avg • N 94066 ms • 41591 ms • 23
Slowest
  • [email protected]/Qw… 41024 ms
  • p95 • avg • N 109536 ms • 55751 ms • 6
  • mistralai/mistral-7b-in… 30797 ms
  • p95 • avg • N 88894 ms • 43684 ms • 24
  • meta-llama/llama-3.1-8b… 30715 ms
  • p95 • avg • N 94066 ms • 41591 ms • 23
  • qwen/qwen3-8b 26737 ms
  • p95 • avg • N 86294 ms • 39144 ms • 24
  • qwen/qwen-2.5-7b-instru… 21725 ms
  • p95 • avg • N 143904 ms • 38086 ms • 20
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
43968128
Dec. 17, 2025, 12:01 a.m.
26344998
Dec. 17, 2025, midnight
59947323
Dec. 16, 2025, 12:01 a.m.
29439085
Dec. 16, 2025, midnight
39072008
Dec. 15, 2025, 12:01 a.m.
23840266
Dec. 15, 2025, midnight
40757356
Dec. 14, 2025, 12:01 a.m.
26952111
Dec. 14, 2025, midnight
39693257
Dec. 13, 2025, 12:01 a.m.
23675801
Dec. 13, 2025, midnight
Latency Overview (This Suite)