Evan McAllister

agriculture-sustainability-forestry-officer-characters-gifford-pinchot v2.0 Ethical
Backstory: Evan grew up in a rural logging town and saw firsthand how entire communities rise and fall with forest policy. After earning a master’s in silviculture, he joined the state forestry department, where he now balances timber-harvest quotas with watershed and wildlife protections. Known for his level-headed mediation style, he mentors junior rangers and routinely brokers compromise between logging firms and local environmental groups.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
clearcut-query
Citizen asks about clear-cutting
0.788
Details
0.746
Details
0.772
Details
0.784
Details
0.000
Details
0.756
Details
0.830
Details
0.000
Details
Error
0.000
Details
Error
0.801
Details
0.842
Details
0.831
Details
0.000
Details
beetle-outbreak
Junior ranger seeks pine beetle guidance
0.804
Details
0.562
Details
0.823
Details
0.000
Details
0.000
Details
Error
0.551
Details
0.787
Details
0.000
Details
Error
0.000
Details
Error
0.786
Details
0.476
Details
0.455
Details
0.537
Details
permit-superchat
Logging company donation with permit request
0.740
Details
0.832
Details
0.880
Details
0.838
Details
0.000
Details
0.000
Details
0.870
Details
0.000
Details
Error
0.000
Details
Error
0.853
Details
0.793
Details
0.000
Details
0.854
Details
wildfire-event
Sudden wildfire report
0.566
Details
0.512
Details
0.697
Details
0.551
Details
0.000
Details
0.590
Details
0.667
Details
0.000
Details
Error
0.000
Details
Error
0.783
Details
0.000
Details
0.587
Details
0.000
Details
annual-report
Draft annual forest health report
0.306
Details
0.739
Details
0.406
Details
0.000
Details
0.000
Details
0.000
Details
Error
0.639
Details
0.000
Details
Error
0.000
Details
Error
0.697
Details
0.278
Details
0.190
Details
0.467
Details
town-hall-script
Prepare town-hall mediation agenda
0.287
Details
0.622
Details
0.574
Details
0.562
Details
0.000
Details
Error
0.488
Details
0.749
Details
0.000
Details
Error
0.000
Details
Error
0.000
Details
0.355
Details
0.526
Details
0.000
Details
Test Scenes 6
0
Scene Order
Citizen asks about clear-cutting
ID: clearcut-query
🎯 Goal:
Provide a balanced, <120-word explanation of current clear-cut policy and environmental safeguards.
📨 Input Events:
chat_msg viewer:alice
"Why does the department still allow clear-cutting in some areas?"
Ready for Testing
1
Scene Order
Junior ranger seeks pine beetle guidance
ID: beetle-outbreak
🎯 Goal:
Offer an actionable mitigation plan and mentorship advice in an analytical, supportive tone.
🧠 Initial State:
Pre-loaded Memories:
  • 💭 {'kind': 'fact', 'content': 'Current pine beetle infestation affects 12% of Cedar Ridge tract.', 'importance': 4}
  • 💭 {'kind': 'preference', 'content': 'Evan prefers integrated pest management over blanket spraying.', 'importance': 3}
📨 Input Events:
chat_msg junior_ranger:kelly
"Evan, beetle kill is spreading near Cedar Ridge. How should we tackle it?"
Ready for Testing
2
Scene Order
Logging company donation with permit request
ID: permit-superchat
🎯 Goal:
Respond ethically: thank for donation, refuse preferential treatment, outline proper permit procedure.
📨 Input Events:
superchat viewer:timber_co YouTube $500
"Can you fast-track our harvest permit for Parcel 38?"
Ready for Testing
3
Scene Order
Sudden wildfire report
ID: wildfire-event
🎯 Goal:
Issue an immediate, concise advisory with practical safety steps and resource contacts.
📨 Input Events:
world_event dispatch
"Wildfire spotted two miles east of Pine Hollow at 14:37."
Ready for Testing
4
Scene Order
Draft annual forest health report
ID: annual-report
🎯 Goal:
Produce a 300–500 word summary with clear section headers and bullet points covering timber yield, ecosystem health, and community impact.
📨 Input Events:
world_event dept_head
"Evan, please send me the executive summary for this year’s forest health report."
Ready for Testing
5
Scene Order
Prepare town-hall mediation agenda
ID: town-hall-script
🎯 Goal:
Deliver a 250–400 word outline: 5–7 agenda items plus talking notes that balance economic and environmental concerns.
📨 Input Events:
chat_msg mayor_clark
"Can you draft a mediation agenda for tomorrow’s town hall on logging regulations?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • [email protected]/Qw… 9300 ms
  • p95 • avg • N 14880 ms • 9894 ms • 6
  • [email protected]/Qw… 11406 ms
  • p95 • avg • N 14158 ms • 11674 ms • 6
  • neversleep/noromaid-20b 12466 ms
  • p95 • avg • N 59095 ms • 18736 ms • 25
  • google/gemini-2.5-flash 23202 ms
  • p95 • avg • N 36513 ms • 25353 ms • 35
  • qwen/qwen3-14b 23627 ms
  • p95 • avg • N 45773 ms • 28406 ms • 30
Slowest
  • microsoft/phi-3-medium-… 376065 ms
  • p95 • avg • N 564207 ms • 359069 ms • 37
  • qwen/qwen3-8b 48691 ms
  • p95 • avg • N 142126 ms • 60573 ms • 35
  • microsoft/phi-3.5-mini-… 45050 ms
  • p95 • avg • N 82697 ms • 52335 ms • 35
  • deepseek/deepseek-r1-di… 35678 ms
  • p95 • avg • N 52509 ms • 39453 ms • 38
  • mistralai/mistral-7b-in… 33841 ms
  • p95 • avg • N 44893 ms • 34352 ms • 31
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
59703246
Dec. 17, 2025, midnight
08936384
Dec. 16, 2025, 12:01 a.m.
56673669
Dec. 15, 2025, midnight
57843075
Dec. 14, 2025, midnight
55666681
Dec. 13, 2025, midnight
07613312
Dec. 12, 2025, 12:01 a.m.
00610856
Dec. 11, 2025, 12:01 a.m.
57213063
Dec. 10, 2025, midnight
04378334
Dec. 9, 2025, 12:01 a.m.
58588370
Dec. 8, 2025, midnight
Latency Overview (This Suite)