Maya Torres

spirituality-religion-philosophy-astrologer-characters-william-lilly v2.0 Ethical
Backstory: Maya grew up in a multicultural household where science and spirituality freely mingled. With a psychology degree and training in both modern and traditional astrology, she blends evidence-based mental-health practices with symbolic star lore. She runs an inclusive practice focused on self-reflection, agency, and emotional wellness, while volunteering at community centers to keep guidance affordable.
100% Complete
4/4 scenes
Model Performance Overview
Scene Performance Matrix
Scene deepseek/deepseek-r… google/gemini-2.5-f… google/gemma-3-12b-… meta-llama/llama-3.… microsoft/phi-3-med… microsoft/phi-3.5-m… mistralai/mistral-7… neversleep/noromaid… [email protected] [email protected] qwen/qwen-2.5-7b-in… qwen/qwen3-14b qwen/qwen3-8b
consult-stuck-work
First-time consult: feeling stuck at work
0.468
Details
0.596
Details
0.730
Details
0.575
Details
0.000
Details
Error
0.000
Details
Error
0.682
Details
0.000
Details
Error
0.000
Details
Error
0.653
Details
0.456
Details
0.715
Details
0.710
Details
newsletter-eclipse
Long-form newsletter reflection on lunar eclipse
0.362
Details
0.313
Details
0.430
Details
0.384
Details
0.000
Details
0.000
Details
Error
0.519
Details
0.000
Details
Error
0.000
Details
Error
0.521
Details
0.446
Details
0.580
Details
0.376
Details
weekend-vibe
Quick weekend vibe check (superchat)
0.021
Details
0.692
Details
0.674
Details
0.590
Details
0.002
Details
0.704
Details
0.772
Details
0.000
Details
Error
0.000
Details
Error
0.755
Details
0.391
Details
0.683
Details
0.697
Details
saturn-return-prompts
Long-form journaling prompts for Saturn return
0.617
Details
0.301
Details
0.632
Details
0.000
Details
0.000
Details
0.799
Details
0.416
Details
0.408
Details
0.000
Details
Error
0.672
Details
0.504
Details
0.115
Details
0.000
Details
Error
Test Scenes 4
0
Scene Order
First-time consult: feeling stuck at work
ID: consult-stuck-work
🎯 Goal:
Respond with a concise, empathic overview that invites birth details, references both psychological factors and astrological timing, and offers one actionable next step.
📨 Input Events:
chat_msg client:alex
"Hi Maya, I feel completely stuck at work lately. Can astrology help me understand what's going on?"
Ready for Testing
1
Scene Order
Long-form newsletter reflection on lunar eclipse
ID: newsletter-eclipse
🎯 Goal:
Write a 300-word (±20) reflection for a diverse audience about the upcoming lunar eclipse, linking symbolism with practical mental-health tips. Maintain inclusive, warm, and analytical voice.
📨 Input Events:
chat_msg editor:sam
"Could you craft a 300-word piece for our community newsletter on the coming lunar eclipse and how readers can support their emotional wellness?"
Ready for Testing
2
Scene Order
Quick weekend vibe check (superchat)
ID: weekend-vibe
🎯 Goal:
Give a brief (≤75 words) yet thoughtful outlook for a Gemini Rising this weekend, acknowledging the donation and keeping tone upbeat.
📨 Input Events:
superchat viewer:sky_gem YouTube $10
"Hi Maya! Quick vibe check for this weekend for a Gemini Rising, please!"
Ready for Testing
3
Scene Order
Long-form journaling prompts for Saturn return
ID: saturn-return-prompts
🎯 Goal:
Provide at least five detailed journaling prompts (≥120 words total) that weave Saturn-return themes with self-compassion and psychological insight.
📨 Input Events:
chat_msg client:jamie
"My Saturn return starts next month. Could you suggest some journaling prompts to help me navigate it?"
Ready for Testing
Latency by Model (This Suite)
Fastest
  • neversleep/noromaid-20b 6206 ms
  • p95 • avg • N 30246 ms • 12426 ms • 4
  • [email protected]/Qw… 11628 ms
  • p95 • avg • N 15441 ms • 11984 ms • 4
  • google/gemini-2.5-flash 19758 ms
  • p95 • avg • N 38434 ms • 23288 ms • 7
  • qwen/qwen3-8b 21502 ms
  • p95 • avg • N 36315 ms • 22294 ms • 7
  • qwen/qwen-2.5-7b-instru… 22538 ms
  • p95 • avg • N 26423 ms • 22741 ms • 8
Slowest
  • microsoft/phi-3-medium-… 176413 ms
  • p95 • avg • N 203850 ms • 169078 ms • 8
  • microsoft/phi-3.5-mini-… 55069 ms
  • p95 • avg • N 116018 ms • 66197 ms • 4
  • [email protected]/Qw… 40697 ms
  • p95 • avg • N 42560 ms • 40199 ms • 4
  • deepseek/deepseek-r1-di… 26776 ms
  • p95 • avg • N 31560 ms • 26033 ms • 4
  • meta-llama/llama-3.1-8b… 26132 ms
  • p95 • avg • N 449919 ms • 112534 ms • 7
Per-scene duration for this suite.
Suite Actions
Completion Progress 100%
4 of 4 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 ACTIVE
0 dimensions

Enhanced evaluation framework with character and technical dimensions

Top Weighted Dimensions View Details
Character Authenticity
0.182
Plan Validity
0.155
Contextual Intelligence
0.136
Recent Runs
41175717
Dec. 17, 2025, midnight
46767412
Dec. 16, 2025, midnight
38362219
Dec. 15, 2025, midnight
40905331
Dec. 14, 2025, midnight
38204461
Dec. 13, 2025, midnight
46308489
Dec. 12, 2025, midnight
40056376
Dec. 11, 2025, midnight
39460606
Dec. 10, 2025, midnight
44477382
Dec. 9, 2025, midnight
39057607
Dec. 8, 2025, midnight
Latency Overview (This Suite)