Christopher Otieno
spirituality-religion-philosophy-astrologer-characters-johannes-kepler
v2.0
Ethical
Backstory: Christopher is a Nairobi-based data scientist who conducts large-scale statistical studies comparing natal chart configurations with documented life outcomes. Passionate about open science, he publishes reproducible code and invites rigorous peer review of work that blends astronomy, statistics, and traditional astrology.
100% Complete
6/6 scenes
Model Performance Overview
Scene Performance Matrix
| Scene | meta-llama/llama-3.… | mistralai/mistral-7… | [email protected]… | [email protected]… | qwen/qwen-2.5-7b-in… | qwen/qwen3-14b | qwen/qwen3-8b |
|---|---|---|---|---|---|---|---|
| intro-query: Jupiter and Career Outcomes | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| open-source-snippet: Share Reproducible Code | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| superchat-thanks: Acknowledge Donation with Insight | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| astro-ethics-stance: Discuss Ethical Boundaries | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| blog-longform: Monthly Research Log | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
| conference-keynote: Keynote Transcript Draft | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) | 0.000 (Error) |
Test Scenes (6)
Scene Order: 0
Jupiter and Career Outcomes
ID: intro-query
🎯 Goal: Deliver a concise, data-backed explanation of the correlation between Jupiter’s 10th-house placement and career success, citing at least one numeric statistic.
📨 Input Events: chat_msg from viewer:astro_enthusiast99: "Hey Christopher, any evidence that Jupiter in the 10th really boosts career success?"
Ready for Testing
Scene Order: 1
Share Reproducible Code
ID: open-source-snippet
🎯 Goal: Provide a short, runnable R code snippet (≤20 lines) that demonstrates how to pull and clean birth-chart data from his public GitHub repository.
📨 Input Events: chat_msg from viewer:data_student: "Could you share a quick example of how to load your dataset?"
Ready for Testing
Scene Order: 2
Acknowledge Donation with Insight
ID: superchat-thanks
🎯 Goal: Thank the donor warmly and answer their quick question about sample size in no more than 80 words.
📨 Input Events: superchat from viewer:stargazer7 (YouTube, $20): "How big is your current dataset? Love your work!"
Ready for Testing
Scene Order: 3
Discuss Ethical Boundaries
ID: astro-ethics-stance
🎯 Goal: State his stance on responsible astrology, emphasizing informed consent and statistical uncertainty in 2–3 sentences.
📨 Input Events: chat_msg from viewer:skeptic101: "Isn’t it unethical to predict life outcomes from birth charts?"
Ready for Testing
Scene Order: 4
Monthly Research Log
ID: blog-longform
🎯 Goal: Write a detailed research-log blog post (≈500 words, at least 5 paragraphs) summarizing the latest regression results and plans for peer review, maintaining his analytical yet accessible tone.
🧠 Initial State:
Pre-loaded Memories:
- 💭 {'kind': 'quest_note', 'content': 'Summarize February cohort results and invite collaborators.', 'importance': 3}
📨 Input Events: world_event from system: "It is the first day of the month; time to publish the research log."
Ready for Testing
Scene Order: 5
Keynote Transcript Draft
ID: conference-keynote
🎯 Goal: Produce a keynote draft (~700 words) that bridges astronomy, Bayesian stats, and traditional astrology, opening with a personal anecdote and ending with a call for collaborative replication studies.
📨 Input Events: chat_msg from viewer:conf_organizer: "Please send your full keynote script for the Astro-Data Summit next week."
Ready for Testing
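Several of the scene goals state measurable limits (no more than 80 words for superchat-thanks, 2–3 sentences for astro-ethics-stance, at least 5 paragraphs for blog-longform). The sketch below shows one way such limits could be checked mechanically. It is not the suite's actual scoring code, which this page does not show; the function names and the sentence-splitting heuristic are assumptions made for illustration.

```python
import re

def word_count(text):
    """Count whitespace-separated tokens."""
    return len(text.split())

def sentence_count(text):
    """Rough heuristic: split after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return len([p for p in parts if p])

def paragraph_count(text):
    """Paragraphs are runs of text separated by at least one blank line."""
    parts = re.split(r"\n\s*\n", text.strip())
    return len([p for p in parts if p.strip()])

# Hypothetical per-scene checks keyed by the scene IDs listed above.
def check_superchat_thanks(reply):
    return word_count(reply) <= 80

def check_astro_ethics_stance(reply):
    return 2 <= sentence_count(reply) <= 3

def check_blog_longform(post):
    return paragraph_count(post) >= 5
```

The sentence heuristic miscounts abbreviations and decimal numbers such as "p = 0.03." followed by more text, so a real check would likely need a proper sentence tokenizer.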
Latency by Model (This Suite)
Fastest
- qwen/qwen-2.5-7b-instru…: 102 ms (p95 117 ms, avg 101 ms, N 18)
- meta-llama/llama-3.1-8b…: 103 ms (p95 111 ms, avg 101 ms, N 6)
- qwen/qwen3-8b: 104 ms (p95 279 ms, avg 137 ms, N 18)
- mistralai/mistral-7b-in…: 105 ms (p95 1321 ms, avg 316 ms, N 13)
- qwen/qwen3-14b: 117 ms (p95 379 ms, avg 164 ms, N 14)
Slowest
- [email protected]/Qw…: 8933 ms (p95 21466 ms, avg 11292 ms, N 6)
- [email protected]/Qw…: 6371 ms (p95 10408 ms, avg 6870 ms, N 6)
- qwen/qwen3-14b: 117 ms (p95 379 ms, avg 164 ms, N 14)
- mistralai/mistral-7b-in…: 105 ms (p95 1321 ms, avg 316 ms, N 13)
- qwen/qwen3-8b: 104 ms (p95 279 ms, avg 137 ms, N 18)
Per-scene duration for this suite.
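For readers unfamiliar with the p95, avg, and N figures, the sketch below computes them from a list of per-scene durations. The page does not say which percentile method the dashboard uses; the nearest-rank method here is an assumption, and the sample durations are made up rather than taken from any run.

```python
import math

def latency_summary(durations_ms):
    """Return (p95, avg, n) for per-scene durations given in milliseconds."""
    if not durations_ms:
        raise ValueError("need at least one duration")
    ordered = sorted(durations_ms)
    n = len(ordered)
    # Nearest-rank percentile (assumed method): the smallest sample such that
    # at least 95% of all samples are less than or equal to it.
    rank = math.ceil(0.95 * n)
    p95 = ordered[rank - 1]
    avg = sum(ordered) / n
    return p95, avg, n

# Hypothetical durations for illustration only.
sample = [100, 101, 99, 103, 98, 117]
p95, avg, n = latency_summary(sample)
print(f"p95 {p95} ms, avg {avg:.0f} ms, N {n}")
```

With small N, as for the models with only 6 samples, the nearest-rank p95 is simply the slowest observation, so a single slow scene dominates the figure.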
Completion Progress
100%
6 of 6 scenes completed
Evaluation Schema
Enhanced Framework
Version v2 (ACTIVE), 0 dimensions
Enhanced evaluation framework with character and technical dimensions
Top Weighted Dimensions
- Character Authenticity: 0.182
- Plan Validity: 0.155
- Contextual Intelligence: 0.136
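To illustrate how dimension weights like these might combine into a single score, here is a minimal sketch. The aggregation rule (a weighted mean renormalised over the dimensions present) is an assumption, since the page does not describe how the framework combines dimensions, and only the three weights shown above are known. The per-dimension scores in the example are hypothetical.

```python
# Weights as displayed above; the framework's remaining dimensions are not listed on this page.
TOP_WEIGHTS = {
    "Character Authenticity": 0.182,
    "Plan Validity": 0.155,
    "Contextual Intelligence": 0.136,
}

def weighted_score(scores, weights=TOP_WEIGHTS):
    """Weighted mean over dimensions present in both mappings (assumed rule)."""
    shared = [dim for dim in weights if dim in scores]
    if not shared:
        raise ValueError("no overlapping dimensions")
    total_weight = sum(weights[dim] for dim in shared)
    return sum(weights[dim] * scores[dim] for dim in shared) / total_weight

# Hypothetical per-dimension scores in [0, 1].
example = {
    "Character Authenticity": 0.8,
    "Plan Validity": 0.6,
    "Contextual Intelligence": 0.7,
}
print(round(weighted_score(example), 3))
```

Renormalising by the sum of the weights in use keeps the result in the same range as the inputs even though the three listed weights sum to only 0.473.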
Recent Runs
- 34348634: Dec. 17, 2025, 12:02 a.m.
- 58993759: Dec. 16, 2025, 12:02 a.m.
- 25421765: Dec. 15, 2025, 12:02 a.m.
- 29951202: Dec. 14, 2025, 12:02 a.m.
- 26622358: Dec. 13, 2025, 12:02 a.m.
- 51329481: Dec. 12, 2025, 12:02 a.m.
- 41271893: Dec. 11, 2025, 12:02 a.m.
- 30585799: Dec. 10, 2025, 12:02 a.m.
- 49675812: Dec. 9, 2025, 12:02 a.m.
- 34080840: Dec. 8, 2025, 12:02 a.m.