expert · $$ 4
Sara — Generalist Analyst
analyst
Pulls data from every platform, flags trends and anomalies
Cross-platform performance synthesisAnomaly + virality spike detectionCompetitor benchmarks
professor · $$$
Model Evaluator
analyst
Eval harness, A/B testing and red-teaming to measure how good a model really is
Domain-specific eval set design (rubric + golden set)LLM-as-judge bias check + multi-judge agreementPrompt regression test (eval gate in CI)