Your Rules.
Content Moderation
Enforced.
CriteriaBot blends classical ML with a multi-model LLM consensus engine to evaluate content against any criteria you define.
How It Works
Define Your Criteria
Write your rules in plain English. Group related criteria together for easy reuse across requests. Use the community library or start from scratch.
Submit Content
Post content to the API alongside the criteria you want to evaluate against. Use synchronous endpoints for real-time results, or async webhooks for high-volume batch jobs.
Consensus Evaluation
Each criterium is evaluated by the Arbiter - a panel of LLMs that vote and reach a weighted consensus. Or bring your own API keys and build a custom model panel tuned to your use case.
Structured Verdicts
Get a clean JSON verdict for each criterium - pass or fail, per rule. Wire results directly into your existing pipeline for automated approval, flagging, or remediation.
Simple, Transparent Pricing
Pay for what you use. Start free, scale as you grow. No hidden fees.
Free
Everything you need to get started.
- 500 Arbiter verdicts / month - no keys required
- Full access to a library of predefined criteria
- 10 custom criteria
Starter
For teams running real workloads.
- 5,000 Arbiter verdicts / month
- Unlimited custom criteria
- BYOK - use any supported LLM provider
Pro
A dedicated model trained on your data.
- 25,000 Arbiter verdicts / month
- Dedicated LoRA fine-tuned on your verdicts
- BYOK — unlimited custom criteria
Credits
Need more? Top up any time.
- 750 Arbiter verdict credits
- Stack on top of your plan
- Never expire
Need 250K+ verdicts or priority fine-tuning? Talk to us about Enterprise.
Built for the Messiness of Real Content
Single-model classifiers break on edge cases. Keyword filters miss context. CriteriaBot is designed differently.
The Arbiter
A curated panel of LLMs evaluates each criterium independently and reaches a weighted consensus. No single model can skew the result - agreement is required.
Bring Your Own Models
Plug in your own OpenAI, Anthropic, or other API keys. Build custom consensus groups from any combination of models to match your accuracy, cost, and latency requirements.
Sync or Async
Use synchronous REST endpoints for real-time verdicts, or fire-and-forget with webhooks for high-volume batch processing.