Your Rules.
Content Moderation
Enforced.

CriteriaBot blends classical ML with a multi-model LLM consensus engine to evaluate content against any criteria you define.

How It Works

01

Define Your Criteria

Write your rules in plain English. Group related criteria together for easy reuse across requests. Use the community library or start from scratch.

02

Submit Content

Post content to the API alongside the criteria you want to evaluate against. Use synchronous endpoints for real-time results, or async webhooks for high-volume batch jobs.

03

Consensus Evaluation

Each criterium is evaluated by the Arbiter - a panel of LLMs that vote and reach a weighted consensus. Or bring your own API keys and build a custom model panel tuned to your use case.

04

Structured Verdicts

Get a clean JSON verdict for each criterium - pass or fail, per rule. Wire results directly into your existing pipeline for automated approval, flagging, or remediation.

Simple, Transparent Pricing

Pay for what you use. Start free, scale as you grow. No hidden fees.

Free

$0 / month

Everything you need to get started.

  • 500 Arbiter verdicts / month - no keys required
  • Full access to a library of predefined criteria
  • 10 custom criteria
Get started free
Most popular

Starter

$50 / month

For teams running real workloads.

  • 5,000 Arbiter verdicts / month
  • Unlimited custom criteria
  • BYOK - use any supported LLM provider
Subscribe - $50 / mo

Pro

$200 / month

A dedicated model trained on your data.

  • 25,000 Arbiter verdicts / month
  • Dedicated LoRA fine-tuned on your verdicts
  • BYOK — unlimited custom criteria
Subscribe - $200 / mo

Credits

$10 one-time

Need more? Top up any time.

  • 750 Arbiter verdict credits
  • Stack on top of your plan
  • Never expire
Buy credits - $10

Need 250K+ verdicts or priority fine-tuning? Talk to us about Enterprise.

Built for the Messiness of Real Content

Single-model classifiers break on edge cases. Keyword filters miss context. CriteriaBot is designed differently.

The Arbiter

A curated panel of LLMs evaluates each criterium independently and reaches a weighted consensus. No single model can skew the result - agreement is required.

Bring Your Own Models

Plug in your own OpenAI, Anthropic, or other API keys. Build custom consensus groups from any combination of models to match your accuracy, cost, and latency requirements.

Sync or Async

Use synchronous REST endpoints for real-time verdicts, or fire-and-forget with webhooks for high-volume batch processing.