Explainable Scam & Abuse Detection

Turn vague alerts into defensible decisions.

Built for Trust & Safety and Compliance teams that need speed, clarity, and auditability—without black-box claims or opaque scoring.

Low latency · API-first · Audit-ready outputs


What you get

Explainable moderation

Structured reasons behind every decision—clear enough for reviewers and audits.

Policy-aligned classification

Categories are designed around regulations and platform rules (e.g., earnings claims, impersonation, payment manipulation) for defensible alignment.

Low-latency by design

Fast responses for real-time decisions; the enterprise option targets around 100ms on dedicated, region-matched infrastructure.

Scalable & secure infrastructure

Default cloud API, or a dedicated, isolated environment per enterprise customer.


How it works

  1. Ingest

    Send messages or conversations via API, webhooks, or file import. Multi-language is supported.

  2. Detect & classify

    Identify risky patterns (e.g., guaranteed returns, authority misuse, payment manipulation) with consistent, structured outputs.

  3. Explain & align

    Return clear explanations and mapped policy references so teams can act with confidence.

  4. Act

    Use the response schema to trigger labeling, deprioritization, review queues, or automated actions.
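
The Act step could be wired into a platform roughly as follows. This is a minimal sketch, not Riskor's actual client: the function name `choose_action`, the action labels, and all three thresholds are assumptions made for illustration. It reads only the `flagged` and `category_scores` fields listed in the response format.

```python
from typing import Any, Dict

# Hypothetical thresholds; tune them against your own review outcomes.
AUTO_ACTION_THRESHOLD = 0.95
REVIEW_THRESHOLD = 0.70
DEPRIORITIZE_THRESHOLD = 0.40


def choose_action(response: Dict[str, Any]) -> str:
    """Map one detection response to a single downstream action."""
    scores = response.get("category_scores", {})
    top_score = max(scores.values(), default=0.0)

    if not response.get("flagged", False):
        # Unflagged content with a borderline score is only deprioritized.
        return "deprioritize" if top_score >= DEPRIORITIZE_THRESHOLD else "allow"
    if top_score >= AUTO_ACTION_THRESHOLD:
        return "auto_action"
    if top_score >= REVIEW_THRESHOLD:
        return "review_queue"
    # Flagged, but with low confidence: attach a label and take no further action.
    return "label"
```

Keeping the thresholds as named constants makes each routing decision easy to cite in an audit trail alongside the returned `reason`.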

Response format (simplified)
Field | Description
id / model / version | Response metadata for tracing and audits.
flagged | Boolean indicating whether the content is risky.
category | Primary category matched (e.g., A.GuaranteedProfitInducement/GuaranteedReturnClaim).
categories | Per-category booleans to indicate rule-level hits.
category_scores | Per-category scores (0–1) for confidence and ranking.
reason | Short, human-readable explanation aligned to policy language.
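
To make the fields concrete, here is a hedged sketch of how a client might parse and sanity-check a response. The field names come from the list above; the JSON shape, the sample values (including `id`, `model`, `version`, the score, and the `reason` text), and the `DetectionResult` and `parse_result` names are assumptions for illustration, not the documented wire format.

```python
import json
from dataclasses import dataclass, fields
from typing import Dict


@dataclass(frozen=True)
class DetectionResult:
    id: str
    model: str
    version: str
    flagged: bool
    category: str
    categories: Dict[str, bool]
    category_scores: Dict[str, float]
    reason: str


def parse_result(raw: str) -> DetectionResult:
    """Parse a JSON response and check that every score lies in [0, 1]."""
    data = json.loads(raw)
    # A missing field raises KeyError, so schema drift fails loudly.
    result = DetectionResult(**{f.name: data[f.name] for f in fields(DetectionResult)})
    for name, score in result.category_scores.items():
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"score for {name!r} out of range: {score}")
    return result


# Hypothetical payload; every value here is a placeholder.
SAMPLE = json.dumps({
    "id": "resp-0001",
    "model": "example-model",
    "version": "0.0.0",
    "flagged": True,
    "category": "A.GuaranteedProfitInducement/GuaranteedReturnClaim",
    "categories": {"A.GuaranteedProfitInducement/GuaranteedReturnClaim": True},
    "category_scores": {"A.GuaranteedProfitInducement/GuaranteedReturnClaim": 0.93},
    "reason": "Message promises a guaranteed return on investment.",
})

result = parse_result(SAMPLE)
```

Storing `id`, `model`, and `version` with each decision lets a reviewer trace a disputed outcome back to the exact release that produced it.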

Continuous improvement

A closed-loop quality process to maintain precision, control false positives, and adapt to emerging patterns—without exposing internal methods.

  1. Monitor signals

    Detect language shifts and new risk surfaces.

  2. Evaluate

    Run fixed evals and boundary checks.

  3. Policy checks

    Re-verify rule mapping and audit readiness.

  4. Release

    Versioned updates with change notes.

  5. Feedback & review

    Incorporate reviewer outcomes and incident learnings.

  6. Coverage review

    Confirm category coverage and quality thresholds.

    ↺ repeats
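
As an illustration of the Evaluate and Release steps, a release gate on a fixed eval set might look like the sketch below. This is not a description of Riskor's internal process: the function names, the input shape, and the default thresholds (0.95 precision, 0.02 false-positive rate) are all assumptions.

```python
from typing import List, Tuple

# Each pair is (predicted_flagged, actually_risky) for one eval example.
Labeled = List[Tuple[bool, bool]]


def precision_and_fpr(labeled: Labeled) -> Tuple[float, float]:
    """Compute precision and false-positive rate over a fixed eval set."""
    tp = sum(1 for pred, truth in labeled if pred and truth)
    fp = sum(1 for pred, truth in labeled if pred and not truth)
    tn = sum(1 for pred, truth in labeled if not pred and not truth)
    # Convention: with nothing flagged, precision is treated as 1.0.
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    # Convention: with no benign examples, the false-positive rate is 0.0.
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return precision, fpr


def release_allowed(
    labeled: Labeled,
    min_precision: float = 0.95,  # hypothetical quality threshold
    max_fpr: float = 0.02,        # hypothetical false-positive ceiling
) -> bool:
    """Return True only if both quality thresholds hold on the eval set."""
    precision, fpr = precision_and_fpr(labeled)
    return precision >= min_precision and fpr <= max_fpr
```

Running the same fixed set before every versioned update keeps releases comparable from one cycle to the next.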

Dedicated infrastructure for enterprise clients

Each enterprise customer runs on a dedicated server—no shared traffic—delivering predictable performance and low latency for real-time decisions.

  1. Exclusive server deployment

    Your own isolated environment, with no noisy neighbors.

  2. Consistent low latency

    Region-matched hosting typically delivers responses in around 100ms.

  3. Enhanced isolation

    Dedicated compute and networking resources for each client.

Aspect | Shared server | Riskor Dedicated
Resources | Shared with multiple tenants | Dedicated instance per client
Latency | ≈300ms (variable) | ≈100ms (region-matched)
Scalability | Limited by shared capacity | Independent vertical / horizontal scaling
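
Teams that want to check latency figures like these against their own traffic could summarize measured round-trip times as percentiles. The sketch below assumes you have already collected timings in milliseconds; the function names are illustrative, and any budget you pass in is your own target, not a guarantee stated here.

```python
import statistics
from typing import Dict, List


def latency_summary(samples_ms: List[float]) -> Dict[str, float]:
    """Summarize latency samples (at least two) as p50, p95, and p99."""
    # quantiles(n=100) returns 99 cut points; index 94 is the 95th percentile.
    cuts = statistics.quantiles(samples_ms, n=100)
    return {
        "p50": statistics.median(samples_ms),
        "p95": cuts[94],
        "p99": cuts[98],
    }


def within_budget(samples_ms: List[float], p95_budget_ms: float) -> bool:
    """Check whether the 95th-percentile latency meets a chosen budget."""
    return latency_summary(samples_ms)["p95"] <= p95_budget_ms
```

Judging on p95 rather than the mean matters for shared capacity, where a few slow responses can hide behind a healthy average.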

FAQ

How is Riskor different from black-box moderation?

We return structured explanations so reviewers can see exactly why a message is risky—supporting faster decisions and clearer dispute handling.

Will this over-flag legitimate marketing language?

We control boundaries with targeted evals and ambiguous-case checks to reduce false positives while keeping risky claims in sight.

Do you support multiple languages?

Yes. Riskor supports 100+ languages. Accuracy is strongest in English and near-English in at least nine other major languages, including Chinese, Spanish, French, German, Italian, Portuguese, Russian, Arabic, and Japanese.

How do we deploy?

Use the default cloud API, or choose a dedicated, isolated environment per enterprise customer. Region-matched hosting keeps latency low.

Secure your platform with BoostLayer

Mapped to enforceable policies. <300ms at edge. Exportable reasoning.