Explainable Scam & Abuse Detection
Built for Trust & Safety and Compliance teams that need speed, clarity, and auditability—without black-box claims or opaque scoring.
Low latency · API-first · Audit-ready outputs
Structured reasons behind every decision—clear enough for reviewers and audits.
Categories are designed around regulations and platform rules (e.g., earnings claims, impersonation, payment manipulation) for defensible alignment.
Fast responses for real-time decisions; the enterprise option typically responds in around 100ms on dedicated, region-matched infrastructure.
Default cloud API, or a dedicated, isolated environment per enterprise customer.
Send messages or conversations via API, webhooks, or file import. Multiple languages are supported.
Identify risky patterns (e.g., guaranteed returns, authority misuse, payment manipulation) with consistent, structured outputs.
Return clear explanations and mapped policy references so teams can act with confidence.
Use the response schema to trigger labeling, deprioritization, review queues, or automated actions.
Field | Description |
---|---|
id / model / version | Response metadata for tracing and audits. |
flagged | Boolean indicating whether the content is risky. |
category | Primary category matched (e.g., A.GuaranteedProfitInducement/GuaranteedReturnClaim). |
categories | Per-category booleans to indicate rule-level hits. |
category_scores | Per-category scores (0–1) for confidence and ranking. |
reason | Short, human-readable explanation aligned to policy language. |
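As a rough illustration of how the response fields above might drive labeling, deprioritization, review queues, or automated actions, here is a minimal Python sketch. The field names come from the table. The thresholds, the action names, and the sample response values are assumptions made up for this example, not documented Riskor behavior.

```python
# Illustrative thresholds (assumptions, not documented Riskor values).
AUTO_ACTION_THRESHOLD = 0.9
REVIEW_THRESHOLD = 0.5


def route(response: dict) -> str:
    """Map a detection response to a downstream action name (hypothetical names)."""
    if not response.get("flagged", False):
        return "allow"
    # Use the score of the primary category to decide how to escalate.
    category = response.get("category")
    score = response.get("category_scores", {}).get(category, 0.0)
    if score >= AUTO_ACTION_THRESHOLD:
        return "auto_action"
    if score >= REVIEW_THRESHOLD:
        return "review_queue"
    return "deprioritize"


# Hypothetical response: field names follow the table, values are invented.
example = {
    "id": "resp-001",
    "model": "example-model",
    "version": "0.0",
    "flagged": True,
    "category": "A.GuaranteedProfitInducement/GuaranteedReturnClaim",
    "categories": {"A.GuaranteedProfitInducement/GuaranteedReturnClaim": True},
    "category_scores": {"A.GuaranteedProfitInducement/GuaranteedReturnClaim": 0.93},
    "reason": "Message promises a guaranteed return.",
}

print(route(example))  # prints "auto_action"
```

Keeping the `reason` and `version` fields alongside the chosen action in your own logs would let reviewers and auditors trace why an item was routed the way it was.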
A closed-loop quality process to maintain precision, control false positives, and adapt to emerging patterns—without exposing internal methods.
Detect language shifts and new risk surfaces.
Run fixed evals and boundary checks.
Re-verify rule mapping and audit readiness.
Versioned updates with change notes.
Incorporate reviewer outcomes and incident learnings.
Confirm category coverage and quality thresholds.
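The steps above do not describe Riskor's internal methods, and this sketch does not attempt to reproduce them. It only shows, under assumed inputs, how a team consuming the `flagged` field could track precision and false positive rate on its own fixed, reviewer-labeled eval set. The function name and the sample data are hypothetical.

```python
def precision_and_fpr(labels, predictions):
    """Return (precision, false positive rate) for boolean labels and predictions."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y and p)          # risky and flagged
    fp = sum(1 for y, p in pairs if not y and p)      # benign but flagged
    tn = sum(1 for y, p in pairs if not y and not p)  # benign and not flagged
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return precision, fpr


# Invented eval data: reviewer labels vs. the `flagged` value returned per message.
reviewer_labels = [True, True, False, False, False]
flagged_values = [True, False, True, False, False]

precision, fpr = precision_and_fpr(reviewer_labels, flagged_values)
print(f"precision={precision:.2f} fpr={fpr:.2f}")  # prints "precision=0.50 fpr=0.33"
```

Running the same fixed set against each new `version` gives a simple before-and-after comparison when versioned updates are released.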
Each enterprise customer runs on a dedicated server—no shared traffic—delivering predictable performance and low latency for real-time decisions.
Your own isolated environment—no noisy neighbors.
Region-matched hosting, with latency typically around 100ms.
Dedicated compute and networking resources for each client.
Feature | Shared server | Riskor Dedicated |
---|---|---|
Resources | Shared with multiple tenants | Dedicated instance per client |
Latency | ≈300ms (variable) | ≈100ms (region-matched) |
Scalability | Limited by shared capacity | Independent vertical / horizontal scaling |
We return structured explanations so reviewers can see exactly why a message is risky—supporting faster decisions and clearer dispute handling.
We control boundaries with targeted evals and ambiguous-case checks to reduce false positives while keeping risky claims in sight.
Yes. Riskor supports 100+ languages, with English-level or near-English accuracy in at least 10 major languages, including English, Chinese, Spanish, French, German, Italian, Portuguese, Russian, Arabic, and Japanese.
Default cloud API or a dedicated, isolated environment per enterprise customer. Region-matched hosting keeps latency low.
Mapped to enforceable policies. <300ms at edge. Exportable reasoning.