engineeringSolo eng or technical lead
Decorator on app boot, Slack alert at 3am when hallucination spikes in production. Categorized diagnosis (LAW, ID_NUMBER, NUMBER_DATE), 10-min fix.
Every agent your team runs — WhatsApp chatbot, internal copilot, legal RAG, customer-support. aferiq evaluates continuously, the CTO sees a consolidated quality dashboard, the engineer gets an alert before the client complains.
BYOK · 500 traces/month free · LGPD-native (DPA + SCC) · BR-aware (Lei, CNPJ, INSS)
Question + retrieved context + generated answer → PT-BR LLM judge detects hallucination and categorizes (invented laws, fake tax IDs, fictional gov agencies, etc).
Want more? 500 free traces on a real account
Create free accountPaste the key in your .env and run the app. Brazilian PII (CPF/CNPJ/RG/CEP/email/phone) is redacted before any network call — you can read the regex in the SDK source, it's not a magic flag.
# .env — 1 env var (Sentry-style DSN)
AFERIQ_DSN=https://rg_pk_live_xxx@your-deploy.com.br/api/v1/traces
# main.py — UMA linha, qualquer framework
import openai
import aferiq
aferiq.start() # reads AFERIQ_DSN + auto-patches openai/anthropicLangChain, LangGraph, LlamaIndex via callback. Python decorator for any function. CLI auto-instruments. CrewAI, Haystack, AutoGen, Pydantic AI in /dashboard/integrate.
# 1. App boot — once:
import aferiq
aferiq.start() # reads AFERIQ_DSN from env
# 2. On EVERY chain (RetrievalQA, ConversationalRetrievalChain, LCEL):
chain = RetrievalQA.from_chain_type(
llm=llm,
retriever=vectorstore.as_retriever(),
callbacks=[aferiq.handler()], # ← one line
)
chain.invoke({"query": "..."})
# Trace is in the dashboard, with PII redacted (CPF/CNPJ/email).Solo eng, agency serving clients, or 200-person company with internal agents — aferiq serves who creates, who operates, and who decides budget.
engineeringDecorator on app boot, Slack alert at 3am when hallucination spikes in production. Categorized diagnosis (LAW, ID_NUMBER, NUMBER_DATE), 10-min fix.
product / pmCompare week to week with regression datasets. Run pre-deploy evals. Show clients a quality dashboard — turns into upsell material, not just logs.
leadershipConsolidated view across every agent in the team. Quality score, cost estimate, incidents, top hallucinations. Board reporting, budget decisions, regulatory risk.
Global tools don't speak BR. Building from scratch doesn't scale.
USD pricing, English-only onboarding, no n8n. Great for Bay Area Series A; wrong fit for Brazilian operations.
Library, not product. No cloud, no alerts, no exec dashboard. You install — then build the rest.
200h of senior dev time for v1 + 20% ongoing maintenance. Multiplied by each agent that grows on your team.
Generic hallucination + BR-specific patterns + per-claim diagnosis.
faithfulnessThe default metric. Always run it.
citation_accuracyUseful for debugging bad retrieval.
hallucinationCategorizes invented laws, fabricated CNPJ/CPF, fictional Receita Federal/INSS references, fake government programs. Actionable diagnosis.
The difference is in the details that only matter here: BR-specific judge prompts, auth in SP + LGPD via DPA + SCC, BRL billing, Portuguese support.
Regulated sectors (legal, finance, health) demand DPO + DPA before they sign. We deliver the checklist upfront — no Phase-2 wait.
SDK runs on your machine. redact_pii=True flag strips CPF, CNPJ, RG, CEP, phone and email via BR regex before any POST. What can't leave your environment, doesn't.
OpenAI/Anthropic keys live in workspace_settings encrypted with per-row IV + auth tag. Plaintext only in memory during a request. Never logged.
Ingest API keys hashed with bcrypt. Cross-tenant access blocked via Postgres Row Level Security. Auth callback uses timing-safe comparison.
Strict CSP, X-Frame-Options DENY, HSTS preload, minimal Permissions-Policy. Cookie consent gates PostHog/Sentry per LGPD Art. 7.
DPA template, SCC equivalents for cross-border flows, public subprocessor list at /legal/lgpd. Right-to-erasure via DPO in 15 business days.
aferiq-eval lib on PyPI ships with judge prompts visible. Fork, audit, run self-hosted. No black-box judge.
500 traces/month on the free tier with full coverage of all 3 PT-BR metrics.