Open Source

OSS · 00 / Why community tier

What you get for free.

The community tier exists because adoption beats lock-in. Same governance as the paid platform; lower retention and no enterprise SLA. Upgrade only when you need it.

01 / Policy

Same intercept path

Every OSS-tool call routes through BaseConnector.execute. Nothing is special-cased; community connectors carry the same hard gates as platform connectors.
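A minimal sketch of what a single intercept path can look like, assuming a policy hook that is consulted before any tool runs. Only the name BaseConnector.execute comes from the text above; Decision, Call, the policy_check callable, _run, and GarakConnector are hypothetical stand-ins for illustration, not ARX's actual API.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable


class Decision(Enum):
    ALLOW = "allow"
    DENY = "deny"
    ESCALATE = "escalate"


@dataclass
class Call:
    connector: str
    action: str              # e.g. "scan:run"
    params: dict[str, Any]


# Hypothetical policy hook: receives the call, returns a decision.
PolicyCheck = Callable[[Call], Decision]


class BaseConnector:
    name = "base"

    def __init__(self, policy_check: PolicyCheck) -> None:
        self._policy_check = policy_check
        self.audit_log: list[tuple[str, str, str]] = []

    def execute(self, action: str, params: dict[str, Any]) -> Any:
        """Single entry point: policy decision, audit record, then the tool."""
        call = Call(self.name, action, params)
        decision = self._policy_check(call)
        self.audit_log.append((self.name, action, decision.value))
        if decision is not Decision.ALLOW:
            raise PermissionError(f"{self.name}:{action} -> {decision.value}")
        return self._run(action, params)

    def _run(self, action: str, params: dict[str, Any]) -> Any:
        raise NotImplementedError


class GarakConnector(BaseConnector):
    name = "garak"

    def _run(self, action: str, params: dict[str, Any]) -> Any:
        # Placeholder: a real connector would launch the sandboxed tool here.
        return {"action": action, "status": "ok"}


def deny_exploits(call: Call) -> Decision:
    """Toy policy used only for this example."""
    return Decision.DENY if call.action.startswith("exploit") else Decision.ALLOW
```

Because subclasses override only _run, a community connector in this sketch has no way to skip the policy check or the audit record.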

02 / Audit

Normalized findings

garak, promptfoo, PyRIT, PurpleLlama, agentic-radar all emit into one of three shapes — AIFinding, PentestFinding, AgentVuln — so policies key off severity uniformly.
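To illustrate why a shared severity field lets one policy cover all three shapes, here is a small sketch. The shape names AIFinding, PentestFinding, and AgentVuln are from the text; every field, the Severity scale, and the blocking helper are assumptions made for the example.

```python
from dataclasses import dataclass
from enum import IntEnum


class Severity(IntEnum):
    LOW = 1
    MEDIUM = 2
    HIGH = 3
    CRITICAL = 4


@dataclass
class Finding:
    tool: str
    title: str
    severity: Severity


@dataclass
class AIFinding(Finding):
    probe: str = ""          # assumed field


@dataclass
class PentestFinding(Finding):
    cwe: str = ""            # assumed field


@dataclass
class AgentVuln(Finding):
    framework: str = ""      # assumed field


def blocking(findings: list[Finding], threshold: Severity = Severity.HIGH) -> list[Finding]:
    """Return findings at or above the threshold, regardless of shape."""
    return [f for f in findings if f.severity >= threshold]
```

The policy function never inspects the shape-specific fields, which is the point of normalizing first.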

03 / Sandbox

community-oss profile

Each tool runs in a SHA-pinned Docker image limited to 1 CPU, 1Gi RAM, and a 600s timeout, with no host networking by default, no host volume mounts, and scoped LLM keys.
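The limits above map fairly directly onto standard docker run flags. The sketch below builds such a command; the limit values are from the text, while the dict layout, function names, and the choice of --network none (stricter than merely avoiding host networking) are assumptions rather than the actual community-oss profile.

```python
import subprocess

# Values from the text; the dict layout itself is assumed.
COMMUNITY_OSS = {"cpus": "1", "memory": "1g", "timeout_s": 600}


def docker_argv(image_ref: str, tool_args: list[str], env_keys: list[str]) -> list[str]:
    """Build a `docker run` command for a digest-pinned image."""
    if "@sha256:" not in image_ref:
        raise ValueError("image must be pinned by digest (name@sha256:...)")
    argv = [
        "docker", "run", "--rm",
        "--cpus", COMMUNITY_OSS["cpus"],
        "--memory", COMMUNITY_OSS["memory"],
        "--network", "none",     # assumption: no network at all
    ]
    # No -v/--volume/--mount flags are ever added, so nothing from the host is mounted.
    for key in env_keys:
        argv += ["--env", key]   # name only: the value is read from the caller's environment
    return argv + [image_ref, *tool_args]


def run_sandboxed(argv: list[str]) -> subprocess.CompletedProcess:
    """Run the command and raise subprocess.TimeoutExpired after the time limit."""
    return subprocess.run(
        argv, capture_output=True, text=True, timeout=COMMUNITY_OSS["timeout_s"]
    )
```

Note that the subprocess timeout only stops the docker client process; a production runner would also need to stop the container itself.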

04 / Free seat

No connector cap

Community connectors don't count toward platform connector caps. Audit retention capped at 30 days; web-only approval routing. Upgrade to extend.

OSS · 01 / AI red-team scanners

Probe the model. Normalize the result.

Five canonical OSS AI red-team scanners, each wrapped so its native output maps into ARX's AIFinding shape. Run them on a schedule, on demand, or as a CI gate.
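One way to picture the CI-gate mode is a small script that turns normalized findings into a process exit code. This is a sketch under assumptions: the finding dicts, their keys, and the severity labels are invented for illustration and are not an ARX or scanner output format.

```python
import sys

SEVERITY_ORDER = ["low", "medium", "high", "critical"]


def gate(findings: list[dict], fail_at: str = "high") -> int:
    """Return an exit code: 1 if any finding meets the threshold, else 0."""
    limit = SEVERITY_ORDER.index(fail_at)
    hits = [f for f in findings if SEVERITY_ORDER.index(f["severity"]) >= limit]
    for f in hits:
        # Report on stderr so the CI log shows what tripped the gate.
        print(f"[{f['severity']}] {f['tool']}: {f['title']}", file=sys.stderr)
    return 1 if hits else 0
```

A CI step would end with sys.exit(gate(findings)), so a non-zero code fails the job.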

ai-scanner

MIT CI gate

Lightweight CI-oriented prompt-injection / leak / jailbreak scanner from MetaCTF. Pair with promptfoo or PyRIT for layered coverage.

rules:list · scan:run

GitHub → Workflow

garak

Apache‑2.0 NVIDIA

The "nmap of LLMs." Probe-based vulnerability scanner with families for jailbreak, prompt-injection, leak-replay, encoding bypass, toxicity, and more.

probes:list · scan:run · report:read

GitHub → Workflow

promptfoo

MIT CI-friendly

LLM eval and adversarial red-team plugins. Run promptfoo's harmful / jailbreak / PII / prompt-injection plugins under ARX policy and audit.

eval:run · redteam:run · report:read

GitHub → Workflow

PurpleLlama

Llama Community BYO image

LlamaGuard input/output safety, CodeShield insecure-code detection, CyberSecEval benchmarks. Customer-installed image; ARX never bundles model weights.

llama_guard:scan · code_shield:scan · cyber_sec_eval:run

GitHub → Workflow

PyRIT

MIT Microsoft

Python Risk Identification Tool. RedTeaming, Crescendo, PAIR, TAP, Skeleton Key, XPIA orchestrators against a target model — with full audit.

orchestrators:list · red_team:run · prompt_send:run

GitHub → Workflow

OSS · 02 / Agent posture & runtime

Know what you've shipped. Catch what it does.

Static posture from agentic-radar; runtime detections from agentfence. Both ingest into ARX as AgentVuln findings — uniform severity, uniform policy.

agentic-radar

Apache‑2.0 splx.ai

Static-analyze a LangChain / LlamaIndex / AutoGen / CrewAI agent codebase. Surface tool-misuse risk, missing HITL gates, scope violations, credential exposure.

frameworks:list · scan:run · report:read

GitHub → Workflow

agentfence

Apache‑2.0 Ingest-only

Runtime agent firewall. ARX ingests detections (prompt injection, tool chain abuse, RCE-via-tool, data exfiltration) and applies governance — your control loop, not theirs.

findings:read · rules:read · alerts:ack

GitHub →

OSS · 03 / HTTP pentest

Pentest signal, normalized into ARX findings.

Lightweight HTTP-traffic pentest tooling that emits into the same PentestFinding shape as the autonomous agents below. Replay rules against HAR files or scan live targets.

reaper

Apache‑2.0 Ghost Security

HTTP traffic analysis pentest tool. Run rules against HAR captures or live targets; ARX normalizes findings as PentestFindings with CWE / CVSS metadata.
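As an illustration of running a rule against a HAR capture, the sketch below flags HTTPS responses that set a cookie without the Secure attribute and tags them with CWE-614. It relies only on the standard HAR layout (log.entries, request.url, response.headers); the rule, the output dict, and its keys are assumptions and do not reflect reaper's rule format or ARX's real PentestFinding schema.

```python
import json


def insecure_cookie_findings(har_text: str) -> list[dict]:
    """Flag HTTPS responses that set a cookie without the Secure attribute."""
    har = json.loads(har_text)
    findings = []
    for entry in har["log"]["entries"]:
        url = entry["request"]["url"]
        if not url.startswith("https://"):
            continue
        for header in entry["response"]["headers"]:
            if header["name"].lower() != "set-cookie":
                continue
            # Everything after the first ";" is a cookie attribute.
            attrs = [part.strip().lower() for part in header["value"].split(";")[1:]]
            if "secure" not in attrs:
                findings.append({
                    "shape": "PentestFinding",
                    "url": url,
                    "title": "Cookie set without Secure attribute",
                    "cwe": "CWE-614",
                    "cvss": None,   # left unscored in this sketch
                })
    return findings
```

The same function works on a replayed capture or on traffic recorded from a live target, since both reduce to HAR entries.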

rules:list · scan:run · report:read

GitHub →

OSS · 04 / Autonomous pentest (gated)

One connector. Many providers. Hard gates.

A single pentest_agent meta-connector dispatches to autonomous LLM-driven pentest agents (pentagi, strix, ...). It refuses to run without a signed scope, an attributable initiator, and an LLM spend cap. Exploitation always escalates for human approval.

pentest_agent

Gated Opt-in per org

Meta-connector. Pick a provider (pentagi, strix) at runtime. Hard gates baked in: authorization_artifact, max_llm_spend_usd, initiated_by_user_id. Default policy bundle ESCALATEs every exploit:run.
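The three hard gates and the default ESCALATE rule can be sketched as a single decision function. The field names authorization_artifact, max_llm_spend_usd, and initiated_by_user_id and the exploit:run action are from the text; the PentestRequest class, the evaluate function, the REFUSE and ALLOW labels, and applying the gates to every action are assumptions. The sketch checks only that the artifact is present; verifying its signature is out of scope here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PentestRequest:
    action: str                                    # e.g. "recon:run", "exploit:run"
    provider: str = "pentagi"
    authorization_artifact: Optional[str] = None   # signed scope document
    max_llm_spend_usd: Optional[float] = None
    initiated_by_user_id: Optional[str] = None


def evaluate(req: PentestRequest) -> str:
    """Return "REFUSE", "ESCALATE", or "ALLOW" for a pentest_agent call."""
    required = ("authorization_artifact", "max_llm_spend_usd", "initiated_by_user_id")
    missing = [name for name in required if getattr(req, name) in (None, "")]
    if missing:
        return "REFUSE"
    if req.max_llm_spend_usd <= 0:
        return "REFUSE"          # a zero or negative cap is treated as no budget
    if req.action == "exploit:run":
        return "ESCALATE"        # always routed to a human approver
    return "ALLOW"
```

Checking the gates before the action type means an exploit request without a scope is refused outright rather than sent to an approver.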

providers:list · recon:run · scan:run · exploit:run

pentagi → strix → Dryrun workflow

We add providers on customer request, never speculatively. Adding a provider does not add a customer-visible connector; abandoning one drops a dispatch case, not a feature. Shipping and deferred providers are listed below.

hexstrike-ai DEFERRED

verify license

0x4m4's hex-strike pentest agent. Provider available on demand.

GitHub →

PentestAgent (0xSojalSec) DEFERRED

single-author

0xSojalSec's PentestAgent. Provider available on demand.

GitHub →

pentest-ai DEFERRED

single-author

0xSteph's pentest-ai. Provider available on demand.

GitHub →

pentagi SHIPPING

Apache‑2.0

vxcontrol's autonomous pentest agent. Default provider for the meta-connector.

GitHub →

pentestagent (GH05TCREW) DEFERRED

single-author

GH05TCREW's pentest agent. Provider available on demand.

GitHub →

PentestGPT DEFERRED

MIT

The original LLM-pentest project. Add as a provider on request.

GitHub →

shannon DEFERRED

verify license

KeygraphHQ's pentest agent. Provider available on demand.

GitHub →

strix SHIPPING

verify license

usestrix's autonomous pentest agent. Second supported provider out of the box.

GitHub →

tachi DEFERRED

single-author

davidmatousek's pentest agent. Single-author repo; abandonment risk contained by the meta-connector.

GitHub →
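The provider list above can be pictured as a dispatch table behind one connector. In this sketch the two shipping providers and the pentagi default are from the text; the function names, the table, and the return values are hypothetical placeholders.

```python
from typing import Callable

ProviderFn = Callable[[str, dict], dict]


def run_pentagi(action: str, params: dict) -> dict:
    return {"provider": "pentagi", "action": action}   # placeholder body


def run_strix(action: str, params: dict) -> dict:
    return {"provider": "strix", "action": action}     # placeholder body


# Adding or dropping a provider changes this table only;
# the connector name and its action list stay the same.
PROVIDERS: dict[str, ProviderFn] = {
    "pentagi": run_pentagi,
    "strix": run_strix,
}
DEFAULT_PROVIDER = "pentagi"


def dispatch(action: str, params: dict, provider: str = DEFAULT_PROVIDER) -> dict:
    """Route a pentest_agent call to the chosen provider."""
    try:
        handler = PROVIDERS[provider]
    except KeyError:
        raise ValueError(
            f"unknown provider {provider!r}; available: {sorted(PROVIDERS)}"
        ) from None
    return handler(action, params)
```

A deferred provider would simply be absent from the table until a customer asks for it.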

OSS · 05 / Also in scope

Targets, benchmarks, governance toolkits.

Tools that aren't ARX connectors but ship as benchmark workflows or are covered by a related platform connector. Listed for completeness against the original 22-tool brief.

agentdojo

Benchmark Workflow

ETH Zürich Spylab's agent benchmark. Drives the agent-governance-baseline workflow as a target — scored against ARX policy outputs.

GitHub → Workflow

ai-goat

Target Workflow

dhammon's vulnerable AI app. Used as the target in the pentest-dryrun workflow to validate the approval flow — not a connector.

GitHub → Workflow

damn-vulnerable-llm-agent

Target Workflow

Reversec Labs' vulnerable LLM agent. Used as the target in the ai-redteam-benchmark workflow, not as a connector.

GitHub → Workflow

Microsoft Agent Governance Toolkit

Platform Apache‑2.0

Already shipping as the microsoft_governance platform connector with bidirectional policy / audit / compliance / trust APIs — not duplicated as a community connector.

GitHub → Platform connector

Paperclip

Platform Connector

Secure document management for agent-produced evidence. ARX governs every compliance bundle, incident report, and policy attestation Paperclip stores — HITL gate before any external share, immutable document chain in the audit log.

docs:read · docs:write · share:write · audit:read

Integration post → Platform connector
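One common way to make a document trail tamper-evident is a hash chain, where each audit entry commits to the one before it. The sketch below shows that general technique only; whether ARX or Paperclip implements the immutable document chain this way is an assumption, and the entry fields and function names are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64


def _digest(body: dict) -> str:
    # Canonical JSON (sorted keys) so the same content always hashes the same.
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()


def append_entry(chain: list[dict], document_id: str, event: str) -> dict:
    """Append an entry whose hash covers its content and the previous hash."""
    prev = chain[-1]["hash"] if chain else GENESIS
    body = {"document_id": document_id, "event": event, "prev": prev}
    entry = {**body, "hash": _digest(body)}
    chain.append(entry)
    return entry


def verify(chain: list[dict]) -> bool:
    """Recompute every hash; an edited or reordered entry breaks the chain."""
    prev = GENESIS
    for entry in chain:
        body = {k: entry[k] for k in ("document_id", "event", "prev")}
        if entry["prev"] != prev or entry["hash"] != _digest(body):
            return False
        prev = entry["hash"]
    return True
```

Changing any stored event after the fact invalidates that entry's hash and every link after it, which is what makes the trail auditable.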

seclab-taskflow-agent

Skipped

GitHub Security Lab research artifact. Skipped until a release with a clear API surface lands.

GitHub →

Open-source AI security tools — governed, audited, and free.

Wire your OSS red-team into ARX this afternoon.