Features SDKs Runtime MCP Server Skills Use Cases Integration Platforms Compliance Install
Star on GitHub

The Security Operations Center
for Your AI Agents

Real-time threat detection, behavioral analysis, and compliance scoring across 6 agent platforms. Open source and local-first.

153+
Rules
6
Platforms
167+
APIs
<100ms
Latency

Why AI Agent Security Matters

AI agents are proliferating across enterprises, but security tooling hasn't kept pace. The attack surface is expanding faster than defenses can adapt.

0
GitHub stars on OpenClaw

Making it a high-value target for supply chain attacks

0
ClawHub skills with critical flaws

283 of 3,984 skills expose credentials (Koi Security audit)

0
Exposed instances on Shodan

Agent frameworks running without authentication

Comprehensive Security Coverage

From secrets detection to attack chain correlation, AgentsMon covers the full spectrum of AI agent security operations.

Threat Detection

153+ rules across secrets, code vulnerabilities, prompt injection, PII, and sandbox escape patterns.

Behavioral Analysis

Command monitoring, SSRF detection, path traversal, and sandbox escape prevention.

Aegis AI Orchestrator

Global AI agent that orchestrates fleets, investigates threats, and manages platform integrity through natural language.

Compliance Scoring

10 compliance frameworks including OWASP LLM Top 10, NIST AI RMF, EU AI Act, SOC 2, GDPR, HIPAA, and ISO 42001 with MITRE ATLAS mapping.

Agent Sandbox

53 escape detection rules, 4-level trust scoring, and OWASP ASI compliance per agent.

Attack Correlation

6 pre-defined attack chains with temporal clustering and automatic linking.

Skill Security

Pre-install scanning, ClawHavoc IOC matching, and capability risk scoring. Catches prompt injection and supply chain attacks that VirusTotal's signature scanning misses.

Enterprise Export Pipeline

OpenTelemetry (OTLP), Prometheus metrics, SIEM export (CEF/OCSF), webhooks, and bulk CSV/JSON download. One platform, every observability tool.

Up and Running in Minutes

Three steps to comprehensive AI agent security monitoring.

1

Install

$ pip install agentsmon
$ npm install @agentsmon/sdk
$ clawhub install agentsmon

Python SDK, Node.js SDK, or OpenClaw plugin. Pick your platform.

2

Monitor

Real-time event ingestion, behavioral analysis, and threat detection begin automatically.

3

Protect

Receive alerts, run AI-powered investigations, and generate compliance reports.

Built for Every Team

Whether you're securing a single coding assistant or governing a fleet of autonomous agents, AgentsMon has you covered.

Security Operations

Detect credential leaks, prompt injection, SSRF, and sandbox escapes in real time. Correlate events into attack chains and receive autonomous threat investigations from the Sentinel Agent.

153+ detection rules
6 attack chain definitions
AI-powered auto-investigation

Compliance & GRC

Map every finding to OWASP LLM Top 10, OWASP Agentic Top 10, and MITRE ATLAS. Generate immutable audit trails with SHA-256 hash chains. Export evidence in CEF or OCSF for your SIEM.

10 compliance frameworks
Immutable audit log with integrity verification
JSON, CSV, CEF, OCSF export

Platform Engineering

Govern agent permissions with 4-level trust scoring. Sandbox agents with 53 escape detection rules. Monitor session lifecycles and detect cross-session data leakage across your entire fleet.

4-level trust system (Intern → Autonomous)
53 sandbox escape patterns
Auto-demotion on violations

AI / ML Engineering

Scan agent extensions before installation. Detect prompt injection, jailbreaks, and memory poisoning. Get AI-generated explanations for every finding and remediation guidance.

Pre-install skill scanning
20+ prompt injection rules
AI-powered remediation

Red Team & Pen Testing

Test agent defenses against known attack chains. Simulate container escapes, SSRF, and confused deputy attacks. Validate detections against MITRE ATLAS techniques and 15+ tracked CVEs.

35+ MITRE ATLAS techniques
15+ framework CVEs tracked
IOC + CVE lookup tools

DevOps & SRE

Export to Splunk, Datadog, Grafana, or Elastic via OTLP. Scrape Prometheus metrics. Push alerts to Slack/PagerDuty via webhooks. Bulk download events in CSV/JSON/JSONL.

OpenTelemetry (OTLP) export
Prometheus /metrics endpoint
Webhook alerts + SIEM (CEF/OCSF)

Platform Integration Guides

Connect your AI agent platform to AgentsMon in minutes

OC

OpenClaw

Native skill integration, zero config. Install the AgentsMon skill and monitoring starts automatically with full event coverage.

bash scripts/setup.sh
View Full Guide
LC

LangChain

Python SDK callback handler for LLM and tool monitoring. Drop in a single callback to capture all chain events and tool invocations.

pip install agentsmon[langchain]
View Full Guide
CA

CrewAI

Python SDK step/task callbacks for crew monitoring. Attach to any Crew instance to capture delegation, task completion, and tool usage.

pip install agentsmon[crewai]
View Full Guide
AG

AutoGPT

Python SDK hooks for autonomous agent monitoring. Instrument the agent loop to track every command, file, and browse action.

pip install agentsmon
View Full Guide
MC

MCP

Server middleware for tool call security. Send MCP server events directly to the ingestion API for real-time tool injection detection.

POST /api/ingest/mcp
View Full Guide
++

Generic / REST

Universal REST API for any agent framework. Send structured events from any language or platform using a simple HTTP POST.

POST /api/ingest/generic
View Full Guide
New

Meet Aegis
Your Global AI Orchestrator

Most security tools are reactive — they detect and report. Aegis is agentic. Inspired by the OpenClaw chat concept, Aegis serves as the central figure of your mission control. It doesn't just watch; it performs tasks, schedules maintenance, and investigates deep forensics through a unified conversational interface.

Orchestration Chat
A unified hub for directing your fleet. "Aegis, find the top spenders and restrict their subnets."
Autonomous Maintenance
Aegis schedules scans, validates connectors, and prunes old logs autonomously based on your policy.
Governance & Guardrails
Every action taken by Aegis is sandboxed and logged to the tamper-proof audit trail for full accountability.
Sentinel Agent — Live Status
agent-prod-07
92
agent-staging-03
67
agent-dev-01
12
Recent Autonomous Investigations
12:04:31 CRITICAL agent-prod-07: SSRF to cloud metadata + credential exfiltration
11:58:12 HIGH agent-staging-03: Repeated prompt injection attempts
11:42:05 INFO agent-dev-01: Schema validation failures (false positive)

Agent Runtime Platform

From monitoring to operating system. Register, configure, test, benchmark, and deploy agents with security baked into every layer.

📋

Agent Registry

Version-controlled agent definitions with lifecycle management. Draft → Testing → Active → Paused → Retired.

CRUD Versioning Cloning Multi-provider

Agent Executor

Three execution modes: dry_run (simulate), test_run (trust=0), production (full trust). Every tool call runs through the monitoring pipeline.

ReAct loop Sandbox Trust-gated Timeout
🧩

Skill Marketplace

Register, scan, and approve skills. 12 security patterns detect dangerous capabilities and scripts before agents can use them. Goes beyond VirusTotal's hash matching with behavioral capability analysis and agent-specific threat detection.

Security scan Approval Trust gating Flagging Beyond VirusTotal
📊

Benchmarking

5 built-in safety tests + custom suites. Per-category scoring (safety, capability, performance, compliance) with A/B comparison.

Safety tests Scoring Comparison Regression
0
API Endpoints
0
Database Tables
0
New Services
0
Passing Tests

MCP Server Interface

Sandbox-as-a-Service. Any MCP-compatible agent can call into AgentsMon to safely execute commands, scan code, and query security intelligence — all trust-gated.

npx agentsmon-mcp

🛡️

sandbox_execute

Trust-gated command execution. L0 callers get analysis-only; L1+ get approval. 150+ escape patterns checked before any command runs.

Trust-gated Escape detection Auto-register
🔍

scan_code

Scan any code for secrets, vulnerabilities, and prompt injection patterns. 20 injection rules + full secret/code scanner.

Injection Secrets Vulnerabilities
📡

query_security

Full security intelligence access: dashboard, events, findings, alerts, correlations, and threat context — all via one tool.

Dashboard Events IOC/CVE
🔗

Universal Access

Two access modes: native MCP over STDIO for Claude/GPT agents, or HTTP POST for any client. No SDK required.

STDIO HTTP 8 tools
0
MCP Tools
0
Management Endpoints
0
Access Modes
0
Passing Tests

Skills Interface

34 purpose-built skills that teach agents how to interact with the sandbox runtime. Each skill is a SKILL.md file with CLI commands — no SDK, no code, just instructions agents follow.

base/skills/*/SKILL.md

Core Orchestration

8 skills

Task lifecycle, sprint planning, canvas deployment, agent collaboration, vault secrets, and dynamic agent spawning.

agentsmon-work agentsmon-plan agentsmon-canvas agentsmon-collab agentsmon-ops agentsmon-agents agentsmon-connections agentsmon-auth

Safety & Security

5 skills

Access control, code auditing, skill validation, security configuration, and container management.

secure-setup agent-access-control clauditor skill-vetter arcane-docker-manager

Research & Knowledge

6 skills

Academic research, context anchoring, skill discovery, error pattern learning, and live data from Reddit & HackerNews.

academic-research context-anchor agentsmon-skills-discovery error-learning reddit-browser hackernews-reader

Design & Content

3 skills

Canvas design patterns with Lucide SVG reference, technical documentation, and canvas validation testing.

canvas-design technical-documentation agentsmon-canvas-test

Reasoning & Planning

5 skills

Sequential thinking chains, adaptive reasoning, project management, team orchestration, and context window optimization.

sequential-thinking adaptive-reasoning project-management-skills agent-team-orchestration context7

DevOps & Code Quality

7 skills

Git workflows, PR preparation and review, code cleanup, active maintenance, and agent commons patterns.

git-workflow prepare-pr merge-pr review-pr technical-deslop active-maintenance agent-commons
0
SKILL.md Files
0
Autonomous Agents
0
Skill Categories
0
Lines of SDK Code

Skills vs MCP — Why Both?

Skills (SKILL.md)
  • Plain-text instructions agents follow autonomously
  • Wrap CLI commands (agentsmon-cli) that call REST APIs
  • No runtime, no SDK, no code — just markdown
  • Agents learn capabilities through reading, not tool registration
  • Used by the 10-agent fleet inside the sandbox
MCP Server
  • Programmatic tool interface for external agents
  • STDIO or HTTP access modes with trust gating
  • 8 structured tools with schemas and validation
  • Used by Claude, GPT, and third-party agents calling in
  • Security boundary: sandbox_execute, scan_code, query_security

Multi-Engine File Scanner

Like VirusTotal’s 70+ AV engines — but purpose-built for AI agent threats. 6 detection engines scan every file for prompt injection, sandbox escapes, supply chain poisoning, and more.

🔬

6 Detection Engines

SecurityScanner, BehavioralMonitor, AdvancedThreatDetector, SupplyChainAnalyzer, SandboxMonitor, and YaraRuleEngine — each engine produces an independent verdict.

🧬

YARA-Like Rules

10 built-in rules detect crypto miners, reverse shells, C2 beacons, credential harvesting, and more. Add custom regex rules via API.

🔄

Scheduled Rescans

Like VirusTotal’s daily re-scans. Previously clean files get re-checked. Reputation drift triggers alerts when status changes.

0
Detection Engines
0
YARA Rules
0
Scanner Endpoints
0
Total API Endpoints
0
Passing Tests

Detection at Scale

Continuously expanding rule library built from real-world agent threat research.

0
Secret Detection
Rules
0
Code Security
Rules
0
Prompt Injection
Rules
0
Sandbox Escape
Rules
0
Command
Patterns
0
Tracked
CVEs
Tier 16

Self-Validation Platform

AgentsMon validates itself. 13 scenarios exercise every detection engine with crafted malicious inputs, producing a publishable score and grade.

13
Validation
Scenarios
100+
Test Cases per
Validation Run
A-F
Letter Grade
Report

How It Works

1. ValidationOrchestrator registers a validator agent via AgentRegistry

2. Creates 13 validation skills via SkillRegistry

3. Each scenario calls real service methods with known-malicious inputs

4. Asserts detection results: did the rule fire? Was the severity correct?

5. Produces a weighted score (0-100), letter grade (A-F), and publishable JSON/Markdown report

Analysis February 2026

OpenClaw + VirusTotal: A Step Forward, Not a Solution

OpenClaw integrated VirusTotal to scan ClawHub skills — SHA-256 hash lookup against 70+ AV engines. Good for catching known malware. But OpenClaw acknowledged it's "not a silver bullet" — prompt injection, sandbox escapes, and capability abuse evade signature scanning entirely.

7.1% of ClawHub skills (283 of 3,984) contain critical credential-exposing flaws that are code patterns, not malware binaries. VirusTotal won't flag eval(userInput) or a webhook exfiltration URL. AgentsMon will.

Threat Type AgentsMon VirusTotal
Known malware (trojans, stealers) Via IOCs 70+ engines
Prompt injection (22 rules)
Sandbox escape (53 rules)
Supply chain IOCs (12 patterns)
Capability risk analysis 36+ patterns
Credential exposure in code 29 rules
Runtime behavioral analysis 115+ patterns
Trust-gated execution 4 levels

Source: The Hacker News, Feb 2026. Koi Security audit of 3,984 ClawHub skills.

Docker E2E Integration Testing

One command spins up 4 containers, exercises every event type through the full analysis pipeline, and reports gaps. Loops until 100% satisfied.

14
Test Scenarios

Agent registration, dangerous commands, SSRF, sandbox escape, prompt injection, trust lifecycle, correlation chains, dashboard verification, SSE streaming

4
Docker Services

Backend (Node 20 + SQLite), Frontend (Vite + nginx), OpenClaw Simulator, Test Agent — all orchestrated via Docker Compose

60+
Assertions

Every event type verified end-to-end: ingestion → analysis → anomaly detection → alert creation → dashboard display → SSE broadcast

./e2e/run.sh openclaw --loop --verbose
New

Client SDKs & Shield Mode

Two product tiers: Auditor (passive monitoring) and Shield (active prevention). Both ship as pip/npm packages with fail-open safety.

PY

Python SDK

pip install agentsmon
Auditor: Drop-in callbacks for LangChain, CrewAI, AutoGPT. Decorators for custom agents.
Shield: Command blocking, prompt injection, PII detection/redaction, tool permissioning, domain whitelisting.
# Auditor — 3 lines
import agentsmon
client = agentsmon.init("http://localhost:18800")
client.track_command("echo hello")
# Shield — pre-flight blocking
from agentsmon.guard import AgentsMonGuard
guard = AgentsMonGuard(client, allowed_tools=["search"])
result = guard.check_command("rm -rf /")
LangChain CrewAI AutoGPT Generic 83 tests
JS

Node.js SDK

npm install @agentsmon/sdk
Auditor: Event batching, auto-flush, retry with fail-open. Track commands, files, usage, and agent lifecycle.
Shield: Same Guard features as Python — command blocking, PII, tool permissioning, domain whitelisting.
// Auditor
import { AgentsMonClient } from '@agentsmon/sdk';
const client = new AgentsMonClient('http://localhost:18800');
client.trackCommand('echo hello');
// Shield
import { AgentsMonGuard } from '@agentsmon/sdk';
const guard = new AgentsMonGuard(client);
const r = await guard.checkCommand('rm -rf /');
Generic TypeScript 28 tests

Shield Mode Features

🚫
Command Blocking
Pre-flight safety checks
🔒
Prompt Injection
20+ detection rules
👤
PII Detection
10 rules + redaction
🔧
Tool Permissioning
Allowlist enforcement
🌐
Domain Whitelisting
URL + subdomain checks
Fail-Open Safety
Never blocks on errors

Multi-Platform Comparison

One security layer across every agent framework. See exactly what's covered.

OC
OpenClaw Native
LC
LangChain
CA
CrewAI
AG
AutoGPT
MC
MCP
++
Generic
Feature OC LC CA AG MCP GEN
Security Scanning
Secret detection (29 rules)
Code vulnerability scanning (32 rules)
Prompt injection detection (20+ rules)
Sandbox escape detection (53 rules)
Dependency audit (npm)
CVE tracking (framework-specific) 4 2 1 4
Behavioral Monitoring
Dangerous command detection (115+)
File access anomaly detection
SSRF & network request analysis
PII detection & redaction (10 rules)
Data exfiltration detection
Memory poisoning detection
Cross-session leakage detection
AI-Powered Intelligence
Sentinel Agent (autonomous investigation)
Attack chain correlation (6 chains)
Trust level management (4 levels)
AI-powered finding explanations
Compliance & Audit
OWASP LLM Top 10 scoring
OWASP ASI Top 10 compliance
MITRE ATLAS mapping (35+ techniques)
Immutable audit trail (SHA-256)
SIEM export (CEF + OCSF)
OpenTelemetry (OTLP) export
Prometheus metrics + webhooks
Platform-Specific Integrations
Config baseline & drift monitoring
Skill / extension pre-install scanning
MCP tool injection & server monitoring
Gateway security (IP blocking, brute force)
CDP / WebSocket hijacking detection
File integrity monitoring (SHA-256 baseline)
Serialization CVE detection
Autonomous action resource limits
Full support
Not applicable

Built for Security Teams

A purpose-built dashboard for monitoring AI agent security posture in real time.

localhost:3000/dashboard
Compliance Score
87
OWASP LLM Top 10
Recent Alerts
Hardcoded API key detected
Prompt injection attempt
Suspicious command execution
Config baseline drift
SSRF to cloud metadata
System Status
Events / hr 1,247
Active Sessions 3
Findings Today 12
Rules Active 86
Uptime 99.9%

Standards & Compliance

Built on industry frameworks to integrate with your existing security program.

OWASP LLM Top 10 (10/10)
MITRE ATLAS (35+ techniques)
OCSF v1.3
CEF (SIEM Export)
OpenTelemetry (OTLP)
Prometheus Metrics
Immutable Audit Log
OWASP Agentic Top 10 (ASI)
CSA AI Trust Framework
SOC 2 Type II
NIST AI RMF
EU AI Act
ISO 42001
GDPR
HIPAA

Open Source. Local-First. Community-Driven.

Install AgentsMon in seconds. All data stays on your machine. No cloud dependency, no telemetry, no vendor lock-in. The Sentinel Agent starts automatically — set AI_API_KEY to enable AI-powered investigations.

Python SDK
$ pip install agentsmon
Node.js SDK
$ npm install @agentsmon/sdk
OpenClaw Plugin
$ clawhub install agentsmon
Or clone the backend directly
$ git clone https://github.com/agentsmon/agentsmon.git
$ cd agentsmon && npm install
$ npm run dev