freshcrate
Skin:/
Home > #ai-safety

Tag: #ai-safety

44 packages • ⭐ 2,668 total stars

ISC-Benchv0.0.6🌳 Mature799

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

cordumv1.1.0🌿 Growing465

The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.

@openguardrails/moltguardmain@2026-05-01🌿 Growing342

AI agent security plugin for OpenClaw: prompt injection detection, PII sanitization, and monitoring dashboard

rust-docs-mcp-serverv1.3.1💤 Dormant270

🦀 Prevents outdated Rust code suggestions from AI assistants. This MCP server fetches current crate docs, uses embeddings/LLMs, and provides accurate context via a tool call.

orbitv2.7.1🌿 Growing250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

NeuronFSmain@2026-05-06🌿 Growing137

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI — ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

node9-proxyv1.29.0🌿 Growing118

The Execution Security Layer for the Agentic Era. Providing deterministic "Sudo" governance and audit logs for autonomous AI agents.

instarv0.17.14🌿 Growing59

Persistent Claude Code agents with scheduling, sessions, memory, and Telegram.

arifOSv2026.05.22-birthday🌱 Seedling41

ArifOS — Constitutional MCP kernel for governed AI execution. AAA architecture: Architect · Auditor · Agent. Built for the open-source agentic era.

arifosv2026.05.22-birthday🌱 Seedling41

ArifOS — Constitutional MCP kernel for governed AI execution. AAA architecture: Architect · Auditor · Agent. Built for the open-source agentic era.

COREv2.6.0🌱 Seedling30

A thing that uses AI to write perfect applications. For those who want to know how: a governance runtime enforcing immutable constitutional rules on AI coding agents.

speclockv5.5.2🌱 Seedling22

AI Constraint Engine by Sandeep Roy — stops AI from breaking what you locked. 100/100 on Claude's adversarial test suite. 42 MCP tools. Works with Bolt.new, Lovable, Claude Code, Cursor. Free & open s

ThumbGatev1.27.2🌱 Seedling16

Self-improving agent governance: 👍/👎 → Pre-Action Gates that block repeat AI mistakes. Stop paying for the same mistake twice.

Nrekiv11.4.2🌱 Seedling10

MCP plugin that intercepts AI agent edits in RAM, validates them (TypeScript compiler + gopls + pyright), auto-heals missing imports, and commits atomically. If anything breaks, disk stays untouched

claude-scholarmain@2026-06-01🌱 Seedling9

🚀 Simplify your research workflow with Claude Scholar, the complete configuration for Claude Code in data science, AI, and academic writing.

moralstackv0.4.0🌱 Seedling8

MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe

pattern8main@2026-06-05🌱 Seedling7

Enforce zero-trust rules for AI agents to prevent hallucinations, unsafe actions, and policy bypasses

ASAN-Architecture0.0.0💤 Dormant6

ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.

contemplative-agentv2.5.0🌱 Seedling4

A self-improving AI agent that learns from experience. Runs entirely on a local 9B model. Security by absence — dangerous capabilities were never built.

aletheiamain@2026-06-04🌱 Seedling4

Operating framework for AI-assisted work with decision, governance, validation, and learnings before execution.

Secure-Agent-Launchermain@2026-06-03🌱 Seedling3

Block AI agent access to sensitive macOS paths and log all actions to protect private data during command execution.

fourgodsmaster@2026-04-19🌱 Seedling3

AI 助手的模組化能力框架:記憶、防禦、診斷、品質穩定 | Modular capability framework for AI assistants | Claude Code / Cursor / Any LLM

awesome-anthropicmain@2026-06-05🌱 Seedling2

A curated, daily-updated list of awesome resources, tools, SDKs, papers, and projects for Anthropic & Claude AI

algorithm-11v1.0.0🌱 Seedling2

A structured reasoning and decision architecture for stable, interpretable, and hallucination‑resistant AI systems. An open standard for human–AI collaboration and autonomous systems.

AgentGuardmain@2026-06-05🌱 Seedling1

Protect AI agents by detecting and blocking prompt, command injection, Unicode bypass, and social engineering attacks with customizable security controls.

artguardmain@2026-06-04🌱 Seedling1

Scan AI artifacts like agent skills and config files for security risks, privacy issues, and instruction-level attacks with a Python CLI tool.

Riverbraid-Goldsmain@2026-06-03🌱 Seedling1

Cluster manifest, orchestration, and stationary state verification for Riverbraid.

Riverbraid-GPG-Goldmain@2026-06-03🌱 Seedling1

The identity anchor and sovereign GPG verification petal for the Riverbraid organization.

Riverbraid-Cognitionmain@2026-06-02🌱 Seedling1

Cognitive architecture and meaning processing layer adjacent to the Riverbraid core.

Riverbraid-Interface-Goldmain@2026-06-02🌱 Seedling1

The deterministic UI contract and relational interface substrate for the Riverbraid cluster.

Riverbraid-Crypto-Goldmain@2026-06-02🌱 Seedling1

Cryptographic integrity layer for Riverbraid seals, hashes, and signatures.

Riverbraid-Temporal-Goldmain@2026-06-02🌱 Seedling1

Temporal contracts and governed time based state logic for Riverbraid.

phronesisermain@2026-06-02🌱 Seedling1

Add provably safe ethical constraints to AI agents via Phronesis

Riverbraid-Manifest-Goldmain@2026-05-28🌱 Seedling1

The central directory and Merkle Root mapping for the 17-petal Riverbraid v1.5.0 substrate.

System-Constitutionmain@2026-04-16🌱 Seedling1

🚀 Define your architecture with System Constitution to keep your AI coding agents in check, ensuring stability and compliance as your project evolves.

.githubv1.5.0-genesis🌱 Seedling1

Organization profile and public entry surface for Riverbraid.

Neuroverseos-governancev0.3.0🌱 Seedling1

Deterministic governance engine for AI agents. Enforce rules defined in .md governance files across AI systems.