freshcrate — #ai-safety

Home > #ai-safety

Tag: #ai-safety

44 packages • ⭐ 2,668 total stars

ISC-Benchv0.0.6🌳 Mature⭐799

Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.

adversarial-attacks agent-safety ai-safety benchmark frontier-models jailbreak large-language-models llm-safety pythonby wuyoscar

cordumv1.1.0🌿 Growing⭐465

The open agent control plane. Govern autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and any framework.

agent-framework agentic-ai ai-agent ai-governance ai-orchestration ai-safety audit-trail autonomous-agents goby cordum-io

@openguardrails/moltguardmain@2026-05-01🌿 Growing⭐342

AI agent security plugin for OpenClaw: prompt injection detection, PII sanitization, and monitoring dashboard

ai-safety ai-security-gateway data-sanitization guard npm openclaw openguardrails prompt-injection securityby OpenGuardrails

rust-docs-mcp-serverv1.3.1💤 Dormant⭐270

🦀 Prevents outdated Rust code suggestions from AI assistants. This MCP server fetches current crate docs, uses embeddings/LLMs, and provides accurate context via a tool call.

ai ai-safety caching cargo coding-assistant context-aware crates-io developer-tools rustby Govcraft

orbitv2.7.1🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ai-assistant ai-gateway ai-safety anthropic chatbot developer-tools elasticsearch llm pythonby schmitech

NeuronFSmain@2026-05-06🌿 Growing⭐137

mkdir beats vector DB. B-tree NeuronFS: 0-byte folders govern AI — ₩0 infrastructure, ~200x token efficiency. OS-native constraint engine for LLM agents.

ai-agent ai-safety cursor-rules file-system go guardrails llm model-agnostic multi-agentby rhino-acoustic

node9-proxyv1.29.0🌿 Growing⭐118

The Execution Security Layer for the Agentic Era. Providing deterministic "Sudo" governance and audit logs for autonomous AI agents.

ai-safety ai-security claude-code gemini gemini-cli llm llm-agent mcp-server typescriptby node9-ai

instarv0.17.14🌿 Growing⭐59

Persistent Claude Code agents with scheduling, sessions, memory, and Telegram.

agent-framework agent-identity agent-infrastructure agent-memory agent-skills ai-agents ai-safety autonomous-agents typescriptby JKHeadley

arifOSv2026.05.22-birthday🌱 Seedling⭐41

ArifOS — Constitutional MCP kernel for governed AI execution. AAA architecture: Architect · Auditor · Agent. Built for the open-source agentic era.

agentic-ai agi ai ai-agents ai-governance ai-safety claude-code constitutional-ai model-context-protocol pythonby ariffazil

arifosv2026.05.22-birthday🌱 Seedling⭐41

ArifOS — Constitutional MCP kernel for governed AI execution. AAA architecture: Architect · Auditor · Agent. Built for the open-source agentic era.

agentic-ai agi ai ai-agents ai-governance ai-safety claude-code constitutional-ai model-context-protocol pythonby ariffazil

COREv2.6.0🌱 Seedling⭐30

A thing that uses AI to write perfect applications. For those who want to know how: a governance runtime enforcing immutable constitutional rules on AI coding agents.

agentic-ai ai-agents ai-governance ai-safety autonomous-agents autonomous-coding code-generation constitutional-ai pythonby DariuszNewecki

speclockv5.5.2🌱 Seedling⭐22

AI Constraint Engine by Sandeep Roy — stops AI from breaking what you locked. 100/100 on Claude's adversarial test suite. 42 MCP tools. Works with Bolt.new, Lovable, Claude Code, Cursor. Free & open s

agents-md ai-coding ai-safety claude-code code-quality constraint-engine copilot cursor javascript mcpby sgroy10

ThumbGatev1.27.2🌱 Seedling⭐16

Self-improving agent governance: 👍/👎 → Pre-Action Gates that block repeat AI mistakes. Stop paying for the same mistake twice.

agent-reliability ai-agents ai-cost-optimization ai-safety amp claude-code codex cursor javascript mcpby IgorGanapolsky

Nrekiv11.4.2🌱 Seedling⭐10

MCP plugin that intercepts AI agent edits in RAM, validates them (TypeScript compiler + gopls + pyright), auto-heals missing imports, and commits atomically. If anything breaks, disk stays untouched

acid-transactions ai-code-review ai-safety auto-healing claude-code code-safety gopls lsp mcp typescriptby Ruso-0

claude-scholarmain@2026-06-01🌱 Seedling⭐9

🚀 Simplify your research workflow with Claude Scholar, the complete configuration for Claude Code in data science, AI, and academic writing.

academic academic-papers academic-research ai-safety arxiv claude-code literature-review mcp texby jessevanwyk1

moralstackv0.4.0🌱 Seedling⭐8

MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe

activitypub ai-safety audit-trail compliance decentralized deliberative-ai euaiactcompliance federation prompt-engineering pythonby fdidonato

pattern8main@2026-06-05🌱 Seedling⭐7

Enforce zero-trust rules for AI agents to prevent hallucinations, unsafe actions, and policy bypasses

agent ai ai-safety claude cursor devtools framework gemini mcp pythonby NVFivem

ASAN-Architecture0.0.0💤 Dormant⭐6

ASAN: A conceptual architecture for a self-creating (autopoietic), energy-efficient, and governable multi-agent AI system.

agent-caching ai-architecture ai-efficiency ai-framework ai-governance ai-safety asan autopoietic-aiby Variable-Fox

contemplative-agentv2.5.0🌱 Seedling⭐4

A self-improving AI agent that learns from experience. Runs entirely on a local 9B model. Security by absence — dangerous capabilities were never built.

agent-framework agent-simulation ai-agent ai-ethics ai-safety ai-security autonomous-agent contemplative-ai pythonby shimo4228

aletheiamain@2026-06-04🌱 Seedling⭐4

Operating framework for AI-assisted work with decision, governance, validation, and learnings before execution.

agentic-systems ai ai-developer-tools ai-development ai-framework ai-safety ai-systems artificial-intelligence typescriptby nevitonsantana

Secure-Agent-Launchermain@2026-06-03🌱 Seedling⭐3

Block AI agent access to sensitive macOS paths and log all actions to protect private data during command execution.

agents ai ai-agent ai-agents ai-safety ai-security claude cli pythonby fobi28

fourgodsmaster@2026-04-19🌱 Seedling⭐3

AI 助手的模組化能力框架：記憶、防禦、診斷、品質穩定 | Modular capability framework for AI assistants | Claude Code / Cursor / Any LLM

ai-agent ai-assistant ai-framework ai-memory ai-safety ai-security ai-tools claudeby Ryo-Hunter

awesome-anthropicmain@2026-06-05🌱 Seedling⭐2

A curated, daily-updated list of awesome resources, tools, SDKs, papers, and projects for Anthropic & Claude AI

ai ai-safety anthropic awesome awesome-list claude claude-ai claude-api html model-context-protocolby Omrigotlieb

algorithm-11v1.0.0🌱 Seedling⭐2

A structured reasoning and decision architecture for stable, interpretable, and hallucination‑resistant AI systems. An open standard for human–AI collaboration and autonomous systems.

ai ai-collaboration ai-framework ai-safety alignment architecture artificial-intelligence autonomous-systems pythonby gormenz-svg

AgentGuardmain@2026-06-05🌱 Seedling⭐1

Protect AI agents by detecting and blocking prompt, command injection, Unicode bypass, and social engineering attacks with customizable security controls.

ai ai-agents ai-governance ai-regulation ai-safety anthropic-claude claude debugging mcp pythonby astecka-m

artguardmain@2026-06-04🌱 Seedling⭐1

Scan AI artifacts like agent skills and config files for security risks, privacy issues, and instruction-level attacks with a Python CLI tool.

ai ai-agent ai-safety aiartdetection aivshuman artauthentication artguard arttech mcpby Zorropiscina

Riverbraid-Refusal-Goldmain@2026-06-03🌱 Seedling⭐1

Deterministic refusal and boundary enforcement layer for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Goldsmain@2026-06-03🌱 Seedling⭐1

Cluster manifest, orchestration, and stationary state verification for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Safety-Goldmain@2026-06-03🌱 Seedling⭐1

Riverbraid v1.5.0 | Resonant Intelligence Architecture

agent-framework ai-framework ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence deterministic javascriptby Riverbraid

Riverbraid-GPG-Goldmain@2026-06-03🌱 Seedling⭐1

The identity anchor and sovereign GPG verification petal for the Riverbraid organization.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Coremain@2026-06-03🌱 Seedling⭐1

Foundational invariants and verification surfaces for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Cognitionmain@2026-06-02🌱 Seedling⭐1

Cognitive architecture and meaning processing layer adjacent to the Riverbraid core.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems alignment cognition javascriptby Riverbraid

Riverbraid-Memory-Goldmain@2026-06-02🌱 Seedling⭐1

Meaning scoped persistence and state retention rules for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Interface-Goldmain@2026-06-02🌱 Seedling⭐1

The deterministic UI contract and relational interface substrate for the Riverbraid cluster.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Crypto-Goldmain@2026-06-02🌱 Seedling⭐1

Cryptographic integrity layer for Riverbraid seals, hashes, and signatures.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Temporal-Goldmain@2026-06-02🌱 Seedling⭐1

Temporal contracts and governed time based state logic for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems clock-sync cognitive-architecture javascriptby Riverbraid

Riverbraid-Vision-Goldmain@2026-06-02🌱 Seedling⭐1

Governed vision input and perception contract surface for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Audio-Goldmain@2026-06-02🌱 Seedling⭐1

Governed audio input and output contract surface for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

Riverbraid-Action-Goldmain@2026-06-02🌱 Seedling⭐1

Governed action execution surface for Riverbraid.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

phronesisermain@2026-06-02🌱 Seedling⭐1

Add provably safe ethical constraints to AI agents via Phronesis

ai-safety code-generation constraints deontic-logic ethics hyperpolymath idris2 iser rustby hyperpolymath

Riverbraid-Manifest-Goldmain@2026-05-28🌱 Seedling⭐1

The central directory and Merkle Root mapping for the 17-petal Riverbraid v1.5.0 substrate.

agent-framework ai-framework ai-governance ai-reasoning ai-safety ai-systems cognitive-architecture decentralized-intelligence javascriptby Riverbraid

System-Constitutionmain@2026-04-16🌱 Seedling⭐1

🚀 Define your architecture with System Constitution to keep your AI coding agents in check, ensuring stability and compliance as your project evolves.

agent-architecture ai ai-safety alembic cli code-generation domain-driven-design fastapi typescriptby enescoban43

.githubv1.5.0-genesis🌱 Seedling⭐1

Organization profile and public entry surface for Riverbraid.

agent-framework ai-framework ai-reasoning ai-safety ai-systems cognitive-architecture deterministic-ai javascript modular-aiby Riverbraid

Neuroverseos-governancev0.3.0🌱 Seedling⭐1

Deterministic governance engine for AI agents. Enforce rules defined in .md governance files across AI systems.

agent-framework agent-guardrails agent-harness ai ai-agents ai-governance ai-guardrails ai-safety mcp-server typescriptby NeuroverseOS

Tag: #ai-safety

Trending in #ai-safety