freshcrate

Search results for "corpus"

51 results found
trafilatura · 2.0.0 · 🏛️ Flagship · ⭐ 5,758

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML.

gensim · 4.4.0 · 🏛️ Flagship · ⭐ 16,395

Python framework for fast Vector Space Modelling

ring · ring-tw-team@0.4.3 · 🌿 Growing · ⭐ 175

89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles, as a Claude

minutes · v0.13.3 · 🌳 Mature · ⭐ 1,116

Every meeting, every idea, every voice note: searchable by your AI. Open-source, privacy-first conversation memory layer.

claude-code-skills · v2026.04.21 · 🌿 Growing · ⭐ 416

Plugin suite + bundled MCP servers for Claude Code. Full delivery lifecycle: Agile pipeline with multi-model AI review, project bootstrap, documentation generation, codebase audits, performance optimi

pdf_oxide · v0.3.37 · 🌳 Mature · ⭐ 630

The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.

npcpy · v1.4.21 · 🌳 Mature · ⭐ 1,307

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

vespa · v8.675.23 · 🏛️ Flagship · ⭐ 6,886

AI + Data, online. https://vespa.ai

SeekStorm · v3.0.0 · 🌳 Mature · ⭐ 1,865

SeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.

mentisdb · 0.9.3.39 · 🌿 Growing · ⭐ 64

Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett

Vibe-Skills · v3.0.4 · 🌳 Mature · ⭐ 1,645

Vibe-Skills is an all-in-one AI skills package. It seamlessly integrates expert-level capabilities and context management into a general-purpose skills package, enabling any AI agent to instantly upgr

synaptic-memory · v0.16.0 · 🌱 Seedling · ⭐ 27

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

llmtrace · v0.2.0 · 🌱 Seedling · ⭐ 46

Zero-code LLM security & observability proxy. Real-time prompt injection detection, PII scanning, and cost control for OpenAI-compatible APIs. Built in Rust.

paiml-mcp-agent-toolkit · v3.14.0 · 🌿 Growing · ⭐ 149

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

CoWork-OS · v0.5.35 · 🌿 Growing · ⭐ 240

Operating System for your personal AI Agents with Security-first approach. Multi-channel (WhatsApp, Telegram, Discord, Slack, iMessage), multi-provider (Claude, GPT, Gemini, Ollama), fully self-hosted

semiont · v0.4.20 · 🌿 Growing · ⭐ 52

Semiont supports human+ai collaborative knowledge work. Use it as: a Wiki, Semantic Layer, Context Graph, Knowledge Base, Annotator, Research Tool, or Agentic Memory...

prism-mcp · v9.3.0 · 🌿 Growing · ⭐ 128

The Mind Palace for AI Agents: Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi

UltraRAG · v0.3.0.2 · 🌳 Mature · ⭐ 5,510

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

LRAT · 0.0.0 · 🌱 Seedling · ⭐ 39

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

AutoRAG · v0.3.22 · 🌳 Mature · ⭐ 4,713

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Wax · waxmcp-v0.1.19 · 🌳 Mature · ⭐ 711

Single-file memory layer for AI agents, sub-millisecond RAG on Apple Silicon. Metal Optimized On-Device. No Server. No API. One File. Pure Swift

cyllama · 0.2.11 · 🌱 Seedling · ⭐ 25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

aiwg · v2026.3.2 · 🌿 Growing · ⭐ 120

Cognitive architecture for AI-augmented software development. Specialized agents, structured workflows, and multi-platform deployment. Claude Code · Codex · Copilot · Cursor · Factory · Warp · Windsur

DeepCode · v1.2.0 · 🏛️ Flagship · ⭐ 15,244

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

OpenContracts · v3.0.0.b4 · 🌳 Mature · ⭐ 1,283

Humans and AI agents, building knowledge bases together. Self-hosted document annotation, version control, semantic search, and MCP.

arag · v0.1.0 · 🌿 Growing · ⭐ 252

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

cortexdb · v2.18.1 · 🌱 Seedling · ⭐ 32

A lightweight, embeddable vector database library for Go AI projects.

spacy-loggers · 1.0.5 · 🌱 Seedling · ⭐ 12

Logging utilities for spaCy

freelance · v1.3.3 · 🌱 Seedling · ⭐ 7

Graph-based workflow enforcement and persistent memory for AI coding agents. Define structured workflows in YAML. Enforce them at tool boundaries via MCP. Build a persistent knowledge graph that grow

multi-agent-ralph-loop · main@2026-04-20 · 🌿 Growing · ⭐ 126

Autonomous orchestration framework for Claude Code with MemPalace-inspired memory (4-layer stack, 818-token wake-up), parallel-first Agent Teams (6 teammates), Aristotle First Principles methodology,

agentic-chatops · main@2026-04-20 · 🌿 Growing · ⭐ 100

3-tier agentic ChatOps (n8n + GPT-4o + Claude Code) implementing all 21 patterns from "Agentic Design Patterns"; solo operator managing 137 devices

yao-meta-skill · main@2026-04-19 · 🌿 Growing · ⭐ 297

YAO = Yielding AI Outcomes. A lightweight but rigorous system for creating, evaluating, packaging, and governing reusable agent skills.

AGI-Alpha-Agent-v0 · main@2026-04-18 · 🌿 Growing · ⭐ 284

META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

Matryoshka · main@2026-04-18 · 🌿 Growing · ⭐ 122

MCP server for token-efficient large document analysis via the use of REPL state

Awesome-Agent-Memory · main@2026-04-16 · 🌿 Growing · ⭐ 363

Curated systems, benchmarks, papers, etc. on memory for LLMs/MLLMs: long-term context, retrieval, and reasoning.

rag-chatbot · main@2026-04-14 · 🌿 Growing · ⭐ 407

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

plamen · main@2026-04-09 · 🌿 Growing · ⭐ 220

Autonomous Web3 security audit agent for Claude Code

agentshield · v1.4.0 · 🌿 Growing · ⭐ 522

AI agent security scanner. Detect vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, ECC plugin, and GitHub App integration. 🛡️

engram-memory · v1.0.0 · 🌱 Seedling · ⭐ 71

Agent memory and conflict detection platform. We're hiring contributors; check HIRING.md

markdown-vault-mcp · v1.27.0 · 🌱 Seedling · ⭐ 5

Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing

synthadoc · v0.1.0 · 🌱 Seedling · ⭐ 66

Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-

kernel · v3.97.0 · 🌱 Seedling · ⭐ 12

kbot: the AI agent that dreams, learns, and evolves. 764+ tools, 35 agents, 20 providers. Music production, iPhone control, financial analysis, cyber threat intel. Always-on daemon. Runs offline. npm

dory · v0.1.0 · 🌱 Seedling · ⭐ 14

One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.

locallens · v0.0.3 · 🌱 Seedling · ⭐ 7

Search your files by talking to them, 100% offline

claude-ruby-grape-rails · v1.13.4 · 🌱 Seedling · ⭐ 5

Claude Code plugin for Ruby, Rails, Grape, PostgreSQL, Redis, and Sidekiq development

second-brain · 1.0 · 🌱 Seedling · ⭐ 461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

agentic-news-generator · main@2026-04-20 · 🌱 Seedling · ⭐ 1

Generate a custom newspaper with an AI agent based on your favorite YouTube channels.

artguard · main@2026-04-21 · 🌱 Seedling · ⭐ 1

Scan AI artifacts like agent skills and config files for security risks, privacy issues, and instruction-level attacks with a Python CLI tool.

pyannote-audio · 4.0.4 · 🌱 Seedling

State-of-the-art speaker diarization toolkit