Search results for "documents"
Microsoft Azure Cognitive Search Client Library for Python
Microsoft Azure Blob Storage Client Library for Python
SPDX parser and tools.
Python API and tools to manipulate OpenDocument files
Tools for stamping and signing PDF files
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Python tool and library for decrypting and encrypting MS Office files using a password or other keys
Lightweight, extensible schema and data validation tool for Pythondictionaries.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
Agentic DOCX Redlining Engine. Enables LLMs to read Word documents and inject native Track Changes (w:ins, w:del) and Comments without breaking formatting. Includes Model Context Protocol (MCP) Server
Python client library for IBM Cloudant
A sphinx extension that automatically documents argparse commands and options
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
LLM-powered knowledge base from your Claude Code, Codex CLI, Copilot, Cursor & Gemini sessions. Karpathy's LLM Wiki pattern — implemented and shipped.
89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles — as a Claude
Autonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.
⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.
An open source, privacy focused alternative to NotebookLM for teams with no data limit's. Join our Discord: https://discord.gg/ejRNvftDp9
Open Source AI Platform - AI Chat with advanced features that works with every LLM
One brain, many harnesses. Portable .agent/ folder (memory + skills + protocols) that plugs into Claude Code, Cursor, Windsurf, OpenCode, OpenClaw, Hermes, or DIY Python — and keeps its knowledge when
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Knowledge Engine for AI Agent Memory in 6 lines of code
LlamaIndex is the leading document agent and OCR platform
Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
RAG pipeline security testing toolkit - 27 techniques across 6 kill chain phases, mapped to MITRE ATLAS
MCP server that gives any LLM its own computer — managed Docker workspaces with live browser, terminal, code execution, document skills, and autonomous sub-agents. Self-hosted, open-source, pluggable
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Infrastructure that connects LLMs to ERPNext. Frappe Assistant Core works with the Model Context Protocol (MCP) to expose ERPNext functionality to any compatible Language Model
MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search & Drive with AI - Comprehensive Google Workspace / G Suite MCP Server & CLI Tool
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
The Multi-Agent Custom Automation Engine Solution Accelerator is an AI-driven system that manages a group of AI agents to accomplish tasks based on user input. Powered by Microsoft Agent Framework, Az
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
A Claude Code skill that turns your Obsidian vault into a living second brain — autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Give your AI agents persistent memory.
AI-powered development framework with task management, 41 agents, 83 skills, and MCP tools for Cursor, Claude Code, Gemini, Codex & OpenCode. File-based memory that survives across sessions.
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
OSCAL tools for AI agents
The official Python SDK for Model Context Protocol servers and clients
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
Harness LLMs with Multi-Agent Programming
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
"RAG-Anything: All-in-One RAG Framework"
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
High-Performance Engine for Multi-Vector Search
Open-source multi-agent AI assistant powered by LangGraph, FastAPI & Next.js — 16+ agents, Human-in-the-Loop, MCP integration, voice TTS, RAG, 500+ metrics, 6 languages.
Crawl4AI MCP Server: Extract content from web pages, PDFs, Office docs, YouTube videos with AI-powered summarization. 17 tools, token reduction, production-ready.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
Lad MCP Server: Autonomous code & system design review for AI coding agents (Claude Code, Cursor, Codex, etc.). Features multi-model consensus via OpenRouter and context-aware reviews via Serena.
Ultra-Lightweight, Pure Python Multimodal Agent.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Tools for labeling human languages with IETF language tags
Graph RAG with pure vector search, achieving SOTA performance in multi-hop reasoning scenarios.
PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Structured Outputs
RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker
AI Agent Backend Platform on FastAPI — MCP server + AI orchestration + async DDD architecture. Zero-boilerplate CRUD, auto domain discovery, 14 Claude Code AI development skills.
JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.
A Model Context Protocol (MCP) server that interfaces with Adobe Photoshop's Python API. Enables LLMs to execute image editing operations, automate workflows, and manage Photoshop tasks through struct
🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting🛡⚔️
One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.
AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Unified framework for building enterprise RAG pipelines with small, specialized models
Humans and AI agents, building knowledge bases together. Self-hosted document annotation, version control, semantic search, and MCP.
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
Creates JUnit XML test result documents that can be read by tools such as Jenkins
Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-
Prompt Driven Development Command Line Interface
Build, deploy, and orchestrate event-driven agents natively on Apache Flink® and Apache Kafka®
Agentic Coding Rules, Templates etc...
Memory library for building stateful agents
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
🔥 An autonomous AI agent that runs your deep learning experiments 24/7 while you sleep. Zero-cost monitoring, Leader-Worker architecture, constant-size memory.
Framework for AI agents to build and maintain an Obsidian wiki using Karpathy's LLM Wiki pattern
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
AI travel planner with 7 specialized agents, RAG, and tool-calling. Built with CrewAI & LangChain. Generates personalized itineraries with flights, hotels, activities, and cultural tips. Production-re
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
The Developer's Guide to AI - A Field Guide for the Working Developer
Markdown-first work-memory protocol for existing agents, with maintained knowledge, candidate notes, evals, and an example KB.
Agentic AI assistant on Telegram, powered by Claude Code. Runs locally with shell access, spec-driven PR reviews, layered security, persistent memory, and scheduled jobs. Your machine, your data, your
Description: Self-hosted graph-based associative memory for personal AI agents. Spreading activation, emotional weighting, zero LLM cost.
MCP server for guiding Coding Agents via end-to-end requirements to implementation plan pipeline
Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing
🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil
A thing that uses AI to write perfect applications. For those who want to know how: a governance runtime enforcing immutable constitutional rules on AI coding agents.
Auto-Use Computer Use — drives your OS, browser, scours the web, writes your code. One agent, end to end.
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.
sphinxcontrib-devhelp is a sphinx extension which outputs Devhelp documents
MCP server for searching and retrieving Claude Agent Skills using vector search
Search your files by talking to them - 100% offline
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Starter app for building AI SaaS (RAG, Agentic workflow) applications
sphinxcontrib-qthelp is a sphinx extension which outputs QtHelp documents
An open-source, self-hosted API that turns standard email providers (Mailgun, SES, SendGrid) into "Inbox-as-a-Service" for AI Agents.
Autonomous overnight codebase improvement agent for Claude Code. Run it before bed, wake up to production-ready fixes.
AI-powered PRD generation for Claude Code with taskmaster integration
AI patient advocacy tool for cancer treatment. Understand labs, find clinical trials, track treatment — all from your phone. Open source, used in active treatment.
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
Assistant IA avancé (RAG, outils, Légifrance, OCR, skills, export de fichiers, historique) conçu principalement pour un usage avec AlbertAPI (DiNum)
The open framework for extensible & grounded AI agent orchestration.
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
Python LLM-RAG deep agent using LangChain, LangGraph and LangSmith built on Quart web microframework and served using Hypercorn ASGI and WSGI web server.
Broken RAG For The Broken Souls
Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.
Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
The localhost AI Agent Runtime -- Chat UI, Tools, RAG, and MCP in one pip install
Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API
Enforce zero-trust rules for AI agents to prevent hallucinations, unsafe actions, and policy bypasses
Practical CLI tool for maintaining open source repositories.
Hybrid cloud-local AI Employee that runs 24/7 on a cloud VM, monitors Gmail/WhatsApp, drafts responses, and queues approvals via git-synced Obsidian vault. Human-in-the-loop safety gates for email, so
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
📚 Learn from diverse sources with OmniLearnAI, an intelligent platform that combines documents, videos, and more, all with reliable citations.
Decrypt WeChat databases on macOS by extracting encryption keys to access and export chat records with support for searching and AI query integration.
Automate shell tasks using a local Ollama model that plans, executes, and fixes commands without cloud or API dependencies.
📄 Enable smart document and data search with AI-powered chat, vector search, and SQL querying across multiple file formats.
🤖 Generate secure, automated repo documentation and pull request checks with a safe-by-default toolchain for coding agents.
🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.
Automate binary analysis by coordinating LLM agents with Ghidra, enabling scalable and precise reverse engineering workflows.
🧠 Build an offline RAG chatbot to answer questions from PDFs, adapting responses based on user experience levels with a smooth chat interface.
Enable any language model with permanent, searchable memory using a lightweight middleware for on-demand retrieval and continuous learning.
🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock
An event-driven, async-first, step-based way to control the execution flow of AI applications like Agents.
Microsoft Corporation Azure AI Projects Client Library for Python
llama-index indices llama-cloud integration
Assistant plugin for Pinecone SDK
PyMuPDF Layout turns PDFs into structured data 10× faster than vision-based tools using AI trained on PDF internals, not images. CPU-only. No GPU required.
A structured reasoning and decision architecture for stable, interpretable, and hallucination‑resistant AI systems. An open standard for human–AI collaboration and autonomous systems.
A stateful AI agent framework powered by the Cognitive Lattice to solve complex tasks with persistent memory and reliable tool orchestration.
Demo RAG API (FastAPI, OpenAI, ChromaDB, Docker) automatically generated using the OpenAI Codex CLI tool. Highlights Codex's capability for rapid, complex application development.
