Search results for "ocr"
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
LlamaIndex is the leading document agent and OCR platform
AgenticX is a unified, production-ready multi-agent platform โ Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m
Assistant IA avancรฉ (RAG, outils, Lรฉgifrance, OCR, skills, export de fichiers, historique) conรงu principalement pour un usage avec AlbertAPI (DiNum)
๐ 2026 ๆ็ณป็ป็ AI Agent ้ๆๆๅ๏ฝๆบ่ฝไฝๅฎๆๆ็จ ยท ๅฎๆดๅญฆไน ่ทฏๅพ + ๅฎๆ้กน็ฎ + ้ข่ฏ้ขๅบ ยท ๅฏนๆ ๅคงๆจกๅๅบ็จๅผๅๅทฅ็จๅธๅฒไฝ ยท ่ฆ็LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธ็บง้จ็ฝฒไธๅพฎ่ฐ ยท ไป0ๅฐไผไธ็บง่ฝๅฐ + ไปๅญฆไน ๅฐไธ็บฟ้กน็ฎ + ้ข่ฏๅๅคไธไฝๅ
Unified framework for building enterprise RAG pipelines with small, specialized models
AI skills that turns coding agents into UiPath experts.
Search your files by talking to them - 100% offline
Build autonomous AI agents in Python.
Open security scanner for AI supply chain: agents, MCP, containers, cloud, GPU, and runtime with blast-radius analysis.
Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf
MCP server that gives any LLM its own computer โ managed Docker workspaces with live browser, terminal, code execution, document skills, and autonomous sub-agents. Self-hosted, open-source, pluggable
๐ค The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Nextcloud MCP Server
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
"RAG-Anything: All-in-One RAG Framework"
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
๐ซ CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features โ fully self-hosted via Ollama.
๐ Enable local LLMs with real-time Google search, live feeds, OCR, and video insights using noapi-google-search-mcp server tools.
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate โ built to help studen
Agentica: Lightweight async-first Python framework for AI agents. ่ฝป้็บงๅผๆญฅไผๅ ็AI Agentๆกๆถ๏ผๆฏๆๅทฅๅ ท่ฐ็จใRAGใๅคๆบ่ฝไฝๅMCPใ
Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
An AI guardian that remembers, watches, and acts.
The official Python library for the llama-cloud API
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Microsoft Azure Cognitive Search Client Library for Python
Interface between LLMs and your data
