Tag: #rag
83 packages • ⭐ 331,157 total stars
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Build AI Agents, Visually
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future ses
Build resilient language agents as graphs.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
An open source, privacy focused alternative to NotebookLM for teams with no data limit's. Join our Discord: https://discord.gg/ejRNvftDp9
A lightweight, lightning-fast, in-process vector database
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python
AI + Data, online. https://vespa.ai
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google
HelixDB is an open-source graph-vector database built from scratch in Rust.
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
EdegQuake 🌋 High-performance GraphRAG inspired from LightRag written in Rust; Transform documents into intelligent knowledge graphs for superior retrieval and generation
TrustRAG:The RAG Framework within Reliable input,Trusted output
Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF creation & editing. 0.8ms mean, 5× faster than industry leaders, 100% pass rate on 3,830 PDFs.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Agentic RAG R1 Framework via Reinforcement Learning
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
Give your AI agents persistent memory.
All-in-one terminal workspace — local shells, SSH, SFTP, remote IDE, AI agent, and file manager in a single native binary. Built with Tauri 2 and pure Rust SSH (no OpenSSL). Smart reconnect, MCP, RAG,
Official Repo of Moss
Swift-based vector database for on-device RAG using MLTensor and MLX Embedders
FlexRAG: A RAG Framework for Information Retrieval and Generation.
Buddhist Digital Text Platform — 9,200+ texts, 500+ sources, 8 UI languages, AI Q&A (RAG), knowledge graph, full-text search
Nextcloud MCP Server
A framework for fine-tuning retrieval-augmented generation (RAG) systems.
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
🦀 Agentic RAG for drug intelligence · 57 skills · 15 task categories · DTI · ADR · DDI · PGx · Repurposing · Powered by LangGraph
🐙 Drop-in tools that give AI agents reliable, permission-aware access to external systems.
OasisDB: A minimal and lightweight vector database
编程导航 2025 年 AI 开发实战新项目,基于 Spring Boot 3 + Java 21 + Spring AI 构建 AI 恋爱大师应用和 ReAct 模式自主规划智能体YuManus,覆盖 AI 大模型接入、Spring AI 核心特性、Prompt 工程和优化、RAG 检索增强、向量数据库、Tool Calling 工具调用、MCP 模型上下文协议、AI Agent 开发、Curs
Complete open-source AI collaboration suite and multi-agent platform featuring LLM orchestration, automation, and virtual assistants. Scales seamlessly from small deployments to large enterprise envir
:sparkles: :dna: Turing ES - Enterprise Search, Semantic Navigation, Chatbot using Search Engine and Generative AI.
Build, deploy, and orchestrate event-driven agents natively on Apache Flink® and Apache Kafka®
Teleton: Autonomous AI Agent for Telegram & TON Blockchain
The app framework built for AI coding agents. Own every line. Your AI already knows how to build on it.
Reference Implementations for the RAG bootcamp
This project implements a comprehensive framework for Knowledge Graph Retrieval Augmented Generation (KG-RAG). It focuses on financial data from SEC 10-Q filings and explores how knowledge graphs can
A lightweight, embeddable vector database library for Go AI projects.
Graph RAG with pure vector search, achieving SOTA performance in multi-hop reasoning scenarios.
Paper-first SPY options validation platform with broker-backed scorecards, hard risk gates, paired-trade accounting, and live dashboards.
All-in-one local AI hub for Obsidian — LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
RAG pipeline security testing toolkit - 27 techniques across 6 kill chain phases, mapped to MITRE ATLAS
The memory system your AI agent deserves. 4-stage hybrid retrieval — Vector + BM25 + Knowledge Graph + Neural Reranker — in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb
Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Universal memory layer for AI applications. Self-host in minutes. Open source.
Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.
Provide a local, privacy-focused AI assistant for Telegram that runs fully on your machine without sending data to the cloud.
Stream responses from OpenAI and Anthropic models with lightweight C++ tools for efficient large language model integration.
Provide token-efficient, distilled QA docs for AI coding agents to generate accurate test code quickly and reduce token usage significantly
Enable AI agents to autonomously create, evaluate, and evolve skills across any marketplace without user intervention.
Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API
📰 Fetch and summarize news articles locally using a Retrieval-Augmented Generation system powered by AI models for efficient information access.
Provide a unified context intelligence layer for AI agents with seamless integration and full capability in a single Python package.
Enable AI agents to retain memory across sessions using persistent storage designed for continuous context retention.
Enable AI agents with fast, local semantic memory to search and recall knowledge from text files without servers or complex setup.
Search and analyze medical literature across PubMed, ClinicalTrials.gov, and Europe PMC using AI to support clinical and research decisions.
Enable AI agents to search, crawl, and extract web data with IP rotation, CAPTCHA handling, and rate limit management via CLI and Python.
Transform Claude into a local AI assistant for Mac that controls apps, manages tasks, and remembers context across sessions.
Extract tables precisely from PDFs and convert them to clean HTML for RAG pipelines, running fast on CPU without external dependencies.
Simulate antenna designs instantly in your browser using NEC2-powered, open-source software with WebAssembly and Docker support.
Store, consolidate, and recall coding agent memories with provenance tracking using SQLite and FAISS for fast, structured knowledge access.
Build semantic vector databases from code and docs to enable AI agents to understand and navigate your entire codebase effectively.
Capture and summarize Claude Code sessions into searchable, browsable engineering journals with a web UI and automated daily entries.
Enable local document ingestion and retrieval-augmented generation with a secure, .NET-based pipeline that keeps data on your machine.
Provide context-based, accurate answers to syllabus questions using AI powered by Retrieval-Augmented Generation for effective student learning.
Provide a curated list of tools, resources, and best practices for adopting AI-driven autonomous software development with agentic engineering.
Implement Recursive Language Models using Deno and Pyodide to enable scalable, code-driven prompt processing with modular sub-agent calls.
Control autonomous AI agents by enforcing behavior rules to prevent unauthorized actions, improve focus, and boost execution efficiency.
Detect underpriced products by scraping deals and using AI models to estimate fair prices for profit opportunity identification.
Demonstrate a proof-of-concept exploit for CVE-2026-2441, a high-risk Chrome use-after-free vulnerability in the Blink CSS engine.
Explore alternatives to Discord with a curated list of early-stage apps, evaluating features, hosting, and encryption to guide your choice.
Showcase delivers a modern developer portfolio built with TypeScript and React, focusing on interactivity and clean architecture for a seamless user experience.
Broken RAG For The Broken Souls
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
