Search results for "embedding"
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
The memory system your AI agent deserves. 4-stage hybrid retrieval โ Vector + BM25 + Knowledge Graph + Neural Reranker โ in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
The agent engineering platform
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
A Markdown-first memory system, a standalone library for any AI agent. Inspired by OpenClaw.
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.
Build, deploy, and orchestrate event-driven agents natively on Apache Flinkยฎ and Apache Kafkaยฎ
Memory library for building stateful agents
Dragon Brain โ persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
AI-powered spec generation and review using multi-repo code graph intelligence for backend teams that ship to production.
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx, and other RTI services (co
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)
LlamaIndex is the leading document agent and OCR platform
METAโAGENTIC ฮฑโAGI ๐๏ธโจ โ Mission ๐ฏ Endโtoโend: Identify ๐ โ OutโLearn ๐ โ OutโThink ๐ง โ OutโDesign ๐จ โ OutโStrategise โ๏ธ โ OutโExecute โก
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo with: medusa scan --git user/repo
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified framework for building enterprise RAG pipelines with small, specialized models
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
Give your AI agents persistent memory.
The Pinecone Python client
Nextcloud MCP Server
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPIโuse as library or standalone service.
Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal
๐ง Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.
"RAG-Anything: All-in-One RAG Framework"
High-Performance Engine for Multi-Vector Search
Lightweight semantic code search engine โ 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
The highest-scoring AI memory system ever benchmarked that isn't reliant on LLM reranking. And it's free & burns less tokens.
๐ก All-in-one AI framework for semantic search, LLM orchestration and language model workflows
๐ง Enhance AI conversations with Cognio, a persistent memory server that retains context and enables meaningful semantic search across sessions.
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Project CodeGuard is an open-source, model-agnostic security framework that embeds secure-by-default practices into AI coding agent workflows. It provides comprehensive security rules that guide AI as
An easy-to-use framework for modular RAG
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Local-first Agentic Memory Layer for MCP Agents โข 25 tools โข Hybrid search (FTS5 + vector + MMR) โข GDPR โข 100% local
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
๐ Enhance retrieval with REFRAG, using micro-chunking and fast indexing for optimized RAG systems that improve efficiency and effectiveness.
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
๐ Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.
Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API
Broken RAG For The Broken Souls
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate โ built to help studen
PromptManager is a desktop application for cataloguing, searching, and executing AI prompts, and much more.
๐ค Recommend TV shows by matching favorites, averaging embeddings, and finding similar titles using fuzzy search and vector similarity.
๐ ๏ธ Simplify your tasks with MineContext, an open-source AI tool that provides context-aware support for clarity and efficiency in work and study.
No description
Turn AI into a persistent, memory-powered collaborator. Universal MCP Server (supports HTTP, STDIO, and WebSocket) enabling cross-platform AI memory, multi-agent coordination, and context sharing. Bui
Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services
๐ฆพ A productionโready research outreach AI agent that plans, discovers, reasons, uses tools, autoโbuilds cited briefings, and drafts tailored emails with toolโchaining, memory, tests, and turnkey Dock
