Search results for "chunk"
Official Box Python SDK
SDK for interacting with LangGraph API
A collection of framework independent HTTP protocol utils.
More routines for operating on iterables, beyond itertools
Ultra-lightweight pure Python package to check if a file is binary or text.
The Context Optimization Layer for LLM Applications
β‘ Lightweight offline AI agent for local models. No cloud, no API keys β just your GPU.
Local knowledge graph for AI agents. Hybrid search + MCP server for Obsidian vaults.
Local-first identity, memory, and secrets for AI agents. Portable state across models and harnesses.
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Data transformation framework for AI. Ultra performant, with incremental processing. π Star if you like it!
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
SeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.
EdegQuake π High-performance GraphRAG inspired from LightRag written in Rust; Transform documents into intelligent knowledge graphs for superior retrieval and generation
RAG pipeline security testing toolkit - 27 techniques across 6 kill chain phases, mapped to MITRE ATLAS
Memory that lasts and compounds. MentisDB gives agents durable memory so they do not just remember, they improve over time. It stores append-only thought chains plus a Git-like skills registry, lett
Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T
Plano is an AI-native proxy and data plane for agentic apps β with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.
The worldβs fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generati
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra
The app framework built for AI coding agents. Own every line. Your AI already knows how to build on it.
The memory system your AI agent deserves. 4-stage hybrid retrieval β Vector + BM25 + Knowledge Graph + Neural Reranker β in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms
The Mind Palace for AI Agents β Autonomous Cognitive OS with affect-tagged memory (valence engine), token-economic RL (surprisal gate + UBI), Hebbian learning, ACT-R spreading activation, Synapse Engi
The official TypeScript/Node client for the Pinecone vector database
vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth
Give your AI agents persistent memory.
All-in-one local AI hub for Obsidian β LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.
trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
[GenAI Application Development Framework] π Build GenAI application quick and easy π¬ Easy to interact with GenAI agent in code using structure data and chained-calls syntax π§© Use Event-Driven Flow
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
Harness LLMs with Multi-Agent Programming
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
"RAG-Anything: All-in-One RAG Framework"
This is a Ruby implementation of MCP (Model Context Protocol) client
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.
AI-powered spec generation and review using multi-repo code graph intelligence for backend teams that ship to production.
Graph RAG with pure vector search, achieving SOTA performance in multi-hop reasoning scenarios.
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.
Anti-detection browser server for AI agents β REST API wrapping Camoufox engine with OpenClaw plugin support
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
This is MCP server for Claude that gives it terminal control, file system search and diff file editing capabilities
AI Agent Backend Platform on FastAPI β MCP server + AI orchestration + async DDD architecture. Zero-boilerplate CRUD, auto domain discovery, 14 Claude Code AI development skills.
Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.
Buddhist Digital Text Platform β 9,200+ texts, 500+ sources, 8 UI languages, AI Q&A (RAG), knowledge graph, full-text search
Unified framework for building enterprise RAG pipelines with small, specialized models
A lightweight, embeddable vector database library for Go AI projects.
3-tier agentic ChatOps (n8n + GPT-4o + Claude Code) implementing all 21 patterns from "Agentic Design Patterns" β solo operator managing 137 devices
ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.
MCP server for token-efficient large document analysis via the use of REPL state
OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp
Zero trust LLM gateway. OpenAI-compatible proxy with semantic routing and load balancing across OpenAI, Anthropic, Ollama, vLLM, and any compatible backend. Identity-based access, virtual A
Keryx: The Fullstack TypeScript Framework for MCP and APIs
π€ Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native.
Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing
Build your AI team with Crewform. Orchestrate specialized, autonomous agents to collaborate on complex tasks and connect outputs to your stack. β AI Orchestration for Everyone
Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-
A universal CLI for Weaviate, Milvus, Chroma, Qdrant, and other vector DBs to help view, list, create, delete, and search collections and documents in collections for development, test, and debugging
A Multi-Agentic AI Assistant/Builder
Production-grade TypeScript AI runtime focused on reliability, governance, and reproducible LLM systems. Multi-provider gateway, agents, RAG, workflows, policy engine, audit trails, and deterministic
MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer safe
One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.
Search your files by talking to them - 100% offline
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Control robots and physical hardware with natural language through Strands Agents.
β‘πΎ Vectro β Compress LLM embeddings π§ π Save memory, speed up retrieval, and keep semantic accuracy π―β¨ Lightning-fast quantization for Python + Mojo, vector DB friendly ποΈ, and perfect for RAG pip
A curated list of vector database solutions, libraries, and resources for AI applications - https://vectordb.works
The open framework for extensible & grounded AI agent orchestration.
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base
Broken RAG For The Broken Souls
Let your agent write code and execute code directly in the browser with WASM
Local-first AI assistant β 9 specialized agents (code, web, debug, securityβ¦), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your
The localhost AI Agent Runtime -- Chat UI, Tools, RAG, and MCP in one pip install
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate β built to help studen
Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API
High-Performance Tokenizer implementation in PHP.
