freshcrate

Search results for "embeddings"

98 results found (Python)
restai📁v6.1.45🌿 Growing483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLMs supported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics, etc. Built-in image/audio generat

AutoRAG📁v0.3.22🌳 Mature4,712

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

RAGLight📁3.4.7🌳 Mature658

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

txtai📁v9.7.0🏛️ Flagship12,412

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

rasputin-memory📁v0.9.1🌱 Seedling17

The memory system your AI agent deserves. 4-stage hybrid retrieval — Vector + BM25 + Knowledge Graph + Neural Reranker — in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb

claude-code-plugins-plus-skills📁v4.26.0🌳 Mature1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

mcp-memory-service📁v10.39.1🌳 Mature1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

KohakuTerrarium📁v1.1.0🌿 Growing208

KohakuTerrarium is a general-purpose AI agent framework and batteries-included app for building, running, and composing self-contained agents and multi-agent teams, with built-in tools, sub-agents, pe

jcodemunch-mcp📁v1.70.0🌳 Mature1,523

The leading, most token-efficient MCP server for GitHub source code exploration via tree-sitter AST parsing

cyllama📁0.2.11🌱 Seedling22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

litellm📁v1.83.7-stable🌳 Mature42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

langchain📁langchain-core==1.3.0🌳 Mature133,178

The agent engineering platform

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

RAPTOR📁0.0.0🌱 Seedling13

RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis

memora📁v0.2.27🌿 Growing395

Give your AI agents persistent memory.

pinecone-python-client📁v8.1.2🌿 Growing432

The Pinecone Python client

LRAT📁0.0.0🌱 Seedling34

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

neurostack📁v0.11.1🌱 Seedling41

Your second brain, starting today. CLI + MCP server that helps you build, maintain, and search a knowledge vault that gets better every day. Works with any AI provider. Local-first, zero-prereq instal

memsearch📁v0.3.1🌿 Growing1,167

A Markdown-first memory system, a standalone library for any AI agent. Inspired by OpenClaw.

RAG-Anything📁v1.2.10🏛️ Flagship16,761

"RAG-Anything: All-in-One RAG Framework"

fast-plaid📁1.4.5🌿 Growing245

High-Performance Engine for Multi-Vector Search

cognithor📁v0.92.2🌿 Growing94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

cognita📁0.0.0🌳 Mature4,419

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

PythonClaw📁0.0.0🌱 Seedling22

OpenClaw reimagined in pure Python — autonomous AI agent with memory, RAG, skills, web dashboard, voice input, daemon, and multi-channel support.

LLM-API-Key-Proxy📁dev/build-20260301-1-b62f6e4🌿 Growing465

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

JRVS📁0.0.0🌿 Growing236

JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.

quickstart-streaming-agents📁master@2026-04-21🌿 Growing67

Build, deploy, and orchestrate event-driven agents natively on Apache Flink® and Apache Kafka®

honcho📁main@2026-04-21🌿 Growing2,030

Memory library for building stateful agents

Dragon-Brain📁v1.1.0🌱 Seedling43

Dragon Brain — persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur

AgenticX📁v0.3.7🌿 Growing105

AgenticX is a unified, production-ready multi-agent platform — Python SDK + CLI (agx) + Studio server + Machi desktop app. Features Meta-Agent orchestration, 15+ LLM providers, MCP Hub, hierarchical m

synaptic-memory📁v0.16.0🌱 Seedling25

Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

fabric-rti-mcp📁0.5.3🌿 Growing107

MCP server for Fabric Real-Time Intelligence (https://aka.ms/fabricrti) supporting tools for Eventhouse (https://aka.ms/eventhouse), Azure Data Explorer (https://aka.ms/adx), and other RTI services (co

DeepCode📁v1.2.0🏛️ Flagship15,244

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

PAI-RAG📁v0.4.3🌿 Growing455

An easy-to-use framework for modular RAG

arag📁v0.1.0🌿 Growing247

A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.

py-gpt📁v2.7.12🌳 Mature1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

babyagi📁v0.1.0🏛️ Flagship22,220

🤖 Transform internal knowledge retrieval with a secure, on-premise RAG-powered chatbot that enhances efficiency through natural language queries.

markdown-vault-mcp📁v1.27.0🌱 Seedling5

Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing

server-nexe📁v1.0.0-beta🌱 Seedling9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.

LLM-Agent-Paper-daily📁main@2026-04-21🌱 Seedling20

Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 Hours)

llama_index📁v0.14.21🌿 Growing48,501

LlamaIndex is the leading document agent and OCR platform

OpenOutreach📁main@2026-04-20🌿 Growing1,418

LinkedIn Automation Tool: Describe your product. Define your target market. The AI finds the leads for you.

AGI-Alpha-Agent-v0📁main@2026-04-18🌿 Growing283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

crewAI📁1.14.2🌿 Growing48,611

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

llmware📁v0.4.6🌿 Growing14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

LLM-Wiki📁main@2026-04-18🌱 Seedling7

Autonomous knowledge base plugin for Claude Code - captures research, ideas, and decisions into an interlinked wiki with research-on-miss, semantic search, and a Wikipedia-style web UI. Knowledge compou

vllm-mlx📁v0.2.8🌿 Growing798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

Corbell📁1.0.3🌿 Growing187

AI-powered spec generation and review using multi-repo code graph intelligence for backend teams that ship to production.

plamen📁main@2026-04-09🌿 Growing214

Autonomous Web3 security audit agent for Claude Code

mcp-gateway-registry📁v1.0.18🌿 Growing576

Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a

kuzu-memory📁v1.12.9🌱 Seedling22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

codexlens-search📁v0.8.0🌱 Seedling44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

Omnispindle📁v0.0.9🌱 Seedling9

A comprehensive MCP-based todo management system that serves as a central nervous system for Madness Interactive, a multi-project task coordination workshop.

qwe-qwe📁v0.17.6🌱 Seedling35

⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.

dory📁v0.1.0🌱 Seedling14

One memory layer for every AI agent. Local-first, markdown source of truth, and CLI/HTTP/MCP native. Your agent forgot who you are. Again. Dory fixes that.

arxiv-mcp-server📁0.0.0🌱 Seedling14

arXiv MCP Server Client 🐙 enables AI assistants to search, retrieve, analyze, and summarize arXiv papers with features like author/category browsing, trends, and citation insights.

locallens📁v0.0.3🌱 Seedling7

Search your files by talking to them - 100% offline

zettelforge📁v2.4.0🌱 Seedling25

Agentic memory for CTI in Python — STIX knowledge graphs, threat-actor alias resolution, offline-first RAG, MCP server for Claude Code and LangChain agents

AutoViralAI📁0.0.0🌱 Seedling11

Autonomous AI agent that researches viral content, generates posts, publishes them, measures engagement — and rewrites its own strategy based on what worked. Self-learning loop powered by LangGraph +

apiclaw📁v2.0.0🌱 Seedling7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

mnemos-mcp📁main@2026-04-21🌱 Seedling4

🧠 Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.

engram-memory-community📁main@2026-04-20🌱 Seedling6

The highest-scoring AI memory system ever benchmarked that isn't reliant on LLM reranking. And it's free & burns fewer tokens.

claude-skills-mcp📁v1.0.6🌱 Seedling378

MCP server for searching and retrieving Claude Agent Skills using vector search

Cognio📁main@2026-04-21🌱 Seedling2

🧠 Enhance AI conversations with Cognio, a persistent memory server that retains context and enables meaningful semantic search across sessions.

YouTubeGPT📁v3.3.1🌱 Seedling14

YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.

synapse-ai📁v1.0.0🌱 Seedling1

Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM — local or cloud.

nornweave📁v0.1.8🌱 Seedling10

An open-source, self-hosted API that turns standard email providers (Mailgun, SES, SendGrid) into "Inbox-as-a-Service" for AI Agents.

m3-memory📁v2026.4.20🌱 Seedling4

Local-first Agentic Memory Layer for MCP Agents • 25 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • 100% local

refrag📁main@2026-04-21🌱 Seedling1

🚀 Enhance retrieval with REFRAG, using micro-chunking and fast indexing for optimized RAG systems that improve efficiency and effectiveness.

Flipkart-Product-Recommender-RAG📁main@2026-04-21🌱 Seedling2

🛒 Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.

bigrag📁main@2026-04-20🌱 Seedling2

Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API

DOX📁main@2026-04-15🌱 Seedling1

Broken RAG For The Broken Souls

uniAI📁0.0.0🌱 Seedling1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

second-brain📁1.0🌱 Seedling461

Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera

MARM-Systems📁mcp-server💤 Dormant258

Turn AI into a persistent, memory-powered collaborator. Universal MCP Server (supports HTTP, STDIO, and WebSocket) enabling cross-platform AI memory, multi-agent coordination, and context sharing. Bui

TV-Show-Recommender-AI📁main@2026-04-21🌱 Seedling1

🤖 Recommend TV shows by matching favorites, averaging embeddings, and finding similar titles using fuzzy search and vector similarity.

Government-Citizen-Services-Voice-Agent📁main@2026-04-15🌱 Seedling1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

Agentic-AI-Pipeline📁v1.0.0💤 Dormant63

🦾 A production‑ready research outreach AI agent that plans, discovers, reasons, uses tools, auto‑builds cited briefings, and drafts tailored emails with tool‑chaining, memory, tests, and turnkey Dock

OpenTelemetry LlamaIndex instrumentation

OpenTelemetry crewAI instrumentation

flashinfer-python📁0.6.8.post1🌱 Seedling

FlashInfer: Kernel Library for LLM Serving

OpenTelemetry Cohere instrumentation

OpenTelemetry Anthropic instrumentation

azure-ai-inference📁1.0.0b9🌱 Seedling

Microsoft Azure AI Inference Client Library for Python

OpenTelemetry Langchain instrumentation

OpenTelemetry Vertex AI instrumentation

llama-index indices llama-cloud integration

chromadb📁1.5.8🌱 Seedling

Chroma.

pinecone📁8.1.2🌱 Seedling

Pinecone client and SDK

qdrant-client📁1.17.1🌱 Seedling

Client library for the Qdrant vector search engine

mistralai📁2.4.1🌱 Seedling

Python Client SDK for the Mistral AI API.

cohere📁6.1.0🌱 Seedling

No description

ai-news-scraper📁2.9.7💤 Dormant8

AI News Scraper & Semantic Search: A Python application that scrapes news articles, uses GenAI to generate summaries and identify topics, and provides semantic search capabilities through vector embed