freshcrate

Search results for "rerank"

26 results found
new-api📁v0.12.14🌳 Mature26,168

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for pers

axonhub📁v0.9.35🌳 Mature3,013

⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.

OmniRoute📁v3.6.9🌳 Mature2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

litellm📁v1.83.7-stable🌳 Mature42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

WeKnora📁v0.4.0🌳 Mature13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agentic-memory📁0.0.0🌿 Growing162

No description

by lhl
MODULAR-RAG-MCP-SERVER📁0.0.0🌳 Mature783

A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.

memind📁main@2026-04-21🌿 Growing360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

ai-agents-from-zero📁main@2026-04-20🌿 Growing264

🚀 2026 最系统的 AI Agent 速成指南|智能体实战教程 · 完整学习路径 + 实战项目 + 面试题库 · 对标大模型应用开发工程师岗位 · 覆盖LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / 提示词 · 企业级部署与微调 · 从0到企业级落地 + 从学习到上线项目 + 面试准备一体化

project-golem📁main@2026-04-18🌿 Growing344

OS-level autonomous AI agent with long-term memory, multi-agent coordination, Titan Chronos scheduler & Moltbot Social Core

Matryoshka📁main@2026-04-18🌿 Growing119

MCP server for token-efficient large document analysis via the use of REPL state

milvus📁v2.6.15🌿 Growing43,734

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

solon-ai📁v3.10.2🌿 Growing352

Java AI application development framework (supports LLM-tool,skill; RAG; MCP; Agent-ReAct,Team-Agent). Compatible with java8 ~ java25. It can also be embedded in SpringBoot, jFinal, Vert.x, Quarkus, a

rag-chatbot📁main@2026-04-14🌿 Growing402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

MakerAi📁master@2026-04-11🌿 Growing159

The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp

kuzu-memory📁v1.12.9🌱 Seedling22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

pinecone-ts-client📁v7.2.0🌱 Seedling269

The official TypeScript/Node client for the Pinecone vector database

pinecone-python-client📁v8.1.2🌱 Seedling429

The Pinecone Python client

RAGLight📁3.4.7🌱 Seedling656

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

codexlens-search📁v0.8.0🌱 Seedling44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

rex-cli📁v0.17.0🌱 Seedling27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

uniAI📁0.0.0🌱 Seedling1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

PAI-RAG📁v0.4.3🌱 Seedling450

An easy-to-use framework for modular RAG

agentica📁1.2.3🌱 Seedling277

Agentica: Lightweight async-first Python framework for AI agents. 轻量级异步优先的AI Agent框架,支持工具调用、RAG、多智能体和MCP。

bigrag📁main@2026-04-20🌱 Seedling2

Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API

CoexistAI📁v2.6💤 Dormant464

CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi