freshcrate — Search

Search results for "rerank"

26 results found

new-api 📁v0.12.14🌳 Mature⭐26,168

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for pers

ai-gateway claude deepseek gemini go newapi openai rerankby QuantumNousGo

axonhub 📁v0.9.35🌳 Mature⭐3,013

⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.

agent agents ai anthropic anthropic-api api-gateway claude claude-code goby loopljGo

OmniRoute 📁v3.6.9🌳 Mature⭐2,435

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

typescriptby diegosouzapwTypeScript

litellm 📁v1.83.7-stable🌳 Mature⭐42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm pythonby BerriAIPython

WeKnora 📁v0.4.0🌳 Mature⭐13,819

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.

agent agentic ai chatbot chatbots embeddings evaluation generative-ai goby TencentGo

agentic-memory 📁0.0.0🌿 Growing⭐162

No description

by lhl

MODULAR-RAG-MCP-SERVER 📁0.0.0🌳 Mature⭐783

A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Using Skill to make AI follow each step of the spec and complete the code 100% by AI.

pythonby jerry-ai-devPython

memind 📁main@2026-04-21🌿 Growing⭐360

Self-evolving cognitive memory and context engine for AI agents in Java. Empowering 24/7 proactive agents like OpenClaw with understanding and SOTA performance.

ai ai-agent ai-agents ai-memory context-engineering java memory openclawby openmemindJava

ai-agents-from-zero 📁main@2026-04-20🌿 Growing⭐264

🚀 2026 最系统的 AI Agent 速成指南｜智能体实战教程 · 完整学习路径 + 实战项目 + 面试题库 · 对标大模型应用开发工程师岗位 · 覆盖LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / 提示词 · 企业级部署与微调 · 从0到企业级落地 + 从学习到上线项目 + 面试准备一体化

agent agent-framework ai-agent aigc coze dify gpt langchain pythonby didililiPython

project-golem 📁main@2026-04-18🌿 Growing⭐344

OS-level autonomous AI agent with long-term memory, multi-agent coordination, Titan Chronos scheduler & Moltbot Social Core

ai-agent autonomous-agents chatbot discord-bot golem javascript long-term-memory multi-agent nodejsby ArvincreatorJavaScript

Matryoshka 📁main@2026-04-18🌿 Growing⭐119

MCP server for token-efficient large document analysis via the use of REPL state

ai-assistant document-analysis llm llm-tools mcp mcp-server model-context-protocol typescriptby yogthosTypeScript

milvus 📁v2.6.15🌿 Growing⭐43,734

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

anns cloud-native diskann distributed embedding-database embedding-similarity embedding-store faiss goby milvus-ioGo

solon-ai 📁v3.10.2🌿 Growing⭐352

Java AI application development framework (supports LLM-tool,skill; RAG; MCP; Agent-ReAct,Team-Agent). Compatible with java8 ~ java25. It can also be embedded in SpringBoot, jFinal, Vert.x, Quarkus, a

ai chat deepseek embedding function-call java llm mcp-clientby opensolonJava

rag-chatbot 📁main@2026-04-14🌿 Growing⭐402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

chatbot chromadb gpu lamacpp llama3 llm python qwen3-5 ragby umbertogriffoPython

MakerAi 📁master@2026-04-11🌿 Growing⭐159

The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp

ai-agents claude delphi embeddings fpc free-pascal function-calling gemini pascalby gustavoeenriquezPascal

kuzu-memory 📁v1.12.9🌱 Seedling⭐22

Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.

pythonby bobmatnycPython

pinecone-ts-client 📁v7.2.0🌱 Seedling⭐269

The official TypeScript/Node client for the Pinecone vector database

llm pinecone semantic-search similarity-search typescript vector-databaseby pinecone-ioTypeScript

pinecone-python-client 📁v8.1.2🌱 Seedling⭐429

The Pinecone Python client

pythonby pinecone-ioPython

RAGLight 📁3.4.7🌱 Seedling⭐656

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

agentic-ai agentic-rag agentic-workflow artificial-intelligence data-science framework huggingface lmstudio pythonby Bessouat40Python

codexlens-search 📁v0.8.0🌱 Seedling⭐44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

pythonby catlog22Python

rex-cli 📁v0.17.0🌱 Seedling⭐27

Local-first AI agent bootstrap: Playwright Browser MCP + ContextDB for Codex CLI, Claude Code, Gemini CLI, and OpenCode.

ai-agent automation browser-automation claude-code cli codex-cli contextdb gemini-cli javascriptby rexleimoJavaScript

uniAI 📁0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

ai chromadb django genai information-retrieval llm local-llm ollama python vector-databaseby git-pratap-shreyPython

PAI-RAG 📁v0.4.3🌱 Seedling⭐450

An easy-to-use framework for modular RAG

pythonby aigc-appsPython

agentica 📁1.2.3🌱 Seedling⭐277

Agentica: Lightweight async-first Python framework for AI agents. 轻量级异步优先的AI Agent框架，支持工具调用、RAG、多智能体和MCP。

actionflow agent agentica agents langchain llm multi-agent python workflowsby shibing624Python

bigrag 📁main@2026-04-20🌱 Seedling⭐2

Self-hostable RAG platform - document ingestion, embedding, and vector search behind a simple REST API

ai embeddings python rag rag-pipeline vector-databaseby bigintPython

CoexistAI 📁v2.6💤 Dormant⭐464

CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi

agentic-ai fastapi github jupyter notebook langchain langgraph map mcp-server redditby SPTholeJupyter Notebook