freshcrate — Search

Search results for "inference"

91 results found (Python)

ContextPilot 📁v0.4.1🌿 Growing⭐79

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

ai-agents context-api context-engineering hermes-agent inference-optimization openclaw prompt-engineering python by EfficientContext Python

lm-proxy 📁v3.2.2🌿 Growing⭐114

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

ai anthropic api-proxy fastapi google-ai language-models llm llm-api python by Nayjest Python

rasputin-memory 📁v0.9.1🌱 Seedling⭐17

The memory system your AI agent deserves. 4-stage hybrid retrieval — Vector + BM25 + Knowledge Graph + Neural Reranker — in <150ms. Self-hosted, $0/query, built for agents that need to actually rememb

agent-memory ai ai-memory bm25 embeddings falkordb hybrid-search inference python rag by jcartu Python

claude-code-plugins-plus-skills 📁v4.26.0🌳 Mature⭐1,995

423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.

agent-skills ai ai-agents anthropic automation claude-code claude-code-plugins developer-tools mcp python by jeremylongshore Python

mcp-memory-service 📁v10.39.1🌳 Mature⭐1,643

Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.

agent-memory agentic-ai ai-agents autogen claude crewai knowledge-graph langgraph python by doobidoo Python

cyllama 📁0.2.11🌱 Seedling⭐22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

agents cython cython-wrapper llama-cpp python python3 rag stable-diffusion-cpp whisper-cpp by shakfu Python

litellm 📁v1.83.7-stable🌳 Mature⭐42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

ai-gateway anthropic azure-openai bedrock gateway langchain litellm llm python by BerriAI Python

Constrained-Text-Generation-Studio 📁0.0.0🌿 Growing⭐216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

python by Hellisotherpeople Python

LLM-Agents-Ecosystem-Handbook 📁0.0.0🌳 Mature⭐508

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

ai ai-agent ai-agents fine-tuning finetuning-llms freamework llm llmops python by oxbshw Python

pinecone-python-client 📁v8.1.2🌿 Growing⭐432

The Pinecone Python client

python by pinecone-io Python

droid-llm-hunter 📁v1.0.0🌿 Growing⭐100

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

android python scanning-tool vulnerability-scanners by roomkangali Python

RAGLight 📁3.4.7🌳 Mature⭐658

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

agentic-ai agentic-rag agentic-workflow artificial-intelligence data-science framework huggingface lmstudio python by Bessouat40 Python

RAG-Anything 📁v1.2.10🏛️ Flagship⭐16,761

"RAG-Anything: All-in-One RAG Framework"

multi-modal-rag python retrieval-augmented-generation by HKUDS Python

openlit 📁openlit-1.18.1🌿 Growing⭐2,358

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers,

ai-observability amd-gpu clickhouse distributed-tracing genai gpu-monitoring grafana langchain python by openlit Python

GhostDesk 📁v7.1.0🌱 Seedling⭐39

Give any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm

agentic ai-agent automation autonomous-agent browser-automation claude computer-use docker python by YV17labs Python

RAGElo 📁0.4.0🌿 Growing⭐128

RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker

python by zetaalphavector Python

JRVS 📁0.0.0🌿 Growing⭐236

JRVS AI Agent with JARCORE autonomous coding engine - RAG knowledge base, web scraping, calendar, code generation. Powered by whatever local AI you choose.

python by Xthebuilder Python

aura 📁main@2026-04-21🌱 Seedling⭐47

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running loca

active-inference affective-computing apple-silicon artificial-consciousness autonomous-agent cognitive-architecture cognitive-science consciousness python by youngbryan97 Python

vllm 📁v0.19.1🌿 Growing⭐76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference python by vllm-project Python

monocle 📁v0.7.8🌿 Growing⭐72

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

generative-ai linux-foundation llm-agent llm-inference llms observability opentelemetry oss python by monocle2ai Python

Agentic-RAG-R1 📁0.0.0🌿 Growing⭐412

Agentic RAG R1 Framework via Reinforcement Learning

agentic grpo python rag rl by jiangxinke Python

vllm-mlx 📁v0.2.8🌿 Growing⭐798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

anthropic apple-silicon audio-processing claude-code computer-vision image-understanding inference llm python by waybarrios Python

arthur-engine 📁2.1.529🌿 Growing⭐75

Make AI work for everyone: monitoring and governance for your AI/ML

agentic benchmarking evaluation genai guardrails llm ml monitoring python by arthur-ai Python

Gito 📁v4.0.3🌿 Growing⭐210

An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issues—such as security vulnerabilities, bugs, and maintainability concerns.

ai ai-code-analysis ai-code-review ai-code-reviewer ai-coding ai-coding-assistant code-analysis code-audit python by Nayjest Python

DeepCode 📁v1.2.0🏛️ Flagship⭐15,244

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

agentic-coding llm-agent python by HKUDS Python

mcp 📁2026.04.20260414152327🌿 Growing⭐8,740

Official MCP Servers for AWS

aws mcp mcp-client mcp-clients mcp-host mcp-server mcp-servers mcp-tools python by awslabs Python

Anthropic-Cybersecurity-Skills 📁v1.2.0🌿 Growing⭐5,443

754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Cop

ai-agents claude-code cloud-security cybersecurity devsecops ethical-hacking incident-response infosec python by mukul975 Python

RIGEL 📁0.0.0🌱 Seedling⭐26

A Multi-Agentic AI Assistant/Builder

agentic-ai ai-assistant ai-framework chatbot dbus groq linux llm python by Zerone-Laboratories Python

server-nexe 📁v1.0.0-beta🌱 Seedling⭐9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.

ai apple-silicon embeddings fastapi llama-cpp llm local-ai mlx python vector-database by jgoy-labs Python

LLM-Agent-Paper-daily 📁main@2026-04-21🌱 Seedling⭐20

Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

llm llm-agent python by Lyz103 Python

awesome-code-agents 📁main@2026-04-20🌿 Growing⭐94

A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding — they're redefining how software changes the world.

python by EuniAI Python

orbit 📁v2.6.6🌿 Growing⭐250

One API for 20+ LLM providers, your databases, and your files — self-hosted, open-source AI gateway with RAG, voice, and guardrails.

ai-assistant ai-gateway ai-safety anthropic chatbot developer-tools elasticsearch llm python by schmitech Python

AGI-Alpha-Agent-v0 📁main@2026-04-18🌿 Growing⭐283

META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Execute ⚡

agentic agentic-ai agentic-framework ai aiagent aiagents llm meta-agentic python by MontrealAI Python

OmicsClaw 📁main@2026-04-18🌿 Growing⭐116

Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full research paper.

bioinformatics knowledge-graph llm-agent multi-agents multi-omics python single-cell spatial-transcriptomics by TianGzlab Python

ag2 📁v0.12.0🌿 Growing⭐4,383

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x

a2a ag2 agent-framework agentic agentic-ai ai ai-agents-framework aiagents python by ag2ai Python

sdk-python 📁v1.36.0🌿 Growing⭐5,602

A model-driven approach to building AI agents in just a few lines of code.

agentic agentic-ai agents ai anthropic autonomous-agents bedrock genai python by strands-agents Python

AReaL 📁v1.0.3🌿 Growing⭐5,017

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

agent llm llm-agent llm-reasoning machine-learning-systems mlsys python reinforcement-learning rl by inclusionAI Python

llmware 📁v0.4.6🌿 Growing⭐14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

agents generative-ai-tools llamacpp llm onnx openvino parsing python retrieval-augmented-generation by llmware-ai Python

agenticSeek 📁main@2026-04-11🌿 Growing⭐25,891

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993

agentic-ai agents ai autonomous-agents deepseek-r1 llm llm-agents python voice-assistant by Fosowl Python

EvoScientist 📁v0.0.7🌿 Growing⭐2,731

🔬 Harness Vibe Research with Self-evolving AI Scientists

ai-agent ai4science multi-agent-system python vibe-research by EvoScientist Python

UltraRAG 📁v0.3.0.2🌿 Growing⭐5,480

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

deepseek demo easy embedding flask gpt huggingface-transformers llm python by OpenBMB Python

qwe-qwe 📁v0.17.6🌱 Seedling⭐35

⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.

agent ai ai-agent python by deepfounder-ai Python

swing-trading-agent 📁0.0.0🌱 Seedling⭐7

Multi-agent swing trading system — automated screening, research, and execution with backtesting and live trading

ai-agent algorithmic-trading backtesting llm-agent multi-agent paper-trading python quantitative-finance stock-market by kevmyung Python

0xClaw 📁0.0.0🌱 Seedling⭐10

🦀 The first autonomous hackathon agent: stop assisting and start competing (🏆 Hackathon Champion Project).

agent-framework agentic-ai ai-agent autonomous autonomous-agents code-generation generative-ai hackathon llm-agent python by 0xclaw-ai Python

zettelforge 📁v2.4.0🌱 Seedling⭐25

Agentic memory for CTI in Python — STIX knowledge graphs, threat-actor alias resolution, offline-first RAG, MCP server for Claude Code and LangChain agents

agentic-memory ai-agent claude-code cti cybersecurity knowledge-graph langchain llm python by rolandpg Python

My_AI 📁v7.2.0🌱 Seedling⭐7

Local-first AI assistant — 9 specialized agents (code, web, debug, security…), 10M token vector memory, mobile relay via secure tunnel, real-time web search and document processing. Runs 100% on your

ai-assistant chatbot chromadb code-generation customtkinter document-processing gui local-llm python by gonicolas12 Python

Ultimate-Agent-Directory 📁0.0.0🌱 Seedling⭐51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

agent agentic agentic-ai agents boilerplate boilerplate-application boilerplate-template python by moshehbenavraham Python

llm_context_benchmarks 📁0.0.0🌱 Seedling⭐59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

ai benchmarking llms python by ivanfioravanti Python

apiclaw 📁v2.0.0🌱 Seedling⭐7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

ai-agents ai-tools api-platform claude llm mcp model-context-protocol python by nordsym Python

Open-Sable 📁v1.7.0🌱 Seedling⭐18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

agentic agentic-ai ai ai-assistant open-source python by IdeoaLabs Python

codexlens-search 📁v0.8.0🌱 Seedling⭐44

Lightweight semantic code search engine — 2-stage vector + FTS + RRF fusion + MCP server for Claude Code

python by catlog22 Python

vllm-cli 📁v0.2.5💤 Dormant⭐491

A command-line interface tool for serving LLM using vLLM.

llm llm-inference llm-tools python vllm by Chen-zexi Python

Somi 📁Mineralization🌱 Seedling⭐21

Local-first AI agent framework with GUI, memory, web search, personality constructs, speech i/o, tools, skills, CLI & Telegram features — fully self-hosted via Ollama.

ai-agents ai-framework arti automation cli gui homeb local python by Somi-Project Python

daiv 📁v2.0.0🌱 Seedling⭐18

Your AI-powered SWE teammate, built into your git workflow

ai-agent anthropic deepagents genai git github gitlab google-gemini-ai python by srtab Python

reina 📁v1.0.0🌱 Seedling⭐35

Autonomous AI agent for Crustocean, powered by Hermes Agent from Nous Research

ai-agent autonomous-agent chatbot crustocean hermes-agent llm nous-research python by Crustocean Python

LettuceDetect 📁0.1.8💤 Dormant⭐565

Lightweight hallucination detection framework for RAG applications

bert hallucination-detection hallucination-evaluation information-extraction nlp python pytorch token-classification by KRLabsOrg Python

robots 📁v0.3.8🌱 Seedling⭐44

Control robots and physical hardware with natural language through Strands Agents.

agentic agentic-ai ai genai gr00t lerobot machine-learning python by strands-labs Python

cloneme 📁0.0.0💤 Dormant⭐38

CloneMe is an advanced AI platform that builds your digital twin—an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

ai ai-assistant automation autonomous-agent chatbot conversational-ai developer-tools digital-twin python by vibheksoni Python

Compiler 📁v2🌱 Seedling⭐20

A tool that compiles messy natural language prompts into a structured intermediate representation (IR) and optionally sends them to LLMs like ChatGPT for cleaner, more reliable responses.

artificial-intelligence llms prompt prompt-engineering python by madara88645 Python

uniAI 📁0.0.0🌱 Seedling⭐1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen

ai chromadb django genai information-retrieval llm local-llm ollama python vector-database by git-pratap-shrey Python

evo-agents 📁master@2026-04-19🌱 Seedling⭐3

Complete Workspace Template for OpenClaw - Full agent lifecycle with unified memory system (Markdown + SQLite), self-evolution, RAG. Not for SubAgent/Skill use.

agent ai-memory bge-m3 chinese-nlp fts5 local-ai markdown memory-system python rag by luoboask Python

KAG 📁v0.8.0💤 Dormant⭐8,688

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base

knowledge-graph large-language-model logical-reasoning multi-hop-question-answering python trustfulness by OpenSPG Python

multi-agent-orchestration-framework 📁v0.1.0🌱 Seedling⭐26

Modular multi-agent orchestration framework powered by LangGraph and FastAPI.

agent ai-framework fastapi langchain langgraph llm memory multi-agent python by yx-fan Python

Grinta-Agent 📁main@2026-04-20🌱 Seedling⭐1

Local-first autonomous coding agent that plans, executes, validates, and finishes software tasks end-to-end.

ai-coding autonomous-agent coding-agent developer-tools fastapi llm local-first model-context-protocol python by josephsenior Python

Government-Citizen-Services-Voice-Agent 📁main@2026-04-15🌱 Seedling⭐1

Autonomous, multilingual AI voice agent using ElevenLabs, LangGraph, and RAG for government services

conversational-ai elevenlabs fastapi govtech langgraph python rag voice-agent by Automaticare Python

apache-tvm-ffi 📁0.1.10🌱 Seedling

tvm ffi

inference learning machine pypi by TVM FFI team Python

flashinfer-python 📁0.6.8.post1🌱 Seedling

FlashInfer: Kernel Library for LLM Serving

pypi by FlashInfer team Python

torchao 📁0.17.0🌱 Seedling

Package for applying ao techniques to GPU models

pypi by pypi Python

azure-ai-inference 📁1.0.0b9🌱 Seedling

Microsoft Azure AI Inference Client Library for Python

azure pypi sdk by Microsoft Corporation Python

roboflow 📁1.3.3🌱 Seedling

Official Python package for working with the Roboflow API

pypi by Roboflow Python

faster-whisper 📁1.2.1🌱 Seedling

Faster Whisper transcription with CTranslate2

ctranslate2 inference openai pypi quantization speech transformer whisper by Guillaume Klein Python

xgrammar 📁0.1.33🌱 Seedling

Efficient, Flexible and Portable Structured Generation

inference learning machine pypi by MLC Team Python

ctranslate2 📁4.7.1🌱 Seedling

Fast inference engine for Transformer models

cuda inference machine mkl neural nmt opennmt pypi translation by OpenNMT Python

cmdstanpy 📁1.3.0🌱 Seedling

Python interface to CmdStan

pypi by Stan Dev Team Python

genai-prices 📁0.0.57🌱 Seedling

Calculate prices for calling LLM inference APIs.

pypi by Samuel Colvin Python

timm 📁1.0.26🌱 Seedling

PyTorch Image Models

image-classification pypi pytorch by pypi Python

torchmetrics 📁1.9.0🌱 Seedling

PyTorch native Metrics

ai deep learning machine metrics pypi pytorch by Lightning-AI et al. Python

pinecone 8.1.2🌱 Seedling

Pinecone client and SDK

cloud database pinecone pypi vector by pypi Python

qdrant-client 📁1.17.1🌱 Seedling

Client library for the Qdrant vector search engine

client matching neural pypi search vector by Andrey Vasnetsov Python

tritonclient 2.67.0🌱 Seedling

Python client library and utilities for communicating with Triton Inference Server

client grpc http inference pypi server service tensorrt triton by NVIDIA Inc. Python

keras 📁3.14.0🌱 Seedling

Multi-backend Keras

pypi by pypi Python

blis 📁1.3.3🌱 Seedling

The Blis BLAS-like linear algebra library, as a self-contained C-extension.

pypi by Matthew Honnibal Python

cohere 📁6.1.0🌱 Seedling

No description

pypi by pypi Python

sagemaker 📁3.8.0🌱 Seedling

Open source library for training and deploying models on Amazon SageMaker.

ai amazon aws huggingface ml mxnet pypi pytorch tensorflow by Amazon Web Services Python

sglang 📁0.5.10.post1🌱 Seedling

SGLang is a fast serving framework for large language models and vision language models.

pypi by pypi Python

astroid 📁4.1.2🌱 Seedling

An abstract syntax tree for Python with inference support.

abstract analysis code pypi python static syntax tree by pypi Python

transformers 📁5.5.4🌱 Seedling

Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

deep-learning llm machine-learning nlp pypi python pytorch transformer vlm by The Hugging Face team (past and future) with the help of all our contributors (https://github.com/hu Python

mypy 📁1.20.2🌱 Seedling

Optional static typing for Python

pypi by pypi Python

setuptools-scm 📁10.0.5🌱 Seedling

the blessed package to manage your versions by scm tags

pypi by pypi Python

google-cloud-aiplatform 📁1.148.1🌱 Seedling

Vertex AI API client library

pypi by Google LLC Python

medicalAI 📁v1.2.9-rc⚰️ Archived⭐21

Medical-AI is an AI framework specifically for medical applications https://aibharata.github.io/medicalAI/

ai-framework keras medical-applications medical-imaging pdf-report prediction python tensorflow tensorflow2 by aibharata Python