freshcrate

Search results for "gpt-oss"

Clear filters
18 results found (Python)
sglang📁0.5.10.post1🏛️ Flagship26,220

SGLang is a fast serving framework for large language models and vision language models.

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

cognithor📁v0.92.3🌿 Growing115

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

mcp-client-for-ollama📁v0.28.0🌳 Mature655

A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features include agent mode, multi-server, model switching, streaming responses, tool management, human-in-the-l

jarvis📁v1.28.0🌿 Growing300

Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects

vmlx📁v1.3.34🌿 Growing348

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

LRAT📁0.0.0🌱 Seedling39

The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.

LLM-API-Key-Proxy📁dev/build-20260301-1-b62f6e4🌿 Growing465

Universal LLM Gateway: One API, every LLM. OpenAI/Anthropic-compatible endpoints with multi-provider translation and intelligent load-balancing.

CASSIA📁v1.3.1🌿 Growing89

CASSIA: A Multi-Agent LLM-Based Single-Cell Cell Type Annotation Framework

py-gpt📁v2.7.12🌳 Mature1,738

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

ollamafreeapi📁main@2026-04-15🌿 Growing172

OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w

rag-chatbot📁main@2026-04-14🌿 Growing407

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

llm_context_benchmarks📁0.0.0🌱 Seedling59

📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimiz

server-nexe📁v1.0.2-beta🌱 Seedling9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.

vllm-cli📁v0.2.5💤 Dormant491

A command-line interface tool for serving LLM using vLLM.

rag-agent📁master@2026-04-21🌱 Seedling7

Python LLM-RAG deep agent using LangChain, LangGraph and LangSmith built on Quart web microframework and served using Hypercorn ASGI and WSGI web server.