freshcrate

Search results for "gemma"

42 results found
flashinfer-python📁0.6.8.post1🏛️ Flagship5,467

FlashInfer: Kernel Library for LLM Serving

xgrammar📁0.1.33🌳 Mature1,637

Efficient, Flexible and Portable Structured Generation

ctranslate2📁4.7.1🌳 Mature4,444

Fast inference engine for Transformer models

chinese-llm-benchmark📁v5.10🏛️ Flagship5,889

ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括359个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3-max、qwen3.5-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.5、ernie4.5、Min

llamafarm📁v0.0.31🌳 Mature819

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

ollama📁v0.21.0🏛️ Flagship169,635

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

llama.cpp📁b8871🏛️ Flagship105,537

LLM inference in C/C++

GENesis-AGI📁v3.0a8🌱 Seedling22

Autonomous AI agent with persistent memory, self-learning, and earned autonomy. Cognitive partner that remembers, learns, and evolves.

qwe-qwe📁v0.17.6🌱 Seedling35

⚡ Lightweight offline AI agent for local models. No cloud, no API keys — just your GPU.

osaurus📁0.17.0🏛️ Flagship5,082

Own your AI. The native macOS harness for AI agents -- any model, persistent memory, autonomous execution, cryptographic identity. Built in Swift. Fully offline. Open source.

Joanium📁v2026.421.1🌱 Seedling23

Your smart, reliable, and friendly personal AI assistant.

OmniRoute📁v3.6.9🌳 Mature3,250

OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for

edgequake📁v0.10.12🌳 Mature1,915

EdegQuake 🌋 High-performance GraphRAG inspired from LightRag written in Rust; Transform documents into intelligent knowledge graphs for superior retrieval and generation

vllm📁v0.19.1🏛️ Flagship77,587

A high-throughput and memory-efficient inference and serving engine for LLMs

AReaL📁v1.0.3🏛️ Flagship5,075

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

vllm-mlx📁v0.2.8🌳 Mature917

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

AgenticGoKit📁v0.5.9🌿 Growing142

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o

vmlx📁v1.3.34🌿 Growing348

vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers MLX Studio. Image gen/edit, OpenAI/Anth

UGTLive📁0.0.0🌿 Growing75

An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)

LocalAI📁v4.1.3🏛️ Flagship45,672

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

jan📁v0.7.9🏛️ Flagship42,053

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

txtai📁v9.7.0🏛️ Flagship12,412

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

houtini-lm📁v2.8.0🌿 Growing71

MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.

cactus📁0.0.0🌿 Growing50

LLM Agent that leverages cheminformatics tools to provide informed responses.

cyllama📁0.2.11🌱 Seedling25

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

tsunami📁main@2026-04-21🌱 Seedling16

autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.

awesome-opensource-ai📁main@2026-04-20🌿 Growing2,849

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.

agentic-chatops📁main@2026-04-20🌿 Growing100

3-tier agentic ChatOps (n8n + GPT-4o + Claude Code) implementing all 21 patterns from "Agentic Design Patterns" — solo operator managing 137 devices

awesome-ai-tools📁main@2026-04-19🌿 Growing390

🔴 VERY LARGE AI TOOL LIST! 🔴 Curated list of AI Tools - Updated 2026

ollamafreeapi📁main@2026-04-15🌿 Growing172

OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w

TomoriBot📁v0.7.904🌱 Seedling34

A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! | 長期記憶やペルソナ、ツール連携を完備。 次世代の「自律型AIエージェント」Discordボット!

memory_agent_hub📁main@2026-04-20🌱 Seedling40

2026 swarm Agent 年,swarm Agent 、Agent team、 ai coding、skill、memory、evolve、agentic RL 等 AI Agent集合

openclaw.net📁0.0.0🌱 Seedling275

Self-hosted OpenClaw gateway + agent runtime in .NET (NativeAOT-friendly)

toolbridge📁v2.0.0🌱 Seedling76

Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native support. Use local or cloud models with full agent capabilities.

Ultimate-Agent-Directory📁0.0.0🌱 Seedling51

🤖 The most comprehensive directory of AI agent frameworks, platforms, tools, and resources - hundreds of curated entries covering open-source, no-code, enterprise, and autonomous solutions. NEW Boil

server-nexe📁v1.0.2-beta🌱 Seedling9

Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine — zero data sent to external services.

CoexistAI📁v2.6💤 Dormant470

CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi

apiclaw📁v2.0.0🌱 Seedling7

The API layer for AI agents. Dashboard + 22K APIs + 18 Direct Call providers. MCP native.

vikramaditya📁main@2026-04-20🌱 Seedling5

Autonomous VAPT platform. Give it a target (FQDN, IP, CIDR) — it hunts, it reports. Inspired by the Obsidian Order.

OllamaRAG📁main@2026-04-21🌱 Seedling5

🤖 Build a smart AI assistant that learns from any website using a Retrieval-Augmented Generation framework with local models powered by Ollama.

PromptDrifter📁main@2026-04-19🌱 Seedling8

🧭 PromptDrifter – one‑command CI guardrail that catches prompt drift and fails the build when your LLM answers change.