freshcrate

Search results for "llama"

Clear filters
59 results found (Python)
npcpy๐Ÿ“v1.4.21๐ŸŒณ Matureโญ1,287

The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

opik๐Ÿ“2.0.6๐ŸŒณ Matureโญ18,767

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

restai๐Ÿ“v6.1.45๐ŸŒฟ Growingโญ483

RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat

cyllama๐Ÿ“0.2.11๐ŸŒฑ Seedlingโญ22

A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp

litellm๐Ÿ“v1.83.7-stable๐ŸŒณ Matureโญ42,951

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi

Constrained-Text-Generation-Studio๐Ÿ“0.0.0๐ŸŒฟ Growingโญ216

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

llamafarm๐Ÿ“v0.0.31๐ŸŒฟ Growingโญ825

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

openinference๐Ÿ“python-openinference-instrumentation-google-genai-v0.1.15๐ŸŒฟ Growingโญ913

OpenTelemetry Instrumentation for AI Observability

GhostDesk๐Ÿ“v7.1.0๐ŸŒฑ Seedlingโญ39

Give any AI agent a full desktop โ€” it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm

cognithor๐Ÿ“v0.92.2๐ŸŒฟ Growingโญ94

Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us

llama_index๐Ÿ“v0.14.21๐ŸŒฟ Growingโญ48,501

LlamaIndex is the leading document agent and OCR platform

PlanExe๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ365

Create a plan from a description in minutes

vektori๐Ÿ“main@2026-04-19๐ŸŒฟ Growingโญ72

Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.

SmarterRouter๐Ÿ“2.2.5๐ŸŒฟ Growingโญ105

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

claude-engram๐Ÿ“main@2026-04-17๐ŸŒฑ Seedlingโญ13

Persistent memory and session intelligence for AI coding assistants. Auto-tracks mistakes, decisions, and context via hooks. Mines your full session history for patterns, predictions, and cross-sessio

AutoGPT๐Ÿ“autogpt-platform-beta-v0.6.56๐ŸŒฟ Growingโญ183,319

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

ollamafreeapi๐Ÿ“main@2026-04-15๐ŸŒฟ Growingโญ144

OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w

llmware๐Ÿ“v0.4.6๐ŸŒฟ Growingโญ14,857

Unified framework for building enterprise RAG pipelines with small, specialized models

parlant๐Ÿ“v3.3.1๐ŸŒฟ Growingโญ17,899

The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.

rag-chatbot๐Ÿ“main@2026-04-14๐ŸŒฟ Growingโญ402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

vllm-mlx๐Ÿ“v0.2.8๐ŸŒฟ Growingโญ798

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac

RIGEL๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ26

A Multi-Agentic AI Assistant/Builder

ai-agents-from-zero๐Ÿ“main@2026-04-20๐ŸŒฟ Growingโญ264

๐Ÿš€ 2026 ๆœ€็ณป็ปŸ็š„ AI Agent ้€ŸๆˆๆŒ‡ๅ—๏ฝœๆ™บ่ƒฝไฝ“ๅฎžๆˆ˜ๆ•™็จ‹ ยท ๅฎŒๆ•ดๅญฆไน ่ทฏๅพ„ + ๅฎžๆˆ˜้กน็›ฎ + ้ข่ฏ•้ข˜ๅบ“ ยท ๅฏนๆ ‡ๅคงๆจกๅž‹ๅบ”็”จๅผ€ๅ‘ๅทฅ็จ‹ๅธˆๅฒ—ไฝ ยท ่ฆ†็›–LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธš็บง้ƒจ็ฝฒไธŽๅพฎ่ฐƒ ยท ไปŽ0ๅˆฐไผไธš็บง่ฝๅœฐ + ไปŽๅญฆไน ๅˆฐไธŠ็บฟ้กน็›ฎ + ้ข่ฏ•ๅ‡†ๅค‡ไธ€ไฝ“ๅŒ–

orbit๐Ÿ“v2.6.6๐ŸŒฟ Growingโญ250

One API for 20+ LLM providers, your databases, and your files โ€” self-hosted, open-source AI gateway with RAG, voice, and guardrails.

AGI-Alpha-Agent-v0๐Ÿ“main@2026-04-18๐ŸŒฟ Growingโญ283

METAโ€‘AGENTIC ฮฑโ€‘AGI ๐Ÿ‘๏ธโœจ โ€” Mission ๐ŸŽฏ Endโ€‘toโ€‘end: Identify ๐Ÿ” โ†’ Outโ€‘Learn ๐Ÿ“š โ†’ Outโ€‘Think ๐Ÿง  โ†’ Outโ€‘Design ๐ŸŽจ โ†’ Outโ€‘Strategise โ™Ÿ๏ธ โ†’ Outโ€‘Execute โšก

vllm๐Ÿ“v0.19.1๐ŸŒฟ Growingโญ76,155

A high-throughput and memory-efficient inference and serving engine for LLMs

sdk-python๐Ÿ“v1.36.0๐ŸŒฟ Growingโญ5,602

A model-driven approach to building AI agents in just a few lines of code.

ai-real-estate-assistant๐Ÿ“dev@2026-04-13๐ŸŒฟ Growingโญ159

Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.

deep-research-mcp๐Ÿ“main@2026-04-13๐ŸŒฟ Growingโญ58

MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research

agenticSeek๐Ÿ“main@2026-04-11๐ŸŒฟ Growingโญ25,891

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. ๐Ÿ”” Official updates only via twitter @Martin993

Open-Sable๐Ÿ“v1.7.0๐ŸŒฑ Seedlingโญ18

Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int

PromptDrifter๐Ÿ“main@2026-04-19๐ŸŒฑ Seedlingโญ8

๐Ÿงญ PromptDrifter โ€“ oneโ€‘command CI guardrail that catches prompt drift and fails the build when your LLM answers change.

AutoRAG๐Ÿ“v0.3.22๐ŸŒฑ Seedlingโญ4,693

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

instructor๐Ÿ“v1.15.1๐ŸŒฑ Seedlingโญ12,743

structured outputs for llms

lm-proxy๐Ÿ“v3.2.2๐ŸŒฑ Seedlingโญ111

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPIโ€”use as library or standalone service.

agent2๐Ÿ“v0.1.0๐ŸŒฑ Seedlingโญ25

The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.

droid-llm-hunter๐Ÿ“v1.0.0๐ŸŒฑ Seedlingโญ95

Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).

mnemos-mcp๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ4

๐Ÿง  Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.

RAGLight๐Ÿ“3.4.7๐ŸŒฑ Seedlingโญ656

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec

YouTubeGPT๐Ÿ“v3.3.1๐ŸŒฑ Seedlingโญ14

YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.

edsl๐Ÿ“wasm-wheel๐ŸŒฑ Seedlingโญ454

Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.

py-gpt๐Ÿ“v2.7.12๐ŸŒฑ Seedlingโญ1,724

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe

watchtower๐Ÿ“1.0.2๐ŸŒฑ Seedlingโญ51

Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us

uniAI๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ1

Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate โ€” built to help studen

Phantom๐Ÿ“v0.8.0๐ŸŒฑ Seedlingโญ107

Autonomous Offensive Security Intelligence AI-powered multi-agent penetration testing

cloneme๐Ÿ“0.0.0๐Ÿ’ค Dormantโญ38

CloneMe is an advanced AI platform that builds your digital twinโ€”an AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th

MOP๐Ÿ“0.0.0๐ŸŒฑ Seedlingโญ1

A local LLM-based autonomous agent orchestration platform featuring async background tasks, context-isolated sub-agents, dynamic knowledge injection, and strict security approval gates (Plan Mode).

local-rag-server๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ2

Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.

Flipkart-Product-Recommender-RAG๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ2

๐Ÿ›’ Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.

gptme๐Ÿ“v0.31.0๐ŸŒฑ Seedlingโญ4,266

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

AnyCam2Ros๐Ÿ“master@2026-04-21๐ŸŒฑ Seedlingโญ1

๐Ÿ“ท Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.

asya-chat-ui๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

Build multi-organization LLM chat platforms with model routing, tool execution, usage analytics, and OpenAI-compatible APIs.

langgraph-llama-cpp-starter๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

๐Ÿค– Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.

Mini-o3๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

๐Ÿง  Enhance visual search with Mini-o3, providing state-of-the-art multi-turn reasoning and easy-to-use training code for advanced AI applications.

auto-re-agent๐Ÿ“main@2026-04-21๐ŸŒฑ Seedlingโญ1

Automate binary analysis by coordinating LLM agents with Ghidra, enabling scalable and precise reverse engineering workflows.

LettuceDetect๐Ÿ“0.1.8๐Ÿ’ค Dormantโญ545

Lightweight hallucination detection framework for RAG applications

replicate-python๐Ÿ“1.0.7๐Ÿ’ค Dormantโญ900

Python client for Replicate