Search results for "llama"
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropi
Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
OpenTelemetry Instrumentation for AI Observability
Give any AI agent a full desktop โ it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
LlamaIndex is the leading document agent and OCR platform
Create a plan from a description in minutes
Memory that remembers the story not just the facts. Three layer sentence graph for AI agents -> Facts, Episodes, raw Sentences. One DB. Zero config.
SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
Persistent memory and session intelligence for AI coding assistants. Auto-tracks mistakes, decisions, and context via hooks. Mines your full session history for patterns, predictions, and cross-sessio
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
OllamaFreeAPI: Free Distributed API for Ollama LLMs Public gateway to our managed Ollama servers with: - Zero-configuration access to 50+ models - Auto load-balanced across global nodes - Free tier w
Unified framework for building enterprise RAG pipelines with small, specialized models
The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
A Multi-Agentic AI Assistant/Builder
๐ 2026 ๆ็ณป็ป็ AI Agent ้ๆๆๅ๏ฝๆบ่ฝไฝๅฎๆๆ็จ ยท ๅฎๆดๅญฆไน ่ทฏๅพ + ๅฎๆ้กน็ฎ + ้ข่ฏ้ขๅบ ยท ๅฏนๆ ๅคงๆจกๅๅบ็จๅผๅๅทฅ็จๅธๅฒไฝ ยท ่ฆ็LangChain / LangGraph / Coze / Dify / MCP / skills / LLM / RAG / ๆ็คบ่ฏ ยท ไผไธ็บง้จ็ฝฒไธๅพฎ่ฐ ยท ไป0ๅฐไผไธ็บง่ฝๅฐ + ไปๅญฆไน ๅฐไธ็บฟ้กน็ฎ + ้ข่ฏๅๅคไธไฝๅ
One API for 20+ LLM providers, your databases, and your files โ self-hosted, open-source AI gateway with RAG, voice, and guardrails.
METAโAGENTIC ฮฑโAGI ๐๏ธโจ โ Mission ๐ฏ Endโtoโend: Identify ๐ โ OutโLearn ๐ โ OutโThink ๐ง โ OutโDesign ๐จ โ OutโStrategise โ๏ธ โ OutโExecute โก
A high-throughput and memory-efficient inference and serving engine for LLMs
A model-driven approach to building AI agents in just a few lines of code.
Open-Source Intelligent Command Layer
Advanced AI Real Estate Assistant using RAG, LLMs, and Python. Features market analysis, property valuation, and intelligent search.
MCP server for OpenAI's Deep Research APIs, Gemini Deep Research Agent, and Hugging Face's Open Deep Research
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. ๐ Official updates only via twitter @Martin993
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
๐งญ PromptDrifter โ oneโcommand CI guardrail that catches prompt drift and fails the build when your LLM answers change.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
structured outputs for llms
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPIโuse as library or standalone service.
The production runtime for AI agents. Schema in, API out. Built on PydanticAI + FastAPI.
Droid LLM Hunter is a tool to scan for vulnerabilities in Android applications using Large Language Models (LLMs).
๐ง Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connec
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
Structured Outputs
Watchtower is a simple AI-powered penetration testing automation CLI tool that leverages LLMs and LangGraph to orchestrate agentic workflows that you can use to test your websites locally. Generate us
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate โ built to help studen
Autonomous Offensive Security Intelligence AI-powered multi-agent penetration testing
CloneMe is an advanced AI platform that builds your digital twinโan AI that chats like you, remembers details, and supports multiple platforms. Customizable, memory-driven, and hot-reloadable, it's th
A local LLM-based autonomous agent orchestration platform featuring async background tasks, context-isolated sub-agents, dynamic knowledge injection, and strict security approval gates (Plan Mode).
Deploy a local, multi-user RAG system to query PDF and DOCX documents using a local LLM without cloud or API dependencies.
๐ Build a leading-edge e-commerce recommendation system using RAG architecture, Groq Llama 3, LangChain, and AstraDB, deployed on Kubernetes for scalability.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
๐ท Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.
Build multi-organization LLM chat platforms with model routing, tool execution, usage analytics, and OpenAI-compatible APIs.
๐ค Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.
๐ง Enhance visual search with Mini-o3, providing state-of-the-art multi-turn reasoning and easy-to-use training code for advanced AI applications.
Automate binary analysis by coordinating LLM agents with Ghidra, enabling scalable and precise reverse engineering workflows.
Lightweight hallucination detection framework for RAG applications
