Search results for "lama"
The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLMs supported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generat
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Seth's AI Tools: A Unity-based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
Build Agentic AI solutions on AWS, using the latest OSS agentic frameworks.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Portable multi-agent AI developer setup for Claude Code + Ollama. Role-based local LLM orchestration via Bash: plan, code, review, commit. Zero Dependency. Works with any language stack.
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
OpenTelemetry Instrumentation for AI Observability
LlamaIndex is the leading document agent and OCR platform
Create a plan from a description in minutes
Memory that remembers the story, not just the facts. Three-layer sentence graph for AI agents: Facts, Episodes, raw Sentences. One DB. Zero config.
Persistent memory and session intelligence for AI coding assistants. Auto-tracks mistakes, decisions, and context via hooks. Mines your full session history for patterns, predictions, and cross-sessio
Agents-flex is a lightweight Java AI application development framework.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
OllamaFreeAPI: Free Distributed API for Ollama LLMs. Public gateway to our managed Ollama servers with: Zero-configuration access to 50+ models; Auto load-balanced across global nodes; Free tier w
Unified framework for building enterprise RAG pipelines with small, specialized models
The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling customer interactions.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
PromptDrifter: one-command CI guardrail that catches prompt drift and fails the build when your LLM answers change.
Transform documentation chaos into a structured memory system with Mnemos, your self-hosted, multi-context knowledge server for developers.
Automate Codex CLI tasks using OpenClaw to write prompts, approve commands, check results, and interact via terminal or Telegram.
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Local AI anywhere, for everyone: LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate, built to help studen
A local LLM-based autonomous agent orchestration platform featuring async background tasks, context-isolated sub-agents, dynamic knowledge injection, and strict security approval gates (Plan Mode).
Enable secure, read-only SSH access for LLM agents to audit servers, run diagnostics, and inspect logs without risking data changes.
No description
Glassmorphic web interface for Hermes Agent, your self-hosted AI assistant
Lean Rust AI agent: 6MB binary, 7.9MB RAM. OpenClaw replacement. Telegram + Discord + GitHub auto-PR. Ollama/Anthropic support.
Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.
Simplify tool calls for any LLM with AnyToolCall, an OpenAI-compatible middleware that bypasses native constraints through prompt injection.
Build an enterprise-ready RAG system to enhance technical documentation querying with LangGraph and multi-step reasoning workflows.
Enable local document ingestion and retrieval-augmented generation with a secure, .NET-based pipeline that keeps data on your machine.
Build multi-organization LLM chat platforms with model routing, tool execution, usage analytics, and OpenAI-compatible APIs.
Power advanced AI to create films using text, images, audio, and video inputs with a flexible quad-modal filmmaking engine.
BRUNELLA AGENT SYSTEM (BAS): THE DIGITAL ORGANIZATION OF THE FUTURE
Explore and utilize top open-source tools for running, fine-tuning, and building LLMs entirely locally, without cloud dependencies or API keys.
Build intelligent, offline LLM agents with LangGraph and llama-cpp-python using this starter template for local, private tool-calling applications.
Enhance visual search with Mini-o3, providing state-of-the-art multi-turn reasoning and easy-to-use training code for advanced AI applications.
No description
Automate binary analysis by coordinating LLM agents with Ghidra, enabling scalable and precise reverse engineering workflows.
Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.
LSP server leveraging LLMs for code completion (and more?)
