Search results for "multimodal"
The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Ultra-Lightweight, Pure Python Multimodal Agent.
PraisonAI: Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
LLM API load-balancing gateway.
Framework for AI Backend. Build and run AI agents like microservices - scalable, observable, and identity-aware from day one.
AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On, supports Ch
AgentWard: Built for all, hardened for OpenClaw.
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
EdgeCrab: A Super Powerful Personal Assistant inspired by NousHermes and OpenClaw. Rust-native, blazing-fast terminal UI, ReAct tool loop, multi-provider LLM support, ACP protocol, gateway adapters
An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-s
A simple Zotero plugin that brings your own LLM into the side panel.
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
Uni is a modern, embedded database that combines property graph (OpenCypher), vector search, and columnar storage (Lance) into a single, cohesive engine. It is designed for applications requiring loca
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
Knowledge Engine for AI Agent Memory in 6 lines of code
Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack.
Autonomous Agents (LLMs) research papers. Updated Daily.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, m
Curated systems, benchmarks, papers, etc. on memory for LLMs/MLLMs: long-term context, retrieval, and reasoning.
The ultimate space for work and life: to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level, enabling multi-agent collaboration, effo
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a c
This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers
This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.
A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules, running loca
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 hours)
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.
One API for 20+ LLM providers, your databases, and your files: self-hosted, open-source AI gateway with RAG, voice, and guardrails.
Framework for AI agents to build and maintain an Obsidian wiki using Karpathy's LLM Wiki pattern
Self-hosted personal AI agent that lives in your DMs. Describe any workflow: triage Gmail, pull a Giphy feed, build a Slack bot, monitor markets. It writes the code, runs it, schedules it, and saves i
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o
The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp
The video search layer for AI agents. Search video by meaning, across speech, visuals, and on-screen text.
Must-read papers on Repository-level Code Generation & Issue Resolution
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
The LLM Evaluation Framework
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
The Go client for Chroma vector database
An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare an
Integrate cutting-edge LLM technology quickly and easily into your apps
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Open-source framework for conversational voice AI agents
AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.
High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+ vector databases, and agent orchestration, all f
A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! Fully equipped with long-term memory, personas, and tool integration: a next-generation autonomous AI agent Discord bot!
The AI-Native Search Database. Unifies vector, text, structured and semi-structured data in a single engine, enabling hybrid search and in-database AI workflows.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
"RAG-Anything: All-in-One RAG Framework"
CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
We gave AI agents a brain. Memory, planning, continuity, and self-repair: the missing cognitive architecture layer. Runs on your Mac.
Open-source AI browser agent for Chrome and Firefox
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate, built to help studen
ZimaOS Blue - A Local-First Agent Runtime for Bold Builders. Out-of-the-Box, Open-Source, Universal, Vendor-Neutral
Install your own AI DJ Being. She searches, downloads, listens, mixes, and generates music, autonomously. 30hrs for $0.04.
Cloud native, ultra-high performance AI&API gateway, LLM API management, distribution system, open platform, supporting all AI APIs, not limited to OpenAI, Azure,
Analyze financial data effortlessly with FinRobot, an open-source AI agent platform powered by large language models for insightful decision-making.
FlexRAG: A RAG Framework for Information Retrieval and Generation.
