Search results for "video"
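Screenplay storyboard agent (PenShot): screenplay → storyboard → clips → prompt | Built on LangGraph + LLM; automatically parses screenplays in any format and generates coherent text-to-video prompts usable with models such as Sora/Veo/Runway. Keeps characters and plot consistent across clips; supports MCP/REST API/function calling | Python library + A2A integration. (LLM-powered screenplay-to-video-prompt a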
Seedance 2.0 Shot Design Skills
Open-source framework for conversational voice AI agents
Open-source video generation workbench driven by AI agents: novel → character/scene/prop design → screenplay → storyboard images → video, with consistent characters and scenes across shots | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
PraisonAI - Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
423 plugins, 2,849 skills, 177 agents for Claude Code. Open-source marketplace at tonsofskills.com with the ccpi CLI package manager.
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
A little course on Reinforcement Learning Environments for evaluating and training Language Models
CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
An open-source AI assistant framework with skills and agent architecture
The Official Model Context Protocol (MCP) server for Kagi search & other tools.
MCP Server for Computer Use in Windows
Exposes internet search tools for use by LLM-backed Assist in Home Assistant
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Tool that just makes your open source project better using LLM agents
AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.
"RAG-Anything: All-in-One RAG Framework"
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
MCP server to manage Facebook and Instagram Ads (Meta Ads)
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
MCP server and Claude plugin for Postgres skills and documentation. Helps AI coding tools generate better PostgreSQL code.
AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting
Multi-agent memory consistency platform. We're hiring contributors; check HIRING.md
Crawl4AI MCP Server: Extract content from web pages, PDFs, Office docs, YouTube videos with AI-powered summarization. 17 tools, token reduction, production-ready.
Hands-on workshop: Build a multi-agent AI system from scratch - Deep Research Agent + Writing Workflow served as MCP servers. Includes code, slides, and video (coming soon)
Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Connect AI models like Claude & GPT with robots using MCP and ROS.
A SEC EDGAR MCP (Model Context Protocol) Server
Video editing MCP server for AI agents. 83 tools, 858 tests collected, 3 interfaces. Works with Claude Code, Cursor, and any MCP client. Local, fast, free.
Memory library for building stateful agents
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
Automatically Update LLM-Agent Papers Daily using GitHub Actions (Update Every 12 Hours)
META-AGENTIC α-AGI - Mission: End-to-end: Identify → Out-Learn → Out-Think → Out-Design → Out-Strategise → Out-Execute
Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.
Unified framework for building enterprise RAG pipelines with small, specialized models
Agent Zero AI framework
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
A Claude Code skill that turns your Obsidian vault into a living second brain - autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Learn it. Build it. Ship it for others.
Claude Code skills, architectural principles, and alternative approaches for AI-assisted development
The first autonomous hackathon agent: stop assisting and start competing (Hackathon Champion Project).
Agent memory and conflict detection platform. We're hiring contributors; check HIRING.md
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
Self-evolving deep research system for Claude Code. Zero API keys.
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM, local or cloud.
Control robots and physical hardware with natural language through Strands Agents.
Computer Environments Elicit General Agentic Intelligence in LLMs
Enable local LLMs with real-time Google search, live feeds, OCR, and video insights using noapi-google-search-mcp server tools.
AI-native SaaS framework that builds full-stack apps using autonomous AI agents
AI co-pilot for ComfyUI - 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
FlexRAG: A RAG Framework for Information Retrieval and Generation.
Equip AI agents with internet access to gather real-time data from restricted or hard-to-reach online sources.
Remove watermarks from OpenAI Sora 2 videos using precise spectral analysis to keep video quality intact and watermark-free.
Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.
Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.
Build an enterprise-ready RAG system to enhance technical documentation querying with LangGraph and multi-step reasoning workflows.
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
A framework for elegantly configuring complex applications
Simple and rapid application development framework, built on top of Flask. Includes detailed security, auto CRUD generation for your models, Google Charts and much more.
Google Ai Generativelanguage API client library
SGLang is a fast serving framework for large language models and vision language models.
tox is a generic virtualenv management and test command line tool
Microsoft Azure Blob Storage Client Library for Python
Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Fast, Extensible Progress Meter
Render rich text, tables, progress bars, syntax highlighting, markdown and more to the terminal
