Search results for "video"
Seedance 2.0 Shot Design Skills
Natural language → ComfyUI workflow JSON. 34 built-in templates, 360+ node definitions, auto model download. Supports txt2img, img2img, txt2vid, img2vid, audio, 3D generation across SD1.5/SDXL/S
An open-source video generation workbench driven by AI agents: novel → character/scene/prop design → script → storyboard → video, with consistent characters and scenes across shots | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Oll
PraisonAI: Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous agents that research, plan, code, and execute tasks. Deployed in 5 lines of code with built-in memory, R
The python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple AI providers.
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for
I'm going to build my own OpenClaw, with blackjack... and bun!
Save 120+ Hours of Setup Pain (I did it for you): Launch Your OpenClaw Agent Teams with 1 Command (15+ Recipes)
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for pers
A little course on Reinforcement Learning Environments for evaluating and training Language Models
RAPTOR (Robust AI-Powered Toolkit for Operational Robots) is an AI-native Content Insight Engine that transforms passive media storage into an intelligent knowledge platform through automated analysis
The Official Model Context Protocol (MCP) server for Kagi search & other tools.
An easy to use GUI-based tool that performs live translations using OCR and LLMs (Either cloud or local only)
AI image generation CLI powered by Gemini 3 Pro. Green screen transparency, reference images, style transfer. Also a Claude Code plugin.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
AI-powered YouTube Shorts automation tool using LLMs, real-time search, and text-to-speech. Create engaging short-form videos with automated research, voiceovers, and subtitles.
Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
The PHP Agentic Framework to build production-ready AI driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. With its modular architecture it's
MCP server to manage Facebook and Instagram Ads (Meta Ads)
Autonomous Agents (LLMs) research papers. Updated Daily.
Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website
Cutting-edge Full-stack AI Platform delivered as a SaaS (Software as a Service). Built on a robust technology stack, integrated with powerful APIs such as OpenAI and Replicate, offers a seamless exper
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
The ultimate space for work and life: to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level, enabling multi-agent collaboration, effo
Multi-agent memory consistency platform. We're hiring contributors; check HIRING.md
Convoke extends BMAD Method AI agents with two types of installable modules: Teams bring new agents for a domain, Skills add new capabilities to existing agents. Install them independently or combine
Crawl4AI MCP Server: Extract content from web pages, PDFs, Office docs, YouTube videos with AI-powered summarization. 17 tools, token reduction, production-ready.
A modular MCP server that provides commonly used developer tools for AI coding agents
Hands-on workshop: Build a multi-agent AI system from scratch. Deep Research Agent + Writing Workflow served as MCP servers. Includes code, slides, and video (coming soon)
Internal Safety Collapse: Turning the LLM or an AI Agent into a sensitive data generator.
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
The official Rust SDK for the Model Context Protocol
Model Context Protocol - MCP for Mifos X
This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.
Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt engineering, prompt attack & prompt protection. Advanced prompt engineering papers.
Memory library for building stateful agents
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of ta
Automatically updates LLM-agent papers daily using GitHub Actions (updates every 12 hours)
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
2026, the year of swarm agents: a collection covering swarm agents, agent teams, AI coding, skills, memory, evolution, agentic RL, and other AI agent topics
Assorted useful tools, almost entirely generated using LLMs
META-AGENTIC α-AGI - Mission, end-to-end: Identify → Out-Learn → Out-Think → Out-Design → Out-Strategise → Out-Execute
Automated security investigation tool using Microsoft MCP Servers, GitHub Copilot, Python Modules and custom copilot-instructions.
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x
The ThoughtSpot MCP Server
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.
A Model Context Protocol (MCP) server for automating openMSX emulator instances. This server provides comprehensive tools for MSX software development, testing, and automation through standardized MCP
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.
Nuwax Agent OS - The world's first universal agent operating system, building your private vertical general-purpose agent. A next-generation platform for designing, developing, and running AI applications: no code required, easy to create, suitable for all kinds of users, supports publishing to multiple endpoints and APIs, provides comprehensive
Unified framework for building enterprise RAG pipelines with small, specialized models
Autospec is an open-source AI agent that takes a web app URL and autonomously QAs it, and saves its passing specs as E2E test code
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
A curated list of awesome works related to high dimensional structure/vector search & database
Agent Zero AI framework
Summon your AI superpower: voice, vision, and autonomous action
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade o
AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
The AI Operating System for Delphi. 100% native framework with RAG 2.0 for knowledge retrieval, autonomous agents with semantic memory, visual workflow orchestration, and universal LLM connector. Supp
Learn to build AI agents with Strands framework. Covers LLM integration via Amazon Bedrock/Anthropic, AWS service connections, tool implementation with MCP/A2A protocols, and agent evaluation using La
Open-source, self-improving autonomous agent swarm
The video search layer for AI agents. Search video by meaning, across speech, visuals, and on-screen text.
Open-source framework for conversational voice AI agents
Token-efficient browser MCP server: structured web pages for AI agents, not raw accessibility dumps
A Claude Code skill that turns your Obsidian vault into a living second brain: autonomous writes, thinking tools, knowledge ingestion, scheduled agents, and _CLAUDE.md for cross-surface context.
Enterprise-ready MCP Gateway & Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access for both autonomous AI agents and AI coding a
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Production-ready MCP server exposing JustOneAPI endpoints to AI agents with raw JSON responses.
Open-Sable is a local-first autonomous agent framework with AGI-inspired cognitive subsystems (goals, memory, metacognition, tool use). It can run continuously on your machine, integrate with chat int
An open-source AI assistant framework with skills and agent architecture
The Go client for Chroma vector database
trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
A comprehensive Model Context Protocol (MCP) server that enables AI assistants to control Unreal Engine through the native C++ Automation Bridge plugin. Built with TypeScript and C++.
Playwright MCP server
LLM-driven debugger server: give your AI agents step-through debugging superpowers
MCP Server for Computer Use in Windows
A highly customizable personal AI assistant for Discord featuring smart agentic AI features such as memory, personas, tool usage, and more! | Fully equipped with long-term memory, personas, and tool integration: a next-generation autonomous AI agent Discord bot!
Exposes internet search tools for use by LLM-backed Assist in Home Assistant
AI-powered low-code platform for UI generation and publishing, built on TailwindCSS. Quickly build modern responsive UIs, dynamic custom components, and multi-theme, multilingual website applications through drag-and-drop visual editing.
Tool that just makes your open source project better using LLM agents
"RAG-Anything: All-in-One RAG Framework"
My personal Claude Code and OpenAI Codex setup with battle-tested skills, commands, hooks, agents and MCP servers that I use daily.
YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.
Develop enterprise AI agents with integrated tools for chat, video, image editing, and secure multi-tenant workflows.
500+ curated Seedance 2.0 video generation prompts: cinematic, anime, UGC, ads, meme styles. Includes Seedance API guides, character consistency tips, and advanced video workflows.
Explore curated Seedance 2.0 prompts with proven results, clear sources, and ready-to-use templates for faster content generation.
CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
We gave AI agents a brain. Memory, planning, continuity, and self-repair: the missing cognitive architecture layer. Runs on your Mac.
Generate precise video prompts for Jimeng Seedance 2.0 using structured patterns, templates, and AI agent compatibility.
Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM, local or cloud.
Customize Claude Code's system prompts, create custom toolsets, input pattern highlighters, themes/thinking verbs/spinners, customize input box & user message styling, support AGENTS.md, unlock privat
This is MCP server for Claude that gives it terminal control, file system search and diff file editing capabilities
AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and compliance reporting
Open-source AI browser agent for Chrome and Firefox
Computer Environments Elicit General Agentic Intelligence in LLMs
Enable local LLMs with real-time Google search, live feeds, OCR, and video insights using noapi-google-search-mcp server tools.
Remove watermarks from SORA 2 video generations to enhance clarity and accessibility. Experience seamless AI-generated content without distractions.
AI-native SaaS framework that builds full-stack apps using autonomous AI agents
Connect AI models like Claude & GPT with robots using MCP and ROS.
A SEC EDGAR MCP (Model Context Protocol) Server
AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and o
Generate diverse AI content effortlessly with powerful models for text-to-image, image-to-image, text-to-video, and more.
AI co-pilot for ComfyUI: 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
Generate a custom newspaper with an AI agent based on your favorite YouTube channels.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!
Equip AI agents with internet access to gather real-time data from restricted or hard-to-reach online sources.
Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.
Remove watermarks from OpenAI Sora 2 videos using precise spectral analysis to keep video quality intact and watermark-free.
Generate reliable short finance explainer videos with script, slides, voice, subtitles, and batch-ready rendering in a stable, modular workflow.
Power advanced AI to create films using text, images, audio, and video inputs with a flexible quad-modal filmmaking engine.
Provide unofficial API access and documentation for Seedance 2.0 to enable video generation with ByteDance's model.
Generate AI-driven videos with Seedance 2.0, offering precise physics, lip-sync, and prompt accuracy for seamless content creation.
Simplify AI agent deployment and management with OpenClaw-Turbo's secure, intuitive interface optimized for Linux and Chinese language support.
Business Apps Made Simple with Asp.Net Core MVC / TypeScript
Open source local sandboxing for running AI generated code.
Enhance cinematic image quality with ComfyUI-None-upup. This AI engine offers nodes for clarity, brightness, and video processing to elevate your visuals.
CoexistAI is a modular, developer-friendly research assistant framework. It enables you to build, search, summarize, and automate research workflows using LLMs, web search, Reddit, YouTube, and mappi
Transform any camera into ROS2 image topics for seamless integration with robotic systems and effective VLA model deployment.
Build an enterprise-ready RAG system to enhance technical documentation querying with LangGraph and multi-step reasoning workflows.
Explore alternatives to Discord with a curated list of early-stage apps, evaluating features, hosting, and encryption to guide your choice.
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
General Framework for Dota 2 AI Competitions
FlexRAG: A RAG Framework for Information Retrieval and Generation.
David AI is a free and open-source collection of customizable, production-ready UI components built with Tailwind CSS.
Robust, fast, scalable, and sandboxed open-source online code execution system for humans and AI.
