Search results for "jpeg"
A tool to determine the content type of a file with deep learning
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
A wrapper around the pdftoppm and pdftocairo command line tools to convert PDF to a PIL Image list.
Submit and manage Forma (https://joinforma.com) claims from the command line and Model Context Protocol (MCP) clients
Universal AI Development Platform with MCP server integration, multi-provider support, and professional CLI. Build, test, and deploy AI applications with multiple ai providers.
Agentic framework | Self-improving memory | Pluggable tool extensions | Sandbox execution
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
Give any AI agent a full desktop — it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python
📄 Production-ready MCP server for PDF processing - 5-10x faster with parallel processing and 94%+ test coverage
MeiGen-AI-Design-MCP — Turn Claude Code / OpenClaw into your local Lovart. Local ComfyUI, 1,400+ prompt library, multi-direction parallel generation.
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
Secure AI conversations with documents, video, audio, and more. Personal workspaces for focused context, group spaces for shared insight. Classify docs, reuse prompts, and extend with modular features
Playwright MCP server
"RAG-Anything: All-in-One RAG Framework"
This is MCP server for Claude that gives it terminal control, file system search and diff file editing capabilities
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
Assorted useful tools, almost entirely generated using LLMs
Open-Source Intelligent Command Layer
Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text — no LLM needed). MCP server for AI agents with 500x fewer tokens. Extract, observe, script
Generic markdown collection MCP server with FTS5 + semantic search, frontmatter-aware indexing, and incremental reindexing
🎨 100+ selected GPT Image 1.5 prompts with images, multilingual support, and instant gallery preview. Open-source prompt engineering library
Ambient intelligence that sees what you see, hears what you hear, and acts on your behalf
a jinja2 extension to use humanize library inside jinja2 templates
AI coding agent for your terminal, implemented in pure Rust
🤖 Develop enterprise AI agents with integrated tools for chat, video, image editing, and secure multi-tenant workflows.
Second Brain is a desktop application that acts as a personal knowledge base, using retrieval-augmented generation (RAG), multimodal AI models, and a hybrid lexical/semantic search algorithm to intera
Syllabus-aware RAG study assistant for university students. Answers strictly from your own notes & PDFs, unit-scoped retrieval, cross-encoder reranking, and a hallucination gate — built to help studen
Generate reliable short finance explainer videos with script, slides, voice, subtitles, and batch-ready rendering in a stable, modular workflow.
Official integrations for SnapRender Screenshot API — MCP server, SDKs, OpenClaw, ChatGPT Actions, Postman
🖼️ Convert images quickly between formats with ImC, a fast and simple CLI tool built on Pillow for efficient batch processing and clean command usage.
