Search results for "qa"
89 skills and 38 specialized agents that enforce proven engineering practices for AI-assisted development. TDD, systematic debugging, parallel code review, and 10-gate development cycles, as a Claude
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
MCP Server for Computer Use in Windows
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Give any AI agent a full desktop: it sees the screen, clicks, types, and runs apps like a human. Automate anything with a UI: browsers, legacy software, internal tools. No API needed. One Docker comm
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP tools, 5-tier memory, A2A protocol, knowledge vault, voice, browser automation, Computer-us
A-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
A modular RAG (Retrieval-Augmented Generation) system with MCP Server architecture. Uses Skills to make the AI follow each step of the spec, so that the code is written 100% by AI.
OpenClaw Q&A community: AI agent memory systems, multi-agent architecture, evolution systems, embodied AI | 龙虾茶馆 (Lobster Teahouse) 🦞
Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.
auto-wing is a tool that uses an LLM to assist automated testing
Self-hosted orchestration layer for autonomous AI agent teams. Shared memory, heartbeat scheduling, vault-first secrets, and cross-model peer review; one command to deploy.
autonomous AI agent that builds full-stack apps. local models. no cloud. no API keys. runs on your hardware.
A curated list of products, benchmarks, and research papers on autonomous code agents. Beyond coding, they're redefining how software changes the world.
[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2
META-AGENTIC α-AGI 👁️✨ - Mission 🎯 End-to-end: Identify 🔍 → Out-Learn 📚 → Out-Think 🧠 → Out-Design 🎨 → Out-Strategise ♟️ → Out-Execute ⚡
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Claw-Eval is an evaluation harness for LLMs acting as agents. All tasks are verified by humans.
Unified framework for building enterprise RAG pipelines with small, specialized models
Lightweight, embedded graph-based memory system for AI applications. Fast (<3ms recall), offline-first, with MCP server support for Claude and other AI tools.
Claude Code skills, architectural principles, and alternative approaches for AI-assisted development
Project-agnostic, composable AI workflow automation via pi packages and Claude Code plugins.
JSON Agents - A universal JSON-native standard for describing AI agents, their capabilities, tools, runtimes, and governance in a portable, framework-agnostic format. Based on RFC 8259, JSON Schema 2
Transform Claude Code into a full development team. 11 specialized agents (Architect, Engineer, QA, Security, UX, DevOps, and more), persistent memory across sessions, and 25,000+ on-demand skills. Wo
Build AI agents that actually do things. Synapse is an open-source platform for creating, connecting, and orchestrating AI agents powered by any LLM, local or cloud.
Claude Code skills collection: CCA study guides, Twitter research, MCP review, auto-iteration tools
AI co-pilot for ComfyUI: 113 tools for workflow authoring, model provisioning, and iterative rendering. Multi-provider (Claude, GPT-4o, Gemini, Ollama). Ships as MCP server or standalone CLI.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge base
FlexRAG: A RAG Framework for Information Retrieval and Generation.
An automated, agentic exploratory testing tool that performs comprehensive QA testing on web applications, simulating human user interactions through various input methods (mouse, keyboard, TAB naviga
Provide open-source AI bots for Lark to automate tasks like brainstorming, project planning, content creation, and monitoring within a secure chat interface.
🦾 A production-ready research outreach AI agent that plans, discovers, reasons, uses tools, auto-builds cited briefings, and drafts tailored emails with tool-chaining, memory, tests, and turnkey Dock
A complete web automation framework for end-to-end testing.
behave is behaviour-driven development, Python style
A plugin for flake8 finding likely bugs and design problems in your program. Contains warnings that don't belong in pyflakes and pycodestyle.
Contains the API for end users as well as helper functions and classes to build Allure adapters for Python test frameworks
