freshcrate
Skin:/
Home > MCP Servers > paiml-mcp-agent-toolkit

paiml-mcp-agent-toolkit

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

Why this rank:Strong adoptionRelease freshnessHealthy release cadence

Description

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

README

PMAT

PMAT

Zero-configuration AI context generation for any codebase


Table of Contents

What is PMAT?

PMAT (Pragmatic Multi-language Agent Toolkit) provides everything needed to analyze code quality and generate AI-ready context:

  • Context Generation - Deep analysis for Claude, GPT, and other LLMs
  • Technical Debt Grading - A+ through F scoring with 6 orthogonal metrics
  • Mutation Testing - Test suite quality validation (85%+ kill rate)
  • Repository Scoring - Quantitative health assessment (0-289 scale, 11 categories)
  • Git History RAG - Semantic search across commit history with RRF fusion
  • Semantic Search - Natural language code discovery
  • Compliance Governance - 30+ checks across code quality, best practices, and reproducibility
  • Design by Contract - Toyota Way contract profiles with checkpoint validation and rescue protocols
  • Autonomous Kaizen - Toyota Way continuous improvement with auto-fix and commit
  • MCP Integration - 19 tools for Claude Code, Cline, and AI agents
  • Quality Gates - Pre-commit hooks, CI/CD integration, .pmat-gates.toml config
  • 20+ Languages - Rust, TypeScript, Python, Go, Java, C/C++, Lua, Lean, and more

Part of the PAIML Stack, following Toyota Way quality principles (Jidoka, Genchi Genbutsu, Kaizen).

Annotated Code Search

pmat query annotated output

pmat query "cache invalidation" --churn --duplicates --entropy --faults

Every result includes TDG grade, Big-O complexity, git churn, code clones, pattern diversity, fault annotations, call graph, and syntax-highlighted source.

Installation

# Install from crates.io
cargo install pmat

# Or from source (latest)
git clone https://github.com/paiml/paiml-mcp-agent-toolkit
cd paiml-mcp-agent-toolkit && cargo install --path .

Usage

# Generate AI-ready context
pmat context --output context.md --format llm-optimized

# Analyze code complexity
pmat analyze complexity

# Grade technical debt (A+ through F)
pmat analyze tdg

# Score repository health
pmat repo-score .

# Run mutation testing
pmat mutate --target src/

# Start MCP server for Claude Code, Cline, etc.
pmat mcp

Features

Context Generation

Generate comprehensive context for AI assistants:

pmat context                           # Basic analysis
pmat context --format llm-optimized    # AI-optimized output
pmat context --include-tests           # Include test files

Technical Debt Grading (TDG)

Six orthogonal metrics for accurate quality assessment:

pmat analyze tdg                       # Project-wide grade
pmat analyze tdg --include-components  # Per-component breakdown
pmat tdg baseline create               # Create quality baseline
pmat tdg check-regression              # Detect quality degradation

Grading Scale:

  • A+/A: Excellent quality, minimal debt
  • B+/B: Good quality, manageable debt
  • C+/C: Needs improvement
  • D/F: Significant technical debt

Mutation Testing

Validate test suite effectiveness:

pmat mutate --target src/lib.rs        # Single file
pmat mutate --target src/ --threshold 85  # Quality gate
pmat mutate --failures-only            # CI optimization

Supported Languages: Rust, Python, TypeScript, JavaScript, Go, C/C++, C#, Lua, Lean, Java, Kotlin, Ruby, Swift, PHP, Bash, SQL, Scala, YAML, Markdown + MLOps model formats (GGUF, SafeTensors, APR)

Repository Health Scoring

Evidence-based quality metrics (0-289 scale, 11 categories):

pmat rust-project-score                # Fast mode (~3 min)
pmat rust-project-score --full         # Comprehensive (~10-15 min)
pmat repo-score . --deep               # Full git history

Workflow Prompts

Pre-configured AI prompts enforcing EXTREME TDD:

pmat prompt --list                     # Available prompts
pmat prompt code-coverage              # 85%+ coverage enforcement
pmat prompt debug                      # Five Whys analysis
pmat prompt quality-enforcement        # All quality gates

Git History RAG

Search git history by intent using TF-IDF semantic embeddings:

# Fuse git history into code search
pmat query "fix memory leak" -G

# Search with churn, clones, entropy, faults
pmat query "error handling" --churn --duplicates --entropy --faults
# Run the example
cargo run --example git_history_demo

Git Hooks

Automatic quality enforcement:

pmat hooks install                     # Install pre-commit hooks
pmat hooks install --tdg-enforcement   # With TDG quality gates
pmat hooks status                      # Check hook status

Compliance Governance (pmat comply)

30+ automated checks across code quality, best practices, and governance:

pmat comply check                      # Run all compliance checks
pmat comply check --strict             # Exit non-zero on failure
pmat comply check --format json        # Machine-readable output
pmat comply migrate                    # Update to latest version

Key Checks:

  • CB-200: TDG Grade Gate — blocks on non-A functions (auto-rebuilds stale index)
  • CB-304: Dead code percentage enforcement
  • CB-400: Shell/Makefile quality via bashrs
  • CB-500: Rust best practices (30+ patterns)
  • CB-600: Lua best practices
  • CB-900: Markdown link validation
  • CB-1000: MLOps model quality

Provable-Contracts Enforcement (CB-1200..1210):

  • CB-1208: Binding existence — verifies binding.yaml functions exist in src/, detects ghost bindings (L0-L3 enforcement levels)
  • CB-1209: Contract trait enforcement — checks tests/contract_traits.rs for compiler-verified trait impls (13 kernel traits)
  • CB-1210: Precondition quality — flags mass-generated boilerplate and missing postconditions

Configure via .pmat.yaml:

comply:
  thresholds:
    min_tdg_grade: "A"
    pv_lint_is_error: true        # CB-1201: FAIL on pv lint failure
    min_binding_existence: 95     # CB-1208: 95% binding verification
    require_all_traits: true      # CB-1209: 13/13 traits required
    min_kani_coverage: 20         # CB-1206: minimum Kani proof %

Infrastructure Score (pmat infra-score)

CI/CD quality scoring (0-100 + 10 bonus for provable-contracts):

pmat infra-score                       # Text output
pmat infra-score --format json         # Machine-readable
pmat infra-score -v --failures-only    # Show only failing checks

Categories: Workflow Architecture (25pts), Build Reliability (25pts), Quality Pipeline (20pts), Deployment & Release (15pts), Supply Chain (15pts), Provable Contracts bonus (10pts).

Document Search (pmat query --docs)

Search documentation files (Markdown, text, YAML) alongside code:

pmat query "authentication" --docs          # Code + docs results
pmat query "deployment" --docs-only         # Only documentation
pmat query "API endpoints" --no-docs        # Exclude docs (default)

Autonomous Kaizen (pmat kaizen)

Toyota Way continuous improvement — scan, auto-fix, commit:

pmat kaizen --dry-run                  # Scan only (no changes)
pmat kaizen                            # Apply safe auto-fixes
pmat kaizen --commit --push            # Fix, commit, and push
pmat kaizen --format json -o report.json  # CI/CD integration

# Cross-stack mode: scan all batuta stack crates in one invocation
pmat kaizen --cross-stack --dry-run    # Scan all crates
pmat kaizen --cross-stack --commit     # Fix and commit per-crate
pmat kaizen --cross-stack -f json      # Grouped JSON report

Function Extraction (pmat extract)

Extract function boundaries with metadata:

pmat extract src/lib.rs                # Extract functions from file
pmat extract --list src/               # List all functions with imports and visibility

Examples

Generate Context for AI

# For Claude Code
pmat context --output context.md --format llm-optimized

# With semantic search
pmat embed sync ./src
pmat semantic search "error handling patterns"

CI/CD Integration

# Add to your CI pipeline
steps:
  - uses: actions/checkout@v4
  - run: cargo install pmat
  - run: pmat analyze tdg --fail-on-violation --min-grade B
  - run: pmat mutate --target src/ --threshold 80

Quality Baseline Workflow

# 1. Create baseline
pmat tdg baseline create --output .pmat/baseline.json

# 2. Check for regressions
pmat tdg check-regression \
  --baseline .pmat/baseline.json \
  --max-score-drop 5.0 \
  --fail-on-regression

Architecture

pmat/
├── src/
│   ├── cli/          Command handlers and dispatchers
│   ├── services/     Analysis engines (TDG, SATD, complexity, agent context)
│   ├── mcp_server/   MCP protocol server
│   ├── mcp_pmcp/     PMCP protocol integration
│   └── models/       Configuration and data models
├── examples/         89 runnable examples
└── docs/
    └── specifications/  Technical specs

Quality

Metric Value
Tests 21,200+ passing
Coverage 99.66%
Mutation Score >80%
Languages 20 supported + MLOps model formats
MCP Tools 19 available

Falsifiable Quality Commitments

Per Popper's demarcation criterion, all claims are measurable and testable:

Commitment Threshold Verification Method
Context Generation < 5 seconds for 10K LOC project time pmat context on test corpus
Memory Usage < 500 MB for 100K LOC analysis Measured via heaptrack in CI
Test Coverage â‰Ĩ 85% line coverage cargo llvm-cov (CI enforced)
Mutation Score â‰Ĩ 80% killed mutants pmat mutate --threshold 80
Build Time < 3 minutes incremental cargo build --timings
CI Pipeline < 15 minutes total GitHub Actions workflow timing
Binary Size < 50 MB release binary ls -lh target/release/pmat
Language Parsers All 20 languages parse without panic Fuzz testing in CI

How to Verify:

# Run self-assessment with Popper Falsifiability Score
pmat popper-score --verbose

# Individual commitment verification
cargo llvm-cov --html        # Coverage â‰Ĩ85%
pmat mutate --threshold 80   # Mutation â‰Ĩ80%
cargo build --timings        # Build time <3min

Failure = Regression: Any commitment violation blocks CI merge.

Benchmark Results (Statistical Rigor)

All benchmarks use Criterion.rs with proper statistical methodology:

Operation Mean 95% CI Std Dev Sample Size
Context (1K LOC) 127ms [124, 130] Âą12.3ms n=1000 runs
Context (10K LOC) 1.84s [1.79, 1.90] Âą156ms n=500 runs
TDG Scoring 156ms [148, 164] Âą18.2ms n=500 runs
Complexity Analysis 23ms [22, 24] Âą3.1ms n=1000 runs

Comparison Baselines (vs. Alternatives):

Metric PMAT ctags tree-sitter Effect Size
10K LOC parsing 1.84s 0.3s 0.8s d=0.72 (medium)
Memory (10K LOC) 287MB 45MB 120MB -
Semantic depth Full Syntax only AST only -

See docs/BENCHMARKS.md for complete statistical analysis.

ML/AI Reproducibility

PMAT uses ML for semantic search and embeddings. All ML operations are reproducible:

Random Seed Management:

  • Embedding generation uses fixed seed (SEED=42) for deterministic outputs
  • Clustering operations use fixed seed (SEED=12345)
  • Seeds documented in docs/ml/REPRODUCIBILITY.md

Model Artifacts:

  • Pre-trained models from HuggingFace (all-MiniLM-L6-v2)
  • Model versions pinned in Cargo.toml
  • Hash verification on download

Dataset Sources

PMAT does not train models but uses these data sources for evaluation:

Dataset Source Purpose Size
CodeSearchNet GitHub/Microsoft Semantic search benchmarks 2M functions
PMAT-bench Internal Regression testing 500 queries

Data provenance and licensing documented in docs/ml/REPRODUCIBILITY.md.

Sovereign Stack

PMAT is built on the PAIML Sovereign Stack - pure-Rust, SIMD-accelerated libraries:

Library Purpose Version
aprender ML library (text similarity, clustering, topic modeling) 0.27.1
trueno SIMD compute library for matrix operations 0.16.1
trueno-graph GPU-first graph database (PageRank, Louvain, CSR) 0.1.17
trueno-rag RAG pipeline with VectorStore 0.2.2
trueno-db Embedded analytics database 0.3.15
trueno-viz Terminal graph visualization 0.2.1
trueno-zram-core SIMD LZ4/ZSTD compression (optional) 0.3.0
pmat Code analysis toolkit 3.7.0

Key Benefits:

  • Pure Rust (no C dependencies, no FFI)
  • SIMD-first (AVX2, AVX-512, NEON auto-detection)
  • 2-4x speedup on graph algorithms via aprender adapter

Documentation

Contributing

See CONTRIBUTING.md for development setup, testing, and pull request guidelines.

See Also

License

MIT License - see LICENSE for details.


Built with Extreme TDD | Part of PAIML

Release History

VersionChangesUrgencyDate
v3.17.0## Highlights Two ship-blocking bugs landed via #564 and #565, both discovered while building a Coursera RAG-from-Zero lab. ### Fixed - **`scripts/install.sh` URL pattern, tarball layout, and Linux platform default** (#561 → #564). Three sub-bugs in the documented one-liner installer: 1. URL was constructed as `paiml-mcp-agent-toolkit-${PLATFORM}.tar.gz` but actual release assets are named `pmat-v${VERSION}-${PLATFORM}.tar.gz` — every install since the v3.0 rename returned 404. 2. The releHigh5/6/2026
v3.16.0## Highlights - **4 bug fixes**: `pmat comply` default-yaml severity defect (#527), rustls-webpki RUSTSEC-2026-0104 bump (#476), incremental analysis flake (#449), entropy regex word-end bug (#396) - **80% line coverage milestone** across the workspace - **354 new tests** added since v3.14.0 - **MCP**: forward `ToolInfo` metadata for 4 `pmat_*` handlers (#389) - **Verification**: Kani proof harnesses for pure functions (#299) - **Brew**: rename `homebrew/` → `Formula/` for tap convention (#388)High4/27/2026
v3.14.0## What's Changed Migrate all sovereign stack dependencies from old crate names to the unified `aprender-*` 0.30 monorepo on crates.io. ### Dependency renames | Old name | New name | Version | |----------|----------|---------| | `trueno` | `aprender-compute` | 0.30 | | `trueno-db` | `aprender-db` | 0.30 | | `trueno-graph` | `aprender-graph` | 0.30 | | `trueno-rag` | `aprender-rag` | 0.30 | | `trueno-viz` | `aprender-viz` | 0.30 | | `trueno-zram-core` | `aprender-zram-core` | 0.30 | | `presentHigh4/15/2026
v3.10.0## What's New ### `pmat explain` Command Look up any check, metric, or grade to understand what it checks, why it matters, and how to fix failures. ```bash pmat explain CB-1210 # Explain a specific check pmat explain CB-12 # Prefix match: all CB-12xx checks pmat explain precondition # Fuzzy search in names/descriptions pmat explain # List all available checks ``` 19 check explanations across comply (CB-xxx), provable contracts (PV-xx), and TDG grades. ### Provable ContrMedium3/31/2026
v3.8.0## Provable Contracts Integration Escape-proof contract enforcement pipeline across the entire sovereign AI stack. YAML contracts → build.rs env vars → `#[contract]` proc macro → `debug_assert!()` injection. Zero runtime cost in release builds. ### New Compliance Checks | Check | What it verifies | |-------|-----------------| | **CB-1203** | Contract equations with preconditions have `#[contract]` macros | | **CB-1204** | `build.rs` emits PRE/POST env vars from YAML contracts | | **CB-1205** Medium3/23/2026
v3.7.0## Highlights - **RPS v3.0**: New Reproducibility scorer wrapping Popper categories B-F (15 pts), falsifiability gateway (Jidoka), 11 categories / 289 max points - **PMAT-510**: Six scoring system improvements — Five Whys v2 evidence weights, Muda file mapping, EvoScore CB-142, `--rank-by priority`, `test --record` - **Performance**: -2.57 GB peak memory (deep context), -59% index build allocations, -44 MB graph clone elimination - **New commands**: `ci-local`, `bottleneck`, `test-stability`, `Low3/9/2026
v3.3.0## What's New ### Rich `pmat extract --list` output (#217) `pmat extract --list` now returns a rich JSON object instead of a flat array, providing everything needed to split large files into compilable parts: **New top-level fields:** - `file` — file path - `language` — detected language (`rust`, `typescript`, `python`, `c`, `cpp`, `go`, `lua`) - `imports` — all top-level import/use statements (full text) - `cfg_test_line` — line where `#[cfg(test)]` appears (Rust only) **New per-item field:Low2/15/2026
v3.1.0## What's New ### Lua Compliance Suite (CB-608 to CB-619) - **12 new Lua checks**: coroutine misuse, require cycles, global protection, weak tables, test framework detection, OpenResty patterns, FFI safety, OOP patterns, type annotations - Lua dead code detection with module export awareness - Lua mutation testing adapter with busted integration - Tree-sitter TDG analyzer for Lua with full 7-component scoring ### SQL, Scala, YAML, Markdown, MLOps Compliance - **CB-700**: SQL best practices (N+Low2/14/2026
v3.0.7## Fix #173: check_provability hardcoded score + config ignored ### Changes - **Real provability analysis**: `check_provability()` now calls `calculate_provability_score()` using the `LightweightProvabilityAnalyzer` instead of returning a hardcoded `0.65` - **Config threshold**: Added `provability_min = 0.60` to `.pmat-metrics.toml` so the threshold is read from config instead of always falling back to the 0.70 default ### Files Modified - `src/cli/analysis_utilities/quality_checks_part2.rs` —Low2/12/2026
v3.0.6## What's New ### CB-600 Lua Best Practices Detection (PMAT-487) 8 new compliance checks for Lua codebases, grounded in LuaTaint, FLuaScan, Luau type system, and luacheck research: - **CB-600**: Implicit globals — assignment without `local` keyword - **CB-601**: Nil-unsafe access — chained calls on function returns, deep field chains - **CB-602**: pcall error handling — uncaptured/unchecked pcall/xpcall - **CB-603**: Deprecated/dangerous API — `os.execute()`, `loadstring()`, `setfenv()` - **CBLow2/12/2026
v3.0.4## What's New ### Full Lua Language Support (PMAT-486) - **tree-sitter-lua 0.2.0** AST parsing: functions, `require()` imports, table constructors, control flow - Language detection, indexing, function naming conventions (snake_case) - Context generation, simple_deep_context, complexity analysis - Lua analysis example: `cargo run --example lua_analysis` - pmat-book Chapter 13 updated with Lua documentation ### CB-081 Sovereign Threshold Compliance - Full `pmat comply` compliance achieved ## BLow2/10/2026
v3.0.1## Bug Fixes - **Fix context lines (-A/-B/-C) for workspace queries**: Context line display now resolves workspace sibling paths (e.g., `trueno/`, `aprender/`). Previously, workspace function results produced empty output when using `-A`, `-B`, or `-C` flags. - **Fix --exclude-tests in raw mode**: The `--exclude-tests` flag now works correctly with `--raw`, filtering out test files by path pattern. - **Fix enrich_coverage ANDON**: Coverage enrichment dropped from 17-25s to ~1ms on cache hit vLow2/8/2026
v3.0.0## PMAT v3.0.0 ### Highlights - **Git History RAG Pipeline**: Semantic search across git commit history using TF-IDF embeddings - **Reciprocal Rank Fusion (RRF)**: Merge code search and git history results for better relevance - **Incremental Git Sync**: SQLite-backed commit index with O(1) sync detection - **95.30% Test Coverage**: Up from 92.02% with ~100 new tests ### New Features - `pmat query "intent" -G` - Fuse git commit history into code search via RRF - `pmat query "intent" --churn`Low2/5/2026
v2.217.0## New Features ### Struct/Type Definition Search - **New `--type` filter for `pmat query`**: Search by definition type (fn, struct, enum, trait, type) - **DefinitionType enum**: Tracks what kind of definition is indexed - **Extended tree-sitter chunker**: Extracts struct/enum/trait/type definitions in addition to functions ### Usage Examples ```bash # Search for all structs matching a pattern pmat query "Config" --type struct --limit 10 # Search for enums pmat query "Error" --type enum # SeLow2/5/2026
v2.216.0## Agent Context Query Improvements (PMAT-470) ### 5 Query Engine Improvements 1. **Persisted corpus** — Index v1.1.0 saves pre-computed corpus in the LZ4 blob, eliminating 80ms of rebuild on load 2. **Incremental index updates** — SHA256 file checksums detect unchanged files; only re-parses modified/new files (4.5s → 200ms for single-file edits) 3. **Generic name demotion** — Functions named `new`, `default`, `from` get frequency-based scoring penalty (floor at 0.3x) 4. **Scope-aware search**Low2/4/2026
v2.214.0## Popperian Falsification Quality Enforcement This release implements the Work Contract system with evidence-based quality enforcement based on Popperian falsification epistemology. ### Key Features - **Work Contract**: Captures immutable baseline at `pmat work start` (git SHA, TDG, coverage, file manifest) - **13 Falsifiable Claims**: Evidence-based validation at `pmat work complete` - Manifest integrity (no deleted files) - Meta-falsification (the falsifier tests itself) - Coverage gLow1/25/2026
v2.213.17## Changes ### Sovereign Stack Updates - **aprender**: 0.24.0 → 0.24.1 - **trueno**: 0.11.0 → 0.14.0 (major SIMD improvements) - **trueno-db**: 0.3.10 → 0.3.11 - **trueno-graph**: 0.1.10 → 0.1.11 - **trueno-rag**: 0.1.8 → 0.1.10 - **trueno-viz**: 0.1.17 → 0.1.20 ### Bug Fixes - Fixed Unicode boundary panic in `pmat work status` (#128) - Fixed Unicode panic in `pmat qa spec` (#120) - Improved error message for invalid phases format in roadmap.yaml (#130) ### Issue Triage - Closed all 32 open GLow1/24/2026
v2.213.16## Changes ### Bug Fixes - **TDG**: Include file paths in violation output (#tdg-violation-paths) - **CLI**: Show full relative paths in complexity analysis output - "Top Files by Complexity" now shows full path, not just filename - "Top Complexity Hotspots" shows `func` file:line format on one line - **CLI**: Remove internal ticket references from user-facing help text ### Improvements - Refactored examples to use const strings, reducing complexity from 70 to ~5 ### Full Changelog https:Low1/24/2026
v2.213.15## Highlights - **A+ Quality Score (95.8%)** achieved through improved Known Defects scoring - **Security**: Updated lru crate 0.14→0.16 (RUSTSEC-2026-0002) ## Fixed - **Known Defects Scorer**: Fixed false positives from test files - Now detects `#[cfg(all(test, ...))]` patterns - Expanded filename pattern detection (`*_tests_part*.rs`, `*_coverage_tests*.rs`, etc.) - Production unwrap count: 1361 → 219 (accurate) - **Production unwrap() → expect()**: 24 production calls converted withLow1/22/2026
v2.213.14## Fixed ### CB-021 SIMD False Positives - Eliminated 26 false positive warnings in compliance checker - Used `concat!()` macro to split pattern strings in detection code - Prevents compliance checker from flagging its own pattern definitions - Affected files: comply_handlers.rs, cuda_simd.rs, gpu_simd_scorer.rs ### CB-BUDGET False Positives - Improved ComputeBrick detection precision - Now only flags `impl ComputeBrick` trait implementations - Ignores structs like `BrickStats` that have "BrLow1/21/2026
v2.213.13## CB-020 Detection Improvement Fixes false positives for multi-line SAFETY comments in `pmat comply check`. ### Changes - Now checks up to 10 lines back (was 3) to find SAFETY comments - Supports `/ SAFETY:` doc comment style in addition to `// SAFETY:` - Eliminates false positives where SAFETY comments span multiple lines ### Install ```bash cargo install pmat ```Low1/21/2026
v2.213.12## File Health Enforcement System (CB-040) Prevents untestable large files through evidence-based enforcement. ### Features - **File Health Check** in `pmat comply check` - Detects files exceeding 500 lines (new) or 2000 lines (critical) - Calculates Test-to-Lines Ratio (TLR) with scaling requirements - Computes File Health Score (0-100) with letter grades (A+ to F) - **Pre-commit Hook** with ratchet mechanism - New files must be < 500 lines - Existing files cannot grow (Toyota Way Low1/21/2026
v2.213.11## 🔧 Critical Fix: `cargo install` Build Time **Problem**: `cargo install pmat` was hanging for hours at the final linking stage. **Root Cause (Five Whys Analysis)**: - `lto = "fat"` forced single-threaded recompilation of all 593 crates - `codegen-units = 1` prevented any parallelism in the link stage - Combined effect: catastrophic build times on consumer hardware **Solution**: | Setting | Before | After | Impact | |---------|--------|-------|--------| | `lto` | `"fat"` | `"thin"` | ~10x fLow1/15/2026
v2.213.8## What's New ### Dependency Optimization - Feature-gated `git2` behind `git-lib` feature (~67 deps saved when not enabled) - Feature-gated `octocrab` behind `github-api` feature (~255 deps saved) - Default build now has minimal dependencies for faster installs ### Feature Flags - `git-lib`: Use libgit2 for git operations (faster, more features) - `github-api`: GitHub API integration via octocrab - `analytics-simd`: SIMD-accelerated analytics ### Build Quality - ZERO bashrs Makefile lint warnLow1/14/2026
v2.213.5## What's New ### Test Coverage Improvements - **130+ new tests** added across multiple files - Overall coverage: **86.22%** - Added comprehensive tests for: - `command_suggestions.rs`: Levenshtein distance, command validation - `storage_backend.rs`: LibsqlBackend, InMemoryBackend - `resource_control.rs`: Serialization, pressure levels - `event_store.rs`: Refactored with batuta patterns ### Bug Fixes - Fixed `test_git_config_defaults` to match zero-branching policy - Fixed `test_full_cLow1/12/2026
v2.213.4**Full Changelog**: https://github.com/paiml/paiml-mcp-agent-toolkit/compare/v2.213.3...v2.213.4Low1/9/2026
v2.213.3**Full Changelog**: https://github.com/paiml/paiml-mcp-agent-toolkit/compare/v2.213.2...v2.213.3Low1/9/2026
v2.213.1## 🔒 Security Fix - **CVE-2025-64718**: Fixed prototype pollution vulnerability in js-yaml dependency (bridge/package-lock.json) ## đŸ›Ąī¸ Production Code Hardening - **234 `.unwrap()` calls eliminated**: Converted all production code `.unwrap()` to `.expect("internal error")` for better panic messages - Based on Cloudflare outage analysis (2025-11-18) - bare `.unwrap()` causes uninformative panics in production ## ✨ New Features (since v2.210.0) ### 200-Point Perfection Score System - CompreLow12/29/2025
v2.210.0## New Features ### Oracle Module (Toyota Way PDCA) - Implements Plan-Do-Check-Act continuous improvement cycle - Signal collectors: rustc, clippy, cargo test integration - Convergence detection with configurable targets - Andon principle: stagnation detection and regression prevention - Auto-apply fixes with confidence thresholds ### Rich Reporter Framework - Data science analysis: K-Means clustering, PageRank centrality - Louvain community detection for related defects - Z-score anomaly deteLow12/7/2025
v2.209.0## v2.209.0 - Pure WASM Dashboard ### Features - **pmat-dashboard**: New pure Rust/WASM dashboard crate built with Presentar framework - **81% bundle size reduction**: From 3.1 MB (JavaScript) to ~574 KB (WASM) - **WCAG 2.1 AA accessibility**: Built-in contrast ratio validation - **New widgets**: HotspotTable, MetricsChart, DagDiagram, DashboardButton - **WebSocket binary protocol**: Replaces Server-Sent Events for real-time updates - **60fps GPU-accelerated rendering**: Via WebGPU/trueno-viz Low12/6/2025
v2.208.0## What's New ### O(1) Coverage Caching (bashrs-style) - `make coverage-fast` - **8ms** cache hit using git tree hash - `make coverage-full` - Full coverage with slow tests (CI/nightly) - `make coverage-invalidate` - Clear cache when needed ### Philosophy Speed > vanity metrics. Fast tests for dev workflow, full tests for CI. ### Coverage - Fast coverage: ~60% (dev workflow, 8ms) - Full coverage: ~75-80% (CI/nightly, 30+ min) --- 🤖 Generated with [Claude Code](https://claude.com/claude-codeLow12/6/2025
v2.207.0## v2.207.0 - Pure Rust Semantic Search ### Highlights **Zero API Keys Required** - Semantic search now runs completely offline using pure Rust implementations. ### New Features - **Pure Rust Semantic Search Stack** - `trueno-rag` (0.1.2): RAG pipeline with hybrid retrieval, RRF fusion - `trueno-graph` (0.1.2): PageRank, BFS, Louvain clustering - `aprender` (0.14.0): TF-IDF, LDA, K-means, DBSCAN clustering - **Commands** - `pmat analyze topics --num-topics N` - Extract semantic topiLow12/2/2025
v2.206.0## pmat v2.206.0 ### Major Updates - **Dependencies**: Updated tokio to 1.48, octocrab to 0.48 - **Transitive deps**: aprender 0.7.1, pmcp 1.8.6, trueno-db 0.3.4, trueno-graph 0.1.2 ### Features (since 2.205.0) - `pmat test-discovery` phases 2-4: categorization, marking, verification (#98) - `pmat comply` command handlers for compliance checking (#96) - `pmat qa-work` Toyota Way QA validation (#102) - `pmat maintain cleanup-resources` command (#86) - `pmat maintain bug-report` command (#81) - Low12/2/2025
v2.205.0## 🚀 PMAT v2.205.0 ### Highlights - **trueno-graph Integration**: O(1) symbol lookups and PageRank scoring in context generation and TDG analysis - **unwrap() False Positive Fix**: Eliminated doc comment and test code false positives in rust-project-score (#99, #100) - **O(1 Quality Gates**: <10ms pre-commit validation with realistic thresholds ### Bug Fixes - Fix unwrap() false positives - exclude doc comments and `#[cfg(test)]` modules - Fix O(1 quality gate thresholds (lint: 150s, test-fLow11/25/2025
v2.201.0## Features ### File Locking for Multi-Agent Safety - Cross-platform file locking using `fs2` crate (Linux, macOS, Windows) - Exclusive locks for write operations prevent data loss - Shared locks for read operations allow concurrent readers - Atomic read-modify-write operations eliminate race conditions - RAII pattern ensures locks always released ### Multi-Agent Workflow Support - ✅ 10 concurrent agents can write simultaneously - ✅ Zero data loss - all concurrent writes succeed - ✅ Cross-procLow11/22/2025
v2.200.1## Release v2.200.1 ### Summary This bugfix release addresses [GitHub issue #84](https://github.com/paiml/paiml-mcp-agent-toolkit/issues/84) by allowing extra fields in roadmap YAML files to support backward compatibility with older roadmap formats. ### Changes - **Fix**: Remove `#[serde(deny_unknown_fields)]` attribute from `RoadmapItem` struct to allow extra YAML fields - **Test**: Add `test_trueno_db_yaml_format_with_extra_fields` to verify backward compatibility - **Version**: Bump versionLow11/21/2025
v2.200.0## Known Defects v2.1: Zero-Tolerance Defect Detection ### đŸ›Ąī¸ New Features **TDG Auto-Fail on Critical Defects** - TDG automatically fails (exit code 1) when critical defects detected - Score: 0.0/100, Grade: F - Zero tolerance for production-breaking patterns **New Command: `pmat analyze defects`** - Project-wide critical defect scanning - Multiple output formats: text, JSON, JUnit XML - Exit code 1 for CI/CD integration - Comprehensive test exclusion ### 🔴 Critical Defect Patterns **`.uLow11/21/2025
v2.198.0# Release v2.198.0 - Unified GitHub/YAML Workflow System ## Major Features ### Complete Workflow Management (Issue #75) - **5 New Commands**: `pmat work init`, `start`, `continue`, `complete`, `status` - **Hybrid Architecture**: Write-through to both GitHub Issues and YAML roadmaps - **GitHub Integration**: Fetch issue metadata, extract acceptance criteria, auto-link - **Quality Gates**: Automatic validation with `cargo test` and `cargo clippy` - **Pre-commit Hooks**: Commit message validationLow11/19/2025
v2.197.0## đŸĻ€ Rust Project Score v1.2 - Formal Verification ### Highlights - **New Category**: Formal Verification (8 points max) - **Miri Integration** (3 points): Undefined behavior detection for unsafe Rust - **Kani Formal Verification** (5 points): Mathematical proof of correctness using CBMC - **Total Points**: 114 (up from 106 in v1.1) - **Toyota Way Principles**: Jidoka (č‡Ē働化 - stop on UB), Genchi Genbutsu (empirical evidence), Kaizen (攚善) ### New Category: Formal Verification (8 points) ##Low11/19/2025
v2.195.0## 🚀 Release v2.195.0 ### Added #### Workflow Prompts: release-prep - Multi-language release preparation with comprehensive quality gates - Supports Rust, Python, TypeScript, and Go with variable substitution - Integrates Toyota Way principles (Jidoka, Andon Cord, Genchi Genbutsu, Kaizen) - Rollback procedures for emergency hotfixes - Location: `server/prompts/release-prep.yaml` #### Workflow Prompts: code-coverage v3.0 - Upgraded with compiler-grade quality standards - Research validation fLow11/14/2025
v2.194.1## Release v2.194.1 ### Changes - Updated Makefile targets to match bashrs style exactly - Fixed clippy warnings (too_many_arguments) - Marked flaky test as #[ignore] with Five Whys analysis - Improved test-fast and coverage targets with two-phase approach ### Quality Gates ✅ - pmat-book validation: PASSING - Clippy: 0 warnings - Tests: 4653 passed, 0 failed, 187 ignored - Repository: Clean ### Installation ```bash cargo install pmat ``` ### Documentation https://paiml.github.io/pmat-book/Low11/12/2025
v2.194.0## đŸŽ¯ Workflow Prompts for EXTREME TDD This release adds the `pmat prompt` command to show pre-configured workflow prompts that enforce EXTREME TDD and Toyota Way quality principles. ### ✨ New Features **`pmat prompt` Command** - 11 embedded workflow prompts (code-coverage, debug, continue, quality-enforcement, etc.) - Multiple output formats: YAML, JSON, text - Variable substitution with `${VAR}` syntax - `--list` flag to see all available prompts - `--show-variables` to see prompt variablesLow11/10/2025
v2.193.0## 🎉 Repository Health Scoring System Complete implementation of `pmat repo-score` - quantitative repository health assessment. ### ✨ New Feature: `pmat repo-score` Score your repository on a **0-110 scale** (100 base + 10 bonus points) across 6 quality categories: **📊 Base Categories (100 points):** - **A: Documentation** (20 pts) - README accuracy and comprehensiveness - **B: Pre-commit Hooks** (20 pts) - Hook presence and performance - **C: Repository Hygiene** (10 pts) - No cruft or teLow11/10/2025
v2.192.0## 🎉 Sprint 81: Issue #53 Complete - MCP Tool Placeholder Elimination (16/16, 100%) **All 16 MCP functions now use real services - placeholder elimination complete!** ### Batch 5: Advanced Analysis MCP Functions (Final Batch) This release completes Issue #53 by replacing the final 4 MCP tool placeholder functions with real service integrations: #### New Functions 1. **`analyze_lint_hotspots`** - Find quality hotspots via TDG analysis - TDG-based quality scoring with letter grades (A+ toLow11/1/2025
v2.189.0## BUG-010: Silence Noisy Warnings in SATD Analysis ### Summary Fixed confusing warnings that interleaved with progress indicators during context generation on large projects. ### Issue When running `pmat context` on large projects (e.g., Ceph), warnings were: - **Interleaved** with progress indicators - **Truncated**: "Parameter validation failed: l" (incomplete messages) - **Confusing**: "Warning: Error processing file" (contradictory) Example output (BROKEN): ``` ⠙ Running parallel analyseLow10/31/2025
v2.188.0## BUG-001-003: Fixed Embed Command Errors and Examples ### Summary Fixed three related bugs in the `pmat embed` subcommand that prevented users from using embed functionality. ### Issues Fixed **BUG-001: `pmat embed status` Invalid Format Error** ✅ - **Issue**: Showed `invalid value 'summary' for '--format <FORMAT>'` error - **Cause**: Default value "summary" but OutputFormat only has Table/Json/Yaml - **Fix**: Changed default from "summary" → "table" **BUG-002: `pmat embed sync` Invalid Low10/31/2025
v2.187.0## BUG-006: Replace Hardcoded Analysis Count with Named Constant ### Summary Code quality improvement replacing magic number "8" with named constant `ANALYSIS_COUNT` for better maintainability. ### Investigation - **Bug report claimed**: "Only 4 analyses run" - **Actual behavior**: All 8 analyses DO run correctly (complexity, provability, satd, churn, dag, tdg, big_o, dead_code) - **Real issue**: Hardcoded "8" in 2 locations - poor code maintainability ### Changes - Added `const ANALYSIS_COUNLow10/31/2025
v2.186.0# v2.186.0 - Sprint 79 Phase 2 COMPLETE: User Experience **Released**: October 31, 2025 **Sprint**: 79 Phase 2 - User Experience Bugs (4/4 complete ✅) **Status**: ✅ COMPLETE - ALL UX BUGS FIXED with 100% test coverage ## đŸŽ¯ Sprint Goals Fix all user experience bugs identified in production testing: misleading metrics, false positives, empty sections, and broken progress output. ## ✅ ALL Bugs Fixed (4/4 completed!) ### BUG-007: Function Count Always Zero ✅ - **Issue**: Shows "FunctionsLow10/31/2025
v2.185.0# v2.185.0 - Sprint 79 Phase 2: User Experience **Released**: October 31, 2025 **Sprint**: 79 Phase 2 - User Experience Bugs (3/4 complete) **Status**: ✅ THREE BUGS FIXED with 100% test coverage ## đŸŽ¯ Sprint Goals Fix user experience bugs identified in production testing, focusing on misleading metrics, false positives, and empty report sections. ## ✅ Bugs Fixed (3/3 completed) ### BUG-007: Function Count Always Zero ✅ - **Issue**: Shows "Functions: 0" despite functions present - **RoLow10/31/2025
v2.180.1# PMAT v2.180.1 - Critical Hotfix **âš ī¸ All v2.180.0 users should upgrade immediately** ## Critical Bug Fixed ### TDG Baseline Backend Issue **Problem**: TDG baseline commands failed with "Sled backend not available" error. **Root Cause**: `TieredStore::new()` was hardcoded to use the deprecated sled backend instead of libsql (default). **Files**: `server/src/tdg/storage.rs` lines 121, 128 **Impact**: - ❌ `pmat tdg baseline create` - BROKEN in v2.180.0 - ❌ `pmat tdg baseline compare` - BRLow10/29/2025
v2.179.0# PMAT v2.179.0 - Git-Commit Correlation (Sprint 65 Complete) **Release Date**: October 28, 2025 **Sprint**: 65 (Phases 1-3) **Theme**: Quality Archaeology - Link TDG metrics to git commits **Status**: ✅ Production Ready --- ## 🎉 Major Features ### Git-Commit Correlation for TDG PMAT can now track Technical Debt Grading (TDG) scores at specific git commits, enabling powerful "quality archaeology" workflows. Answer questions like: - "Which commit broke quality?" - "What's the quality delta Low10/28/2025
v2.178.0# v2.178.0 - Pre-commit Hooks: Missing Commands Implementation Resolves documentation-reality gap from [pmat-book Chapter 9](https://paiml.github.io/pmat-book/ch09-00-precommit-hooks.html). ## đŸŽ¯ What's New ### New Commands - **`pmat hooks init`** - Initialize hooks (alias for `install`) - Documented in book Chapter 9, line 40 - Matches expected user workflow from documentation - **`pmat hooks run`** - Execute hooks for CI/CD - Documented in book Chapter 9, line 421 - Supports `--aLow10/28/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

wanakuWanaku MCP Routerv0.1.3
biomcpBioMCP: Biomedical Model Context Protocolv0.8.22
turbovaultMarkdown and OFM SDK w/ MCP server that transforms your Obsidian vault into an intelligent knowledge systemv1.5.0
pipulateLocal First AI SEO Software on Nix, FastHTML & HTMXmain@2026-06-06
metorial-platformThe engine powering hundreds of thousands of MCP connections 🤖 đŸ”Ĩdev@2026-06-06

More in MCP Servers

AstrBotAgentic IM Chatbot infrastructure that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨
agentscopeBuild and run agents you can see, understand and trust.
claude-plugins-officialOfficial, Anthropic-managed directory of high quality Claude Code Plugins.
langchain4jLangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impleme