Home > MCP Servers > paiml-mcp-agent-toolkit

paiml-mcp-agent-toolkit

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

agentic c deno kotlin mcp mcp-server paiml paiml-active-tool rust

Why this rank:Strong adoptionRecent releaseHealthy release cadence

Description

Pragmatic AI Labs MCP Agent Toolkit - An MCP Server designed to make code with agents more deterministic

README

PMAT

Zero-configuration AI context generation for any codebase

Installation | Usage | Features | Examples | Documentation

What is PMAT?
Installation
Usage
Features
Architecture

What is PMAT?

PMAT (Pragmatic Multi-language Agent Toolkit) provides everything needed to analyze code quality and generate AI-ready context:

Context Generation - Deep analysis for Claude, GPT, and other LLMs
Technical Debt Grading - A+ through F scoring with 6 orthogonal metrics
Mutation Testing - Test suite quality validation (85%+ kill rate)
Repository Scoring - Quantitative health assessment (0-289 scale, 11 categories)
Git History RAG - Semantic search across commit history with RRF fusion
Semantic Search - Natural language code discovery
Compliance Governance - 30+ checks across code quality, best practices, and reproducibility
Design by Contract - Toyota Way contract profiles with checkpoint validation and rescue protocols
Autonomous Kaizen - Toyota Way continuous improvement with auto-fix and commit
MCP Integration - 19 tools for Claude Code, Cline, and AI agents
Quality Gates - Pre-commit hooks, CI/CD integration, .pmat-gates.toml config
20+ Languages - Rust, TypeScript, Python, Go, Java, C/C++, Lua, Lean, and more

Part of the PAIML Stack, following Toyota Way quality principles (Jidoka, Genchi Genbutsu, Kaizen).

Annotated Code Search

pmat query "cache invalidation" --churn --duplicates --entropy --faults

Every result includes TDG grade, Big-O complexity, git churn, code clones, pattern diversity, fault annotations, call graph, and syntax-highlighted source.

Installation

# Install from crates.io
cargo install pmat

# Or from source (latest)
git clone https://github.com/paiml/paiml-mcp-agent-toolkit
cd paiml-mcp-agent-toolkit && cargo install --path .

Usage

# Generate AI-ready context
pmat context --output context.md --format llm-optimized

# Analyze code complexity
pmat analyze complexity

# Grade technical debt (A+ through F)
pmat analyze tdg

# Score repository health
pmat repo-score .

# Run mutation testing
pmat mutate --target src/

# Start MCP server for Claude Code, Cline, etc.
pmat mcp

Features

Context Generation

Generate comprehensive context for AI assistants:

pmat context                           # Basic analysis
pmat context --format llm-optimized    # AI-optimized output
pmat context --include-tests           # Include test files

Technical Debt Grading (TDG)

Six orthogonal metrics for accurate quality assessment:

pmat analyze tdg                       # Project-wide grade
pmat analyze tdg --include-components  # Per-component breakdown
pmat tdg baseline create               # Create quality baseline
pmat tdg check-regression              # Detect quality degradation

Grading Scale:

A+/A: Excellent quality, minimal debt
B+/B: Good quality, manageable debt
C+/C: Needs improvement
D/F: Significant technical debt

Mutation Testing

Validate test suite effectiveness:

pmat mutate --target src/lib.rs        # Single file
pmat mutate --target src/ --threshold 85  # Quality gate
pmat mutate --failures-only            # CI optimization

Supported Languages: Rust, Python, TypeScript, JavaScript, Go, C/C++, C#, Lua, Lean, Java, Kotlin, Ruby, Swift, PHP, Bash, SQL, Scala, YAML, Markdown + MLOps model formats (GGUF, SafeTensors, APR)

Repository Health Scoring

Evidence-based quality metrics (0-289 scale, 11 categories):

pmat rust-project-score                # Fast mode (~3 min)
pmat rust-project-score --full         # Comprehensive (~10-15 min)
pmat repo-score . --deep               # Full git history

Workflow Prompts

Pre-configured AI prompts enforcing EXTREME TDD:

pmat prompt --list                     # Available prompts
pmat prompt code-coverage              # 85%+ coverage enforcement
pmat prompt debug                      # Five Whys analysis
pmat prompt quality-enforcement        # All quality gates

Git History RAG

Search git history by intent using TF-IDF semantic embeddings:

# Fuse git history into code search
pmat query "fix memory leak" -G

# Search with churn, clones, entropy, faults
pmat query "error handling" --churn --duplicates --entropy --faults

# Run the example
cargo run --example git_history_demo

Git Hooks

Automatic quality enforcement:

pmat hooks install                     # Install pre-commit hooks
pmat hooks install --tdg-enforcement   # With TDG quality gates
pmat hooks status                      # Check hook status

Compliance Governance (`pmat comply`)

30+ automated checks across code quality, best practices, and governance:

pmat comply check                      # Run all compliance checks
pmat comply check --strict             # Exit non-zero on failure
pmat comply check --format json        # Machine-readable output
pmat comply migrate                    # Update to latest version

Key Checks:

CB-200: TDG Grade Gate — blocks on non-A functions (auto-rebuilds stale index)
CB-304: Dead code percentage enforcement
CB-400: Shell/Makefile quality via bashrs
CB-500: Rust best practices (30+ patterns)
CB-600: Lua best practices
CB-900: Markdown link validation
CB-1000: MLOps model quality

Provable-Contracts Enforcement (CB-1200..1210):

CB-1208: Binding existence — verifies binding.yaml functions exist in src/, detects ghost bindings (L0-L3 enforcement levels)
CB-1209: Contract trait enforcement — checks tests/contract_traits.rs for compiler-verified trait impls (13 kernel traits)
CB-1210: Precondition quality — flags mass-generated boilerplate and missing postconditions

Configure via .pmat.yaml:

comply:
  thresholds:
    min_tdg_grade: "A"
    pv_lint_is_error: true        # CB-1201: FAIL on pv lint failure
    min_binding_existence: 95     # CB-1208: 95% binding verification
    require_all_traits: true      # CB-1209: 13/13 traits required
    min_kani_coverage: 20         # CB-1206: minimum Kani proof %

Infrastructure Score (`pmat infra-score`)

CI/CD quality scoring (0-100 + 10 bonus for provable-contracts):

pmat infra-score                       # Text output
pmat infra-score --format json         # Machine-readable
pmat infra-score -v --failures-only    # Show only failing checks

Categories: Workflow Architecture (25pts), Build Reliability (25pts), Quality Pipeline (20pts), Deployment & Release (15pts), Supply Chain (15pts), Provable Contracts bonus (10pts).

Document Search (`pmat query --docs`)

Search documentation files (Markdown, text, YAML) alongside code:

pmat query "authentication" --docs          # Code + docs results
pmat query "deployment" --docs-only         # Only documentation
pmat query "API endpoints" --no-docs        # Exclude docs (default)

Autonomous Kaizen (`pmat kaizen`)

Toyota Way continuous improvement — scan, auto-fix, commit:

pmat kaizen --dry-run                  # Scan only (no changes)
pmat kaizen                            # Apply safe auto-fixes
pmat kaizen --commit --push            # Fix, commit, and push
pmat kaizen --format json -o report.json  # CI/CD integration

# Cross-stack mode: scan all batuta stack crates in one invocation
pmat kaizen --cross-stack --dry-run    # Scan all crates
pmat kaizen --cross-stack --commit     # Fix and commit per-crate
pmat kaizen --cross-stack -f json      # Grouped JSON report

Function Extraction (`pmat extract`)

Extract function boundaries with metadata:

pmat extract src/lib.rs                # Extract functions from file
pmat extract --list src/               # List all functions with imports and visibility

Examples

Generate Context for AI

# For Claude Code
pmat context --output context.md --format llm-optimized

# With semantic search
pmat embed sync ./src
pmat semantic search "error handling patterns"

CI/CD Integration

# Add to your CI pipeline
steps:
  - uses: actions/checkout@v4
  - run: cargo install pmat
  - run: pmat analyze tdg --fail-on-violation --min-grade B
  - run: pmat mutate --target src/ --threshold 80

Quality Baseline Workflow

# 1. Create baseline
pmat tdg baseline create --output .pmat/baseline.json

# 2. Check for regressions
pmat tdg check-regression \
  --baseline .pmat/baseline.json \
  --max-score-drop 5.0 \
  --fail-on-regression

Architecture

pmat/
├── src/
│   ├── cli/          Command handlers and dispatchers
│   ├── services/     Analysis engines (TDG, SATD, complexity, agent context)
│   ├── mcp_server/   MCP protocol server
│   ├── mcp_pmcp/     PMCP protocol integration
│   └── models/       Configuration and data models
├── examples/         89 runnable examples
└── docs/
    └── specifications/  Technical specs

Quality

Metric	Value
Tests	21,200+ passing
Coverage	99.66%
Mutation Score	>80%
Languages	20 supported + MLOps model formats
MCP Tools	19 available

Falsifiable Quality Commitments

Per Popper's demarcation criterion, all claims are measurable and testable:

Commitment	Threshold	Verification Method
Context Generation	< 5 seconds for 10K LOC project	`time pmat context` on test corpus
Memory Usage	< 500 MB for 100K LOC analysis	Measured via `heaptrack` in CI
Test Coverage	≥ 85% line coverage	`cargo llvm-cov` (CI enforced)
Mutation Score	≥ 80% killed mutants	`pmat mutate --threshold 80`
Build Time	< 3 minutes incremental	`cargo build --timings`
CI Pipeline	< 15 minutes total	GitHub Actions workflow timing
Binary Size	< 50 MB release binary	`ls -lh target/release/pmat`
Language Parsers	All 20 languages parse without panic	Fuzz testing in CI

How to Verify:

# Run self-assessment with Popper Falsifiability Score
pmat popper-score --verbose

# Individual commitment verification
cargo llvm-cov --html        # Coverage ≥85%
pmat mutate --threshold 80   # Mutation ≥80%
cargo build --timings        # Build time <3min

Failure = Regression: Any commitment violation blocks CI merge.

Benchmark Results (Statistical Rigor)

All benchmarks use Criterion.rs with proper statistical methodology:

Operation	Mean	95% CI	Std Dev	Sample Size
Context (1K LOC)	127ms	[124, 130]	±12.3ms	n=1000 runs
Context (10K LOC)	1.84s	[1.79, 1.90]	±156ms	n=500 runs
TDG Scoring	156ms	[148, 164]	±18.2ms	n=500 runs
Complexity Analysis	23ms	[22, 24]	±3.1ms	n=1000 runs

Comparison Baselines (vs. Alternatives):

Metric	PMAT	ctags	tree-sitter	Effect Size
10K LOC parsing	1.84s	0.3s	0.8s	d=0.72 (medium)
Memory (10K LOC)	287MB	45MB	120MB	-
Semantic depth	Full	Syntax only	AST only	-

See docs/BENCHMARKS.md for complete statistical analysis.

ML/AI Reproducibility

PMAT uses ML for semantic search and embeddings. All ML operations are reproducible:

Random Seed Management:

Embedding generation uses fixed seed (SEED=42) for deterministic outputs
Clustering operations use fixed seed (SEED=12345)
Seeds documented in docs/ml/REPRODUCIBILITY.md

Model Artifacts:

Pre-trained models from HuggingFace (all-MiniLM-L6-v2)
Model versions pinned in Cargo.toml
Hash verification on download

Dataset Sources

PMAT does not train models but uses these data sources for evaluation:

Dataset	Source	Purpose	Size
CodeSearchNet	GitHub/Microsoft	Semantic search benchmarks	2M functions
PMAT-bench	Internal	Regression testing	500 queries

Data provenance and licensing documented in docs/ml/REPRODUCIBILITY.md.

Sovereign Stack

PMAT is built on the PAIML Sovereign Stack - pure-Rust, SIMD-accelerated libraries:

Library	Purpose	Version
aprender	ML library (text similarity, clustering, topic modeling)	0.27.1
trueno	SIMD compute library for matrix operations	0.16.1
trueno-graph	GPU-first graph database (PageRank, Louvain, CSR)	0.1.17
trueno-rag	RAG pipeline with VectorStore	0.2.2
trueno-db	Embedded analytics database	0.3.15
trueno-viz	Terminal graph visualization	0.2.1
trueno-zram-core	SIMD LZ4/ZSTD compression (optional)	0.3.0
pmat	Code analysis toolkit	3.7.0

Key Benefits:

Pure Rust (no C dependencies, no FFI)
SIMD-first (AVX2, AVX-512, NEON auto-detection)
2-4x speedup on graph algorithms via aprender adapter

Documentation

PMAT Book - Complete guide
API Reference - Rust API docs
MCP Tools - MCP integration guide
Specifications - Technical specs
🤖 Coursera Hugging Face AI Development Specialization - Build Production AI systems with Hugging Face in Pure Rust

Contributing

See CONTRIBUTING.md for development setup, testing, and pull request guidelines.

License

MIT License - see LICENSE for details.

_{Built with Extreme TDD | Part of PAIML}

Release History

Version	Changes	Urgency	Date
v3.24.2	## MCP Agent Toolkit v3.24.2 ### Installation ```bash cargo install pmat ``` ### What's New - Template generation for Makefiles, README files, and .gitignore files - Support for Rust, Deno/TypeScript, and Python UV toolchains - Stateless MCP server with embedded templates - Claude Code integration ready ### Supported Platforms - Linux x86_64 - macOS x86_64 - macOS aarch64 (Apple Silicon)	Medium	7/5/2026
v3.21.0	# pmat v3.21.0 — Modern Agentic Coding Support (MACS, Component 32) Hardens PMAT's agents propose, receipts dispose boundary so every crossing between the stochastic side (LLM sessions, ultracode fleets) and the deterministic side (`pmat verify`, contracts, falsification receipts) is attributable, enforced, and reproducible. Implements the full [MACS spec](docs/specifications/components/modern-agentic-coding-support.md) as the totally-ordered ticket sequence MACS-000…016 (#612, #613). ## H	High	7/2/2026
v3.19.2	## [3.19.2] - 2026-06-13 Fixes two defects surfaced by the v3.19.0 self-dogfood. ### Fixed - `analyze dead-code` file count: the analyzer walked files with a raw `walkdir` that only skipped `target/`, so it descended into the hidden `.claude/worktrees/` git-worktree copies — inflating `total_files_analyzed` ~60× (263,890 vs the real ~4,224) and surfacing worktree duplicates as dead code. Both walks (`scan_for_suppression_attributes`, the line counter) now use `ignore::WalkBuilder	High	6/13/2026
v3.18.0	## `pmat verify` — CI-faithful pre-flight for autonomous agents One command that runs the gate set CI actually enforces — format, complexity, satd, clippy, tests — fail-fast, with machine-readable output (`--format json`), so an agent (e.g. Fable 5 in autonomous mode) gets "green here ⇒ green in CI" before committing. Closes the gap where the pre-commit hook and `pmat quality-gate` both miss clippy + tests. ``` edit → pmat verify --format json → self-fix on red → commit on green ``` -	High	6/11/2026
v3.17.0	## Highlights Two ship-blocking bugs landed via #564 and #565, both discovered while building a Coursera RAG-from-Zero lab. ### Fixed - `scripts/install.sh` URL pattern, tarball layout, and Linux platform default (#561 → #564). Three sub-bugs in the documented one-liner installer: 1. URL was constructed as `paiml-mcp-agent-toolkit-${PLATFORM}.tar.gz` but actual release assets are named `pmat-v${VERSION}-${PLATFORM}.tar.gz` — every install since the v3.0 rename returned 404. 2. The rele	High	5/6/2026
v3.16.0	## Highlights - 4 bug fixes: `pmat comply` default-yaml severity defect (#527), rustls-webpki RUSTSEC-2026-0104 bump (#476), incremental analysis flake (#449), entropy regex word-end bug (#396) - 80% line coverage milestone across the workspace - 354 new tests added since v3.14.0 - MCP: forward `ToolInfo` metadata for 4 `pmat_` handlers (#389) - Verification: Kani proof harnesses for pure functions (#299) - Brew*: rename `homebrew/` → `Formula/` for tap convention (#388)	High	4/27/2026
v3.14.0	## What's Changed Migrate all sovereign stack dependencies from old crate names to the unified `aprender-*` 0.30 monorepo on crates.io. ### Dependency renames \| Old name \| New name \| Version \| \|----------\|----------\|---------\| \| `trueno` \| `aprender-compute` \| 0.30 \| \| `trueno-db` \| `aprender-db` \| 0.30 \| \| `trueno-graph` \| `aprender-graph` \| 0.30 \| \| `trueno-rag` \| `aprender-rag` \| 0.30 \| \| `trueno-viz` \| `aprender-viz` \| 0.30 \| \| `trueno-zram-core` \| `aprender-zram-core` \| 0.30 \| \| `present	High	4/15/2026
v3.10.0	## What's New ### `pmat explain` Command Look up any check, metric, or grade to understand what it checks, why it matters, and how to fix failures. ```bash pmat explain CB-1210 # Explain a specific check pmat explain CB-12 # Prefix match: all CB-12xx checks pmat explain precondition # Fuzzy search in names/descriptions pmat explain # List all available checks ``` 19 check explanations across comply (CB-xxx), provable contracts (PV-xx), and TDG grades. ### Provable Contr	Medium	3/31/2026
v3.8.0	## Provable Contracts Integration Escape-proof contract enforcement pipeline across the entire sovereign AI stack. YAML contracts → build.rs env vars → `#[contract]` proc macro → `debug_assert!()` injection. Zero runtime cost in release builds. ### New Compliance Checks \| Check \| What it verifies \| \|-------\|-----------------\| \| CB-1203 \| Contract equations with preconditions have `#[contract]` macros \| \| CB-1204 \| `build.rs` emits PRE/POST env vars from YAML contracts \| \| CB-1205	Medium	3/23/2026
v3.7.0	## Highlights - RPS v3.0: New Reproducibility scorer wrapping Popper categories B-F (15 pts), falsifiability gateway (Jidoka), 11 categories / 289 max points - PMAT-510: Six scoring system improvements — Five Whys v2 evidence weights, Muda file mapping, EvoScore CB-142, `--rank-by priority`, `test --record` - Performance: -2.57 GB peak memory (deep context), -59% index build allocations, -44 MB graph clone elimination - New commands: `ci-local`, `bottleneck`, `test-stability`, `	Low	3/9/2026
v3.3.0	## What's New ### Rich `pmat extract --list` output (#217) `pmat extract --list` now returns a rich JSON object instead of a flat array, providing everything needed to split large files into compilable parts: New top-level fields: - `file` — file path - `language` — detected language (`rust`, `typescript`, `python`, `c`, `cpp`, `go`, `lua`) - `imports` — all top-level import/use statements (full text) - `cfg_test_line` — line where `#[cfg(test)]` appears (Rust only) **New per-item field:	Low	2/15/2026
v3.1.0	## What's New ### Lua Compliance Suite (CB-608 to CB-619) - 12 new Lua checks: coroutine misuse, require cycles, global protection, weak tables, test framework detection, OpenResty patterns, FFI safety, OOP patterns, type annotations - Lua dead code detection with module export awareness - Lua mutation testing adapter with busted integration - Tree-sitter TDG analyzer for Lua with full 7-component scoring ### SQL, Scala, YAML, Markdown, MLOps Compliance - CB-700: SQL best practices (N+	Low	2/14/2026
v3.0.7	## Fix #173: check_provability hardcoded score + config ignored ### Changes - Real provability analysis: `check_provability()` now calls `calculate_provability_score()` using the `LightweightProvabilityAnalyzer` instead of returning a hardcoded `0.65` - Config threshold: Added `provability_min = 0.60` to `.pmat-metrics.toml` so the threshold is read from config instead of always falling back to the 0.70 default ### Files Modified - `src/cli/analysis_utilities/quality_checks_part2.rs` —	Low	2/12/2026
v3.0.6	## What's New ### CB-600 Lua Best Practices Detection (PMAT-487) 8 new compliance checks for Lua codebases, grounded in LuaTaint, FLuaScan, Luau type system, and luacheck research: - CB-600: Implicit globals — assignment without `local` keyword - CB-601: Nil-unsafe access — chained calls on function returns, deep field chains - CB-602: pcall error handling — uncaptured/unchecked pcall/xpcall - CB-603: Deprecated/dangerous API — `os.execute()`, `loadstring()`, `setfenv()` - **CB	Low	2/12/2026
v3.0.4	## What's New ### Full Lua Language Support (PMAT-486) - tree-sitter-lua 0.2.0 AST parsing: functions, `require()` imports, table constructors, control flow - Language detection, indexing, function naming conventions (snake_case) - Context generation, simple_deep_context, complexity analysis - Lua analysis example: `cargo run --example lua_analysis` - pmat-book Chapter 13 updated with Lua documentation ### CB-081 Sovereign Threshold Compliance - Full `pmat comply` compliance achieved ## B	Low	2/10/2026
v3.0.1	## Bug Fixes - Fix context lines (-A/-B/-C) for workspace queries: Context line display now resolves workspace sibling paths (e.g., `trueno/`, `aprender/`). Previously, workspace function results produced empty output when using `-A`, `-B`, or `-C` flags. - Fix --exclude-tests in raw mode: The `--exclude-tests` flag now works correctly with `--raw`, filtering out test files by path pattern. - Fix enrich_coverage ANDON: Coverage enrichment dropped from 17-25s to ~1ms on cache hit v	Low	2/8/2026
v3.0.0	## PMAT v3.0.0 ### Highlights - Git History RAG Pipeline: Semantic search across git commit history using TF-IDF embeddings - Reciprocal Rank Fusion (RRF): Merge code search and git history results for better relevance - Incremental Git Sync: SQLite-backed commit index with O(1) sync detection - 95.30% Test Coverage: Up from 92.02% with ~100 new tests ### New Features - `pmat query "intent" -G` - Fuse git commit history into code search via RRF - `pmat query "intent" --churn`	Low	2/5/2026
v2.217.0	## New Features ### Struct/Type Definition Search - New `--type` filter for `pmat query`: Search by definition type (fn, struct, enum, trait, type) - DefinitionType enum: Tracks what kind of definition is indexed - Extended tree-sitter chunker: Extracts struct/enum/trait/type definitions in addition to functions ### Usage Examples ```bash # Search for all structs matching a pattern pmat query "Config" --type struct --limit 10 # Search for enums pmat query "Error" --type enum # Se	Low	2/5/2026
v2.216.0	## Agent Context Query Improvements (PMAT-470) ### 5 Query Engine Improvements 1. Persisted corpus — Index v1.1.0 saves pre-computed corpus in the LZ4 blob, eliminating 80ms of rebuild on load 2. Incremental index updates — SHA256 file checksums detect unchanged files; only re-parses modified/new files (4.5s → 200ms for single-file edits) 3. Generic name demotion — Functions named `new`, `default`, `from` get frequency-based scoring penalty (floor at 0.3x) 4. Scope-aware search	Low	2/4/2026
v2.214.0	## Popperian Falsification Quality Enforcement This release implements the Work Contract system with evidence-based quality enforcement based on Popperian falsification epistemology. ### Key Features - Work Contract: Captures immutable baseline at `pmat work start` (git SHA, TDG, coverage, file manifest) - 13 Falsifiable Claims: Evidence-based validation at `pmat work complete` - Manifest integrity (no deleted files) - Meta-falsification (the falsifier tests itself) - Coverage g	Low	1/25/2026
v2.213.17	## Changes ### Sovereign Stack Updates - aprender: 0.24.0 → 0.24.1 - trueno: 0.11.0 → 0.14.0 (major SIMD improvements) - trueno-db: 0.3.10 → 0.3.11 - trueno-graph: 0.1.10 → 0.1.11 - trueno-rag: 0.1.8 → 0.1.10 - trueno-viz: 0.1.17 → 0.1.20 ### Bug Fixes - Fixed Unicode boundary panic in `pmat work status` (#128) - Fixed Unicode panic in `pmat qa spec` (#120) - Improved error message for invalid phases format in roadmap.yaml (#130) ### Issue Triage - Closed all 32 open G	Low	1/24/2026
v2.213.16	## Changes ### Bug Fixes - TDG: Include file paths in violation output (#tdg-violation-paths) - CLI: Show full relative paths in complexity analysis output - "Top Files by Complexity" now shows full path, not just filename - "Top Complexity Hotspots" shows `func` file:line format on one line - CLI: Remove internal ticket references from user-facing help text ### Improvements - Refactored examples to use const strings, reducing complexity from 70 to ~5 ### Full Changelog https:	Low	1/24/2026
v2.213.15	## Highlights - A+ Quality Score (95.8%) achieved through improved Known Defects scoring - Security: Updated lru crate 0.14→0.16 (RUSTSEC-2026-0002) ## Fixed - Known Defects Scorer: Fixed false positives from test files - Now detects `#[cfg(all(test, ...))]` patterns - Expanded filename pattern detection (`_tests_part.rs`, `_coverage_tests.rs`, etc.) - Production unwrap count: 1361 → 219 (accurate) - Production unwrap() → expect(): 24 production calls converted with	Low	1/22/2026
v2.213.14	## Fixed ### CB-021 SIMD False Positives - Eliminated 26 false positive warnings in compliance checker - Used `concat!()` macro to split pattern strings in detection code - Prevents compliance checker from flagging its own pattern definitions - Affected files: comply_handlers.rs, cuda_simd.rs, gpu_simd_scorer.rs ### CB-BUDGET False Positives - Improved ComputeBrick detection precision - Now only flags `impl ComputeBrick` trait implementations - Ignores structs like `BrickStats` that have "Br	Low	1/21/2026
v2.213.13	## CB-020 Detection Improvement Fixes false positives for multi-line SAFETY comments in `pmat comply check`. ### Changes - Now checks up to 10 lines back (was 3) to find SAFETY comments - Supports `/ SAFETY:` doc comment style in addition to `// SAFETY:` - Eliminates false positives where SAFETY comments span multiple lines ### Install ```bash cargo install pmat ```	Low	1/21/2026
v2.213.12	## File Health Enforcement System (CB-040) Prevents untestable large files through evidence-based enforcement. ### Features - File Health Check in `pmat comply check` - Detects files exceeding 500 lines (new) or 2000 lines (critical) - Calculates Test-to-Lines Ratio (TLR) with scaling requirements - Computes File Health Score (0-100) with letter grades (A+ to F) - Pre-commit Hook with ratchet mechanism - New files must be < 500 lines - Existing files cannot grow (Toyota Way	Low	1/21/2026
v2.213.11	## 🔧 Critical Fix: `cargo install` Build Time Problem: `cargo install pmat` was hanging for hours at the final linking stage. Root Cause (Five Whys Analysis): - `lto = "fat"` forced single-threaded recompilation of all 593 crates - `codegen-units = 1` prevented any parallelism in the link stage - Combined effect: catastrophic build times on consumer hardware Solution: \| Setting \| Before \| After \| Impact \| \|---------\|--------\|-------\|--------\| \| `lto` \| `"fat"` \| `"thin"` \| ~10x f	Low	1/15/2026
v2.213.8	## What's New ### Dependency Optimization - Feature-gated `git2` behind `git-lib` feature (~67 deps saved when not enabled) - Feature-gated `octocrab` behind `github-api` feature (~255 deps saved) - Default build now has minimal dependencies for faster installs ### Feature Flags - `git-lib`: Use libgit2 for git operations (faster, more features) - `github-api`: GitHub API integration via octocrab - `analytics-simd`: SIMD-accelerated analytics ### Build Quality - ZERO bashrs Makefile lint warn	Low	1/14/2026
v2.213.5	## What's New ### Test Coverage Improvements - 130+ new tests added across multiple files - Overall coverage: 86.22% - Added comprehensive tests for: - `command_suggestions.rs`: Levenshtein distance, command validation - `storage_backend.rs`: LibsqlBackend, InMemoryBackend - `resource_control.rs`: Serialization, pressure levels - `event_store.rs`: Refactored with batuta patterns ### Bug Fixes - Fixed `test_git_config_defaults` to match zero-branching policy - Fixed `test_full_c	Low	1/12/2026
v2.213.4	Full Changelog: https://github.com/paiml/paiml-mcp-agent-toolkit/compare/v2.213.3...v2.213.4	Low	1/9/2026
v2.213.3	Full Changelog: https://github.com/paiml/paiml-mcp-agent-toolkit/compare/v2.213.2...v2.213.3	Low	1/9/2026
v2.213.1	## 🔒 Security Fix - CVE-2025-64718: Fixed prototype pollution vulnerability in js-yaml dependency (bridge/package-lock.json) ## 🛡️ Production Code Hardening - 234 `.unwrap()` calls eliminated: Converted all production code `.unwrap()` to `.expect("internal error")` for better panic messages - Based on Cloudflare outage analysis (2025-11-18) - bare `.unwrap()` causes uninformative panics in production ## ✨ New Features (since v2.210.0) ### 200-Point Perfection Score System - Compre	Low	12/29/2025
v2.210.0	## New Features ### Oracle Module (Toyota Way PDCA) - Implements Plan-Do-Check-Act continuous improvement cycle - Signal collectors: rustc, clippy, cargo test integration - Convergence detection with configurable targets - Andon principle: stagnation detection and regression prevention - Auto-apply fixes with confidence thresholds ### Rich Reporter Framework - Data science analysis: K-Means clustering, PageRank centrality - Louvain community detection for related defects - Z-score anomaly dete	Low	12/7/2025
v2.209.0	## v2.209.0 - Pure WASM Dashboard ### Features - pmat-dashboard: New pure Rust/WASM dashboard crate built with Presentar framework - 81% bundle size reduction: From 3.1 MB (JavaScript) to ~574 KB (WASM) - WCAG 2.1 AA accessibility: Built-in contrast ratio validation - New widgets: HotspotTable, MetricsChart, DagDiagram, DashboardButton - WebSocket binary protocol: Replaces Server-Sent Events for real-time updates - 60fps GPU-accelerated rendering: Via WebGPU/trueno-viz	Low	12/6/2025
v2.208.0	## What's New ### O(1) Coverage Caching (bashrs-style) - `make coverage-fast` - 8ms cache hit using git tree hash - `make coverage-full` - Full coverage with slow tests (CI/nightly) - `make coverage-invalidate` - Clear cache when needed ### Philosophy Speed > vanity metrics. Fast tests for dev workflow, full tests for CI. ### Coverage - Fast coverage: ~60% (dev workflow, 8ms) - Full coverage: ~75-80% (CI/nightly, 30+ min) --- 🤖 Generated with [Claude Code](https://claude.com/claude-code	Low	12/6/2025
v2.207.0	## v2.207.0 - Pure Rust Semantic Search ### Highlights Zero API Keys Required - Semantic search now runs completely offline using pure Rust implementations. ### New Features - Pure Rust Semantic Search Stack - `trueno-rag` (0.1.2): RAG pipeline with hybrid retrieval, RRF fusion - `trueno-graph` (0.1.2): PageRank, BFS, Louvain clustering - `aprender` (0.14.0): TF-IDF, LDA, K-means, DBSCAN clustering - Commands - `pmat analyze topics --num-topics N` - Extract semantic topi	Low	12/2/2025
v2.206.0	## pmat v2.206.0 ### Major Updates - Dependencies: Updated tokio to 1.48, octocrab to 0.48 - Transitive deps: aprender 0.7.1, pmcp 1.8.6, trueno-db 0.3.4, trueno-graph 0.1.2 ### Features (since 2.205.0) - `pmat test-discovery` phases 2-4: categorization, marking, verification (#98) - `pmat comply` command handlers for compliance checking (#96) - `pmat qa-work` Toyota Way QA validation (#102) - `pmat maintain cleanup-resources` command (#86) - `pmat maintain bug-report` command (#81) -	Low	12/2/2025
v2.205.0	## 🚀 PMAT v2.205.0 ### Highlights - trueno-graph Integration: O(1) symbol lookups and PageRank scoring in context generation and TDG analysis - unwrap() False Positive Fix: Eliminated doc comment and test code false positives in rust-project-score (#99, #100) - O(1 Quality Gates: <10ms pre-commit validation with realistic thresholds ### Bug Fixes - Fix unwrap() false positives - exclude doc comments and `#[cfg(test)]` modules - Fix O(1 quality gate thresholds (lint: 150s, test-f	Low	11/25/2025
v2.201.0	## Features ### File Locking for Multi-Agent Safety - Cross-platform file locking using `fs2` crate (Linux, macOS, Windows) - Exclusive locks for write operations prevent data loss - Shared locks for read operations allow concurrent readers - Atomic read-modify-write operations eliminate race conditions - RAII pattern ensures locks always released ### Multi-Agent Workflow Support - ✅ 10 concurrent agents can write simultaneously - ✅ Zero data loss - all concurrent writes succeed - ✅ Cross-proc	Low	11/22/2025
v2.200.1	## Release v2.200.1 ### Summary This bugfix release addresses [GitHub issue #84](https://github.com/paiml/paiml-mcp-agent-toolkit/issues/84) by allowing extra fields in roadmap YAML files to support backward compatibility with older roadmap formats. ### Changes - Fix: Remove `#[serde(deny_unknown_fields)]` attribute from `RoadmapItem` struct to allow extra YAML fields - Test: Add `test_trueno_db_yaml_format_with_extra_fields` to verify backward compatibility - Version: Bump version	Low	11/21/2025
v2.200.0	## Known Defects v2.1: Zero-Tolerance Defect Detection ### 🛡️ New Features TDG Auto-Fail on Critical Defects - TDG automatically fails (exit code 1) when critical defects detected - Score: 0.0/100, Grade: F - Zero tolerance for production-breaking patterns New Command: `pmat analyze defects` - Project-wide critical defect scanning - Multiple output formats: text, JSON, JUnit XML - Exit code 1 for CI/CD integration - Comprehensive test exclusion ### 🔴 Critical Defect Patterns **`.u	Low	11/21/2025
v2.198.0	# Release v2.198.0 - Unified GitHub/YAML Workflow System ## Major Features ### Complete Workflow Management (Issue #75) - 5 New Commands: `pmat work init`, `start`, `continue`, `complete`, `status` - Hybrid Architecture: Write-through to both GitHub Issues and YAML roadmaps - GitHub Integration: Fetch issue metadata, extract acceptance criteria, auto-link - Quality Gates: Automatic validation with `cargo test` and `cargo clippy` - Pre-commit Hooks: Commit message validation	Low	11/19/2025
v2.197.0	## 🦀 Rust Project Score v1.2 - Formal Verification ### Highlights - New Category: Formal Verification (8 points max) - Miri Integration (3 points): Undefined behavior detection for unsafe Rust - Kani Formal Verification (5 points): Mathematical proof of correctness using CBMC - Total Points: 114 (up from 106 in v1.1) - Toyota Way Principles: Jidoka (自働化 - stop on UB), Genchi Genbutsu (empirical evidence), Kaizen (改善) ### New Category: Formal Verification (8 points) ##	Low	11/19/2025
v2.195.0	## 🚀 Release v2.195.0 ### Added #### Workflow Prompts: release-prep - Multi-language release preparation with comprehensive quality gates - Supports Rust, Python, TypeScript, and Go with variable substitution - Integrates Toyota Way principles (Jidoka, Andon Cord, Genchi Genbutsu, Kaizen) - Rollback procedures for emergency hotfixes - Location: `server/prompts/release-prep.yaml` #### Workflow Prompts: code-coverage v3.0 - Upgraded with compiler-grade quality standards - Research validation f	Low	11/14/2025
v2.194.1	## Release v2.194.1 ### Changes - Updated Makefile targets to match bashrs style exactly - Fixed clippy warnings (too_many_arguments) - Marked flaky test as #[ignore] with Five Whys analysis - Improved test-fast and coverage targets with two-phase approach ### Quality Gates ✅ - pmat-book validation: PASSING - Clippy: 0 warnings - Tests: 4653 passed, 0 failed, 187 ignored - Repository: Clean ### Installation ```bash cargo install pmat ``` ### Documentation https://paiml.github.io/pmat-book/	Low	11/12/2025
v2.194.0	## 🎯 Workflow Prompts for EXTREME TDD This release adds the `pmat prompt` command to show pre-configured workflow prompts that enforce EXTREME TDD and Toyota Way quality principles. ### ✨ New Features `pmat prompt` Command - 11 embedded workflow prompts (code-coverage, debug, continue, quality-enforcement, etc.) - Multiple output formats: YAML, JSON, text - Variable substitution with `${VAR}` syntax - `--list` flag to see all available prompts - `--show-variables` to see prompt variables	Low	11/10/2025
v2.193.0	## 🎉 Repository Health Scoring System Complete implementation of `pmat repo-score` - quantitative repository health assessment. ### ✨ New Feature: `pmat repo-score` Score your repository on a 0-110 scale (100 base + 10 bonus points) across 6 quality categories: 📊 Base Categories (100 points): - A: Documentation (20 pts) - README accuracy and comprehensiveness - B: Pre-commit Hooks (20 pts) - Hook presence and performance - C: Repository Hygiene (10 pts) - No cruft or te	Low	11/10/2025
v2.192.0	## 🎉 Sprint 81: Issue #53 Complete - MCP Tool Placeholder Elimination (16/16, 100%) All 16 MCP functions now use real services - placeholder elimination complete! ### Batch 5: Advanced Analysis MCP Functions (Final Batch) This release completes Issue #53 by replacing the final 4 MCP tool placeholder functions with real service integrations: #### New Functions 1. `analyze_lint_hotspots` - Find quality hotspots via TDG analysis - TDG-based quality scoring with letter grades (A+ to	Low	11/1/2025
v2.189.0	## BUG-010: Silence Noisy Warnings in SATD Analysis ### Summary Fixed confusing warnings that interleaved with progress indicators during context generation on large projects. ### Issue When running `pmat context` on large projects (e.g., Ceph), warnings were: - Interleaved with progress indicators - Truncated: "Parameter validation failed: l" (incomplete messages) - Confusing: "Warning: Error processing file" (contradictory) Example output (BROKEN): ``` ⠙ Running parallel analyse	Low	10/31/2025
v2.188.0	## BUG-001-003: Fixed Embed Command Errors and Examples ### Summary Fixed three related bugs in the `pmat embed` subcommand that prevented users from using embed functionality. ### Issues Fixed BUG-001: `pmat embed status` Invalid Format Error ✅ - Issue: Showed `invalid value 'summary' for '--format <FORMAT>'` error - Cause: Default value "summary" but OutputFormat only has Table/Json/Yaml - Fix: Changed default from "summary" → "table" **BUG-002: `pmat embed sync` Invalid	Low	10/31/2025
v2.187.0	## BUG-006: Replace Hardcoded Analysis Count with Named Constant ### Summary Code quality improvement replacing magic number "8" with named constant `ANALYSIS_COUNT` for better maintainability. ### Investigation - Bug report claimed: "Only 4 analyses run" - Actual behavior: All 8 analyses DO run correctly (complexity, provability, satd, churn, dag, tdg, big_o, dead_code) - Real issue: Hardcoded "8" in 2 locations - poor code maintainability ### Changes - Added `const ANALYSIS_COUN	Low	10/31/2025
v2.186.0	# v2.186.0 - Sprint 79 Phase 2 COMPLETE: User Experience Released: October 31, 2025 Sprint: 79 Phase 2 - User Experience Bugs (4/4 complete ✅) Status: ✅ COMPLETE - ALL UX BUGS FIXED with 100% test coverage ## 🎯 Sprint Goals Fix all user experience bugs identified in production testing: misleading metrics, false positives, empty sections, and broken progress output. ## ✅ ALL Bugs Fixed (4/4 completed!) ### BUG-007: Function Count Always Zero ✅ - Issue: Shows "Functions	Low	10/31/2025
v2.185.0	# v2.185.0 - Sprint 79 Phase 2: User Experience Released: October 31, 2025 Sprint: 79 Phase 2 - User Experience Bugs (3/4 complete) Status: ✅ THREE BUGS FIXED with 100% test coverage ## 🎯 Sprint Goals Fix user experience bugs identified in production testing, focusing on misleading metrics, false positives, and empty report sections. ## ✅ Bugs Fixed (3/3 completed) ### BUG-007: Function Count Always Zero ✅ - Issue: Shows "Functions: 0" despite functions present - **Ro	Low	10/31/2025
v2.180.1	# PMAT v2.180.1 - Critical Hotfix ⚠️ All v2.180.0 users should upgrade immediately ## Critical Bug Fixed ### TDG Baseline Backend Issue Problem: TDG baseline commands failed with "Sled backend not available" error. Root Cause: `TieredStore::new()` was hardcoded to use the deprecated sled backend instead of libsql (default). Files: `server/src/tdg/storage.rs` lines 121, 128 Impact: - ❌ `pmat tdg baseline create` - BROKEN in v2.180.0 - ❌ `pmat tdg baseline compare` - BR	Low	10/29/2025
v2.179.0	# PMAT v2.179.0 - Git-Commit Correlation (Sprint 65 Complete) Release Date: October 28, 2025 Sprint: 65 (Phases 1-3) Theme: Quality Archaeology - Link TDG metrics to git commits Status: ✅ Production Ready --- ## 🎉 Major Features ### Git-Commit Correlation for TDG PMAT can now track Technical Debt Grading (TDG) scores at specific git commits, enabling powerful "quality archaeology" workflows. Answer questions like: - "Which commit broke quality?" - "What's the quality delta	Low	10/28/2025
v2.178.0	# v2.178.0 - Pre-commit Hooks: Missing Commands Implementation Resolves documentation-reality gap from [pmat-book Chapter 9](https://paiml.github.io/pmat-book/ch09-00-precommit-hooks.html). ## 🎯 What's New ### New Commands - `pmat hooks init` - Initialize hooks (alias for `install`) - Documented in book Chapter 9, line 40 - Matches expected user workflow from documentation - `pmat hooks run` - Execute hooks for CI/CD - Documented in book Chapter 9, line 421 - Supports `--a	Low	10/28/2025

Dependencies & License Audit

Loading dependencies...

Similar Packages

wanakuWanaku MCP Routerv0.2.0

turbovaultMarkdown and OFM SDK w/ MCP server that transforms your Obsidian vault into an intelligent knowledge systemv1.6.0

biomcpBioMCP: Biomedical Model Context Protocolv0.8.25

vestigeCognitive memory for AI agents — FSRS-6 spaced repetition, 29 brain modules, 3D dashboard, single 22MB Rust binary. MCP server for Claude, Cursor, VS Code, Xcode, JetBrains.v2.3.0

meerkatMeerkat - A modular, high-performance agent harness built in Rust.v0.8.7

More in MCP Servers

supersetCode Editor for the AI Agents Era - Run an army of Claude Code, Codex, etc. on your machine

kreuzbergA polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python

ai-engineering-from-scratchLearn it. Build it. Ship it for others.

CodeGraphContextAn MCP server plus a CLI tool that indexes local code into a graph database to provide context to AI assistants.