freshcrate

Search results for "inference"

Clear filters
9 results found (JavaScript)

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama

ThumbGate📁v1.14.1🌱 Seedling16

Self-improving agent governance: 👍/👎 → Pre-Action Gates that block repeat AI mistakes. Stop paying for the same mistake twice.

houtini-lm📁v2.8.0🌿 Growing71

MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek, Groq, Cerebras.

DeepCamera📁v2026.3🌳 Mature2,689

Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent — watches, understand

RAGMeUp📁scala-ui🌳 Mature676

Generic rag framework to apply the power of LLMs on any given dataset

aigne-doc-smith📁v0.9.11-beta🌱 Seedling159

AIGNE DocSmith is a powerful, AI-driven documentation generation tool built on the AIGNE Framework. It automates the creation of detailed, structured, and multi-language documentation directly from yo

models📁main@2026-04-21🌿 Growing79

This repository contains comprehensive pricing and configuration data for LLMs. It powers cost attribution for 200+ enterprises running 400B+ tokens through Portkey AI Gateway every day.

KREASYS📁main@2026-04-21🌱 Seedling2

Build and manage projects with an autonomous browser-based IDE featuring integrated multi-modal AI tools for efficient development workflows.

foundation-ai-agent📁1.0.1🌱 Seedling

12 native protocol layers for AI-agent systems. No wrappers. No SDK dependencies.