Search results for "acceleration"
Fast inference engine for Transformer models
A Python framework for building reactive web-apps. Developed by Plotly.
A hyperparameter optimization framework
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Nornicdb is a low-latency, Graph + Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Uses Neo4j Bolt/Cypher and qdrant's gRPC drivers so you can switch with no changes. T
No description
Every meeting, every idea, every voice note โ searchable by your AI. Open-source, privacy-first conversation memory layer.
A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Your AI assistant that never forgets and runs 100% privately on your computer. Leave it on 24/7 - it learns your preferences, helps with code, manages your health goals, searches the web, and connects
SeekStorm: vector & lexical search - in-process library & multi-tenancy server, in Rust.
A modular MCP server that provides commonly used developer tools for AI coding agents
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Enterprise-grade (40m+ lines) codebase intelligence in a zero-setup, private and local Claude Plugin or MCP: managed indexing, hybrid semantic search, polyglot code dependency graphs, and DB/API/infra
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Nuwax Agent OS - The world's first universal agent operating system, building your private vertical general-purpose agent. ้็จๆบ่ฝไฝๆไฝ็ณป็ป๏ผๆ้ ไฝ ็งๆ็ๅ็ฑป้็จๆบ่ฝไฝใๆฐไธไปฃAIๅบ็จ่ฎพ่ฎกใๅผๅใๅฎ่ทตๅนณๅฐ๏ผๆ ้ไปฃ็ ๏ผ่ฝปๆพๅๅปบ๏ผ้ๅๅ็ฑปไบบ็พค๏ผๆฏๆๅค็ง็ซฏๅๅธๅAPI๏ผๆไพๅฎๅ็
A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX bac
SQLite-Vector is a cross-platform, ultra-efficient SQLite extension that brings vector search capabilities to your embedded database.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Swift-based vector database for on-device RAG using MLTensor and MLX Embedders
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
"RAG-Anything: All-in-One RAG Framework"
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, YOLO26. LLM-powered agentic security camera agent โ watches, understand
๐ฅ Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
High-Performance Encrypted Database for .NET 10 | Embedded + gRPC Server | Vector Search โข GraphRAG โข Analytics
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Generic rag framework to apply the power of LLMs on any given dataset
One API for 20+ LLM providers, your databases, and your files โ self-hosted, open-source AI gateway with RAG, voice, and guardrails.
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, spe
A high-performance, in-memory vector database written in Rust, designed for semantic search and top-k nearest neighbor queries in AI-driven applications, with binary file persistence for durability.
Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.
Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
Dragon Brain โ persistent long-term memory for AI agents via MCP (Model Context Protocol). Knowledge graph (FalkorDB) + vector search (Qdrant) + CUDA GPU embeddings. Works with Claude, Gemini CLI, Cur
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
A curated list of awesome works related to high dimensional structure/vector search & database
Lightweight semantic code search engine โ 2-stage vector + FTS + RRF fusion + MCP server for Claude Code
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
Awesome list of AI-Driven Development.
Local AI server with persistent memory, RAG, and multi-backend inference (MLX / llama.cpp / Ollama). Runs entirely on your machine โ zero data sent to external services.
Local AI anywhere, for everyone โ LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
Control robots and physical hardware with natural language through Strands Agents.
๐ฌ 500+ curated Seedance 2.0 video generation prompts โ cinematic, anime, UGC, ads, meme styles. Includes Seedance API guides, character consistency tips, and advanced video workflows.
โก๐พ Vectro โ Compress LLM embeddings ๐ง ๐ Save memory, speed up retrieval, and keep semantic accuracy ๐ฏโจ Lightning-fast quantization for Python + Mojo, vector DB friendly ๐๏ธ, and perfect for RAG pip
A minimal, lightweight structured data store designed for small applications, scripts and automation workflows. Built for simplicity, portability and low overhead.
Nikola โ autonomous AI system based on ATPM consciousness architecture. Aria is its primary language substrate.
Riverbed Community Toolkit is a public toolkit for Riverbed Solutions engineering and integration
Wrap concurrent code in Pony reference capabilities for data-race freedom
Generate OTP supervision trees and fault-tolerance scaffolding
Extract state machines from code and model-check with TLA+/PlusCal
