freshcrate

Search results for "dpo"

3 results found (Python)
npcpy · v1.4.21 · 🌳 Mature · ⭐ 1,287

The Python library for research and development in NLP, multimodal LLMs, Agents, ML, Knowledge Graphs, and more.

unsloth-buddy · main@2026-04-15 · 🌿 Growing · ⭐ 212

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

rag-chatbot · main@2026-04-14 · 🌿 Growing · ⭐ 402

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.