freshcrate

Search results for "dpo"

5 results found
npcpy 📁 v1.4.21 🌳 Mature 1,287

The Python library for research and development in NLP, multimodal LLMs, agents, ML, knowledge graphs, and more.

unsloth-buddy 📁 main@2026-04-15 🌿 Growing 212

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc

awesome-prompts 📁 main@2026-04-21 🌿 Growing 7,572

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Covers prompt engineering, prompt attacks, and prompt protection, plus advanced prompt engineering papers.

memory_agent_hub 📁 main@2026-04-20 🌱 Seedling 38

2026 is the year of the swarm agent: a collection of AI agent resources covering swarm agents, agent teams, AI coding, skills, memory, evolution, agentic RL, and more.

rag-chatbot 📁 main@2026-04-14 🌿 Growing 402

RAG (retrieval-augmented generation) chatbot that answers questions using context extracted from a collection of Markdown files.