verl
🧠A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs
A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs
A benchmark measuring whether AI models challenge nonsensical prompts rather than confidently answering them, featuring 100 questions across 5 domains with a 3-tier judgment system and multi-judge panel.
A local-first personal AI agent framework from Stanford that enables offline agent orchestration, skill import, and trace-driven continuous learning through five composable primitives, supporting 10+ inference backends and four interaction modes.
Flexible and scalable reinforcement learning training infrastructure for embodied and agentic AI post-training, decoupling logical workflow composition from efficient physical execution via the M2Flow paradigm.
An all-in-one data preparation system for LLMs, supporting reproducible operator pipelines for data generation, cleaning, evaluation, and filtering.
An RL training environment building library for LLMs, providing complete infrastructure from development and testing to scaled rollout collection, with built-in RLVR scenarios and tool-calling support.
The Open Source AI Engineering Platform for Agents, LLMs & Models, providing experiment tracking, model registry, LLM observability, evaluation, prompt optimization, and a unified AI gateway.
A self-learning vector database integrating GNN-driven search optimization, local LLM inference, Cypher graph queries, and a PostgreSQL vector extension, deployable from WASM embeddings to Raft-distributed clusters.
An open-source framework for building, evaluating, and training general multi-agent systems. Features natural language agent creation, distributed reinforcement learning training pipeline, and complex environment interactions. Ranks top on authoritative benchmarks including GAIA, OSWorld, and VisualWebArena.
An interactive open-access textbook on Machine Learning Systems engineering from Harvard University, integrating the TinyTorch framework with hands-on edge deployment labs, covering the full spectrum from ML fundamentals to system optimization.
Page 1 / 2 · 16 total
Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.