verl
🧠A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs
A benchmark measuring whether AI models challenge nonsensical prompts rather than confidently answering them, featuring 100 questions across 5 domains with a 3-tier judgment system and multi-judge panel.
An interactive open-access textbook on Machine Learning Systems engineering from Harvard University, integrating the TinyTorch framework with hands-on edge deployment labs, covering the full spectrum from ML fundamentals to system optimization.
Official code repository for the O'Reilly book "Hands-On Large Language Models". Features 12 core chapters and bonus content covering Tokens, Transformers, RAG, and Fine-tuning. Includes 300+ illustrations and runnable Jupyter Notebooks optimized for Colab and local environments.
A comprehensive AI engineering hub featuring 93+ production-ready projects with in-depth tutorials and implementations for LLMs, RAGs, AI Agents, and MCP, covering beginner to advanced skill levels.
A generative agent framework inspired by human dual-process theory, combining fast and slow thinking mechanisms with in-context reinforcement learning to efficiently solve complex interactive reasoning tasks.
A curated collection of resources for Long Chain-of-Thought (Long-CoT) reasoning in LLMs, featuring papers, implementations, and datasets to track the latest advancements in the field.
A curated collection of recent research papers on autonomous agents, focusing on both reinforcement learning-based and large language model-based approaches, helping researchers quickly understand the cutting edge of the field.
GPTSwarm is a graph-based framework for building LLM agents from graphs, enabling customized, automatic self-organization of agent swarms with self-improvement capabilities.
A curated list of research on Embodied AI and robots with Large Language Models, tracking the latest developments in the field.