DISCOVER THE FUTURE OF AI AGENTS

Other

16 projects

verl

🧠

A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs

OtherDeep LearningMultimodal

BullshitBench

A benchmark measuring whether AI models challenge nonsensical prompts rather than confidently answering them, featuring 100 questions across 5 domains with a 3-tier judgment system and multi-judge panel.

Model & Inference FrameworkNatural Language ProcessingLarge Language Models

OpenJarvis

A local-first personal AI agent framework from Stanford that enables offline agent orchestration, skill import, and trace-driven continuous learning through five composable primitives, supporting 10+ inference backends and four interaction modes.

OtherLarge Language ModelsModel Context Protocol

RLinf

Flexible and scalable reinforcement learning training infrastructure for embodied and agentic AI post-training, decoupling logical workflow composition from efficient physical execution via the M2Flow paradigm.

MultimodalAI AgentsReinforcement Learning

DataFlow

An all-in-one data preparation system for LLMs, supporting reproducible operator pipelines for data generation, cleaning, evaluation, and filtering.

Model & Inference FrameworkNatural Language ProcessingRAG

NeMo Gym

🧠

An RL training environment building library for LLMs, providing complete infrastructure from development and testing to scaled rollout collection, with built-in RLVR scenarios and tool-calling support.

OtherDeep LearningAI Agents

MLflow

The Open Source AI Engineering Platform for Agents, LLMs & Models, providing experiment tracking, model registry, LLM observability, evaluation, prompt optimization, and a unified AI gateway.

Model & Inference FrameworkSDKAI Agents

RuVector

A self-learning vector database integrating GNN-driven search optimization, local LLM inference, Cypher graph queries, and a PostgreSQL vector extension, deployable from WASM embeddings to Raft-distributed clusters.

Docs, Tutorials & ResourcesMachine LearningRAG

AWorld

An open-source framework for building, evaluating, and training general multi-agent systems. Features natural language agent creation, distributed reinforcement learning training pipeline, and complex environment interactions. Ranks top on authoritative benchmarks including GAIA, OSWorld, and VisualWebArena.

OtherMultimodalLarge Language Models

Machine Learning Systems (CS249r Book)

An interactive open-access textbook on Machine Learning Systems engineering from Harvard University, integrating the TinyTorch framework with hands-on edge deployment labs, covering the full spectrum from ML fundamentals to system optimization.

OtherDeep LearningMachine Learning
Per page

Page 1 / 2 · 16 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.