All Projects

59 projects

vllm-mlx

A vLLM-style inference server for Apple Silicon with a native MLX backend. It exposes both OpenAI- and Anthropic-compatible APIs in a single process and features unified multimodal serving, continuous batching, a paged KV cache, and SSD-tiered caching.

Multimodal, Large Language Models, Python

UncommonRoute

A local proxy that automatically routes each LLM request to the cheapest model still capable of handling it.

Model & Inference Framework, AI Agents, Large Language Models

Hyperspace AGI

The first experimental fully peer-to-peer distributed AGI system where intelligence compounds continuously through autonomous agent networks, supporting decentralized training across heterogeneous devices, P2P inference routing, and a built-in blockchain micropayment economy.

Model & Inference Framework, Multi-Agent System, AI Agents

Rapid-MLX

A local AI inference engine for Apple Silicon with an OpenAI-compatible API and support for multimodal models, tool calling, and smart cloud routing.

AI Agents, Large Language Models, Model Context Protocol

verl

A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs.

Other, Deep Learning, Multimodal

BullshitBench

A benchmark measuring whether AI models challenge nonsensical prompts rather than confidently answering them, featuring 100 questions across 5 domains with a 3-tier judgment system and multi-judge panel.

Model & Inference Framework, Natural Language Processing, Large Language Models

AutoRound

An advanced post-training quantization toolkit for LLMs and VLMs by Intel, leveraging SignRound optimization to support 2–4 bit weight quantization and automatic mixed-precision scheme generation across Intel CPU/GPU, NVIDIA GPU, and Habana Gaudi.

Multimodal, Large Language Models, Transformers

OpenJarvis

A local-first personal AI agent framework from Stanford that enables offline agent orchestration, skill import, and trace-driven continuous learning through five composable primitives, supporting 10+ inference backends and four interaction modes.

Other, Large Language Models, Model Context Protocol

Harbor

A Docker Compose-based CLI orchestrator for local LLM stacks that spins up pre-wired inference backends, frontend UIs, RAG, voice, image generation, and more with a single command.

Model & Inference Framework, Multimodal, Large Language Models

Mooncake

A KVCache-centric disaggregated architecture platform for LLM serving, providing distributed KVCache pooling, a topology-aware high-speed transfer engine, and a centralized scheduler, with support for Prefill-Decode separation and elastic MoE inference.

Large Language Models, Rust, PyTorch