vllm-mlx
🧠A vLLM-style inference server for Apple Silicon with a native MLX backend, exposing both OpenAI- and Anthropic-compatible APIs from a single process, featuring unified multimodal serving, continuous batching, a paged KV cache, and SSD-tiered caching.
Multimodal · Large Language Models · Python
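Serving both API dialects from one process means the same local host accepts both request shapes. A minimal sketch of the two payloads, assuming the server listens on localhost:8000 and mounts the standard paths (`/v1/chat/completions` for the OpenAI dialect, `/v1/messages` for the Anthropic one); the port and model id are assumptions, not part of the project's documented defaults.

```python
import json

BASE = "http://localhost:8000"  # assumed port; check the server's own docs

# OpenAI-style chat completion request
openai_req = {
    "url": f"{BASE}/v1/chat/completions",
    "body": {
        "model": "mlx-community/Llama-3.2-3B-Instruct-4bit",  # hypothetical model id
        "messages": [{"role": "user", "content": "Hello"}],
    },
}

# Anthropic-style Messages request against the same process
anthropic_req = {
    "url": f"{BASE}/v1/messages",
    "body": {
        "model": "mlx-community/Llama-3.2-3B-Instruct-4bit",
        "max_tokens": 256,  # required field in the Anthropic Messages API
        "messages": [{"role": "user", "content": "Hello"}],
    },
}

# Both bodies serialize to the wire format the respective clients expect.
print(json.dumps(openai_req["body"]))
print(json.dumps(anthropic_req["body"]))
```

The practical upshot is that existing OpenAI and Anthropic SDKs can each point their base URL at the same local process.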
UncommonRoute
✨A local proxy that automatically routes each LLM request to the cheapest model that is still capable of handling it.
Model & Inference Framework · AI Agents · Large Language Models
Hyperspace AGI
✨An experimental, fully peer-to-peer distributed AGI system in which autonomous agent networks compound intelligence continuously, supporting decentralized training across heterogeneous devices, P2P inference routing, and a built-in blockchain micropayment economy.
Model & Inference Framework · Multi-Agent System · AI Agents
Rapid-MLX
✨A local AI inference engine for Apple Silicon with an OpenAI-compatible API, supporting multimodal inputs, tool calling, and smart cloud routing.
AI Agents · Large Language Models · Model Context Protocol
OpenJarvis
✨A local-first personal AI agent framework from Stanford that enables offline agent orchestration, skill import, and trace-driven continuous learning through five composable primitives, supporting 10+ inference backends and four interaction modes.
Other · Large Language Models · Model Context Protocol
vLLM-Omni
🧠A fully disaggregated multimodal inference and serving framework that extends vLLM with unified any-to-any multimodal inference and high-performance deployment.
Deep Learning · Multimodal · FastAPI
Harbor
🧠A Docker Compose-based CLI orchestrator for local LLM stacks: spin up pre-wired inference backends, frontend UIs, RAG, voice, image generation, and more with a single command.
Model & Inference Framework · Multimodal · Large Language Models
Mooncake
✨A KVCache-centric disaggregated serving platform for LLMs, providing distributed KVCache pooling, a topology-aware high-speed transfer engine, and a centralized scheduler, with support for prefill-decode separation and elastic MoE inference.
Large Language Models · Rust · PyTorch
llama.cpp
✨LLM inference in C/C++ delivering state-of-the-art performance locally or in the cloud with minimal setup, via the GGUF format and multi-hardware backend support.
Large Language Models · Python · CLI
mlx-openai-server
✨A high-performance OpenAI-compatible API server for MLX models on Apple Silicon, supporting text, vision, audio transcription, and image generation/editing.
Deep Learning · Large Language Models · Multimodal
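Because the server speaks the OpenAI wire format, any OpenAI-style client can target it by overriding the base URL. A stdlib-only sketch that builds (but does not send) a chat completion request; the port and model id here are assumptions for illustration, not the project's documented defaults.

```python
import json
import urllib.request

def chat_request(prompt: str, base: str = "http://localhost:8000/v1"):
    """Build an OpenAI-style chat completion request for a local MLX server.

    The base URL and model id are hypothetical. The request object is only
    constructed here, so this runs without a server; send it with
    urllib.request.urlopen(req) once the server is up.
    """
    body = json.dumps({
        "model": "mlx-community/Qwen2.5-7B-Instruct-4bit",  # hypothetical model id
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        f"{base}/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = chat_request("Describe Apple Silicon in one sentence.")
print(req.get_method(), req.full_url)
```

The same pattern covers the server's other documented capabilities (vision, transcription, image generation) by swapping the endpoint path and payload fields.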