vllm-mlx
🧠 A vLLM-style inference server for Apple Silicon with a native MLX backend, exposing both OpenAI- and Anthropic-compatible APIs in a single process, featuring unified multimodal serving, continuous batching, paged KV cache, and SSD-tiered caching.
Multimodal · Large Language Models · Python
Pipecat
✨ An open-source Python framework for real-time voice and multimodal conversational AI agents, enabling end-to-end streaming voice interaction via a composable pipeline architecture.
Multimodal · Multi-Agent System · AI Agents
Rapid-MLX
✨ A local AI inference engine for Apple Silicon with an OpenAI-compatible API, multimodal support, tool calling, and smart cloud routing.
AI Agents · Large Language Models · Model Context Protocol
vLLM-Omni
🧠 A fully disaggregated multimodal inference and serving framework that extends vLLM to support unified any-to-any modality inference and high-performance deployment.
Deep Learning · Multimodal · FastAPI
mlx-openai-server
✨ A high-performance OpenAI-compatible API server for MLX models on Apple Silicon, supporting text, vision, audio transcription, and image generation/editing.
Deep Learning · Large Language Models · Multimodal
Ghost OS
✨ A full computer-use system for AI agents on macOS, exposing 29 MCP tools for structured perception, visual grounding, synthetic input, and self-learning Recipe workflows.
Docs, Tutorials & Resources · Multimodal · Model Context Protocol
Cherry Studio
✨ A cross-platform desktop AI productivity client that unifies access to multiple LLM providers, featuring side-by-side model comparison, knowledge base building, AI image generation, and MCP extension support.
AlphaAvatar
✨ A learnable, configurable, and pluggable Omni-Avatar Assistant framework built on LiveKit, featuring real-time interaction, multimodal memory, user persona, and external tool integration.
Docs, Tutorials & Resources · RAG · Multimodal
Rodel Agent
🧠 A Windows desktop application that integrates chat, text-to-speech, text-to-image generation, and machine translation. It supports mainstream AI services and MCP server plugins, and is fully AOT-compiled.
Agent & Tooling · C# · LangChain
Agent Zero
✨ An AI agent framework for building and managing multimodal agent systems, with augmented reality capabilities and remote operation.
Agent & Tooling · Python · Agent Framework