DISCOVER THE FUTURE OF AI AGENTS

All Projects

39 projects

Genkit

An open-source AI application building framework for full-stack apps, providing a unified SDK and developer toolchain.

Model & Inference FrameworkLarge Language ModelsSDK

vllm-mlx

🧠

A vLLM-style inference server for Apple Silicon with a native MLX backend, exposing both OpenAI and Anthropic compatible APIs in a single process, featuring multimodal unified serving, continuous batching, paged KV cache, and SSD-tiered caching.

MultimodalLarge Language ModelsPython

Rapid-MLX

A local AI inference engine for Apple Silicon with OpenAI-compatible API, supporting multi-modal, tool calling, and smart cloud routing.

AI AgentsLarge Language ModelsModel Context Protocol

Sparrow

Production-ready structured data extraction system supporting Vision LLMs and pluggable workflow orchestration for invoices, bank statements, financial tables, and more.

Model & Inference FrameworkLarge Language ModelsMultimodal

npcpy

A Python library providing key functional primitives for research in multimodal language models, agentic AI, and knowledge graphs, featuring unified model invocation, multi-agent collaboration and debate, knowledge graph lifecycle management, and multimodal generation.

Model & Inference FrameworkLarge Language ModelsMultimodal

NodeTool

A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.

Model & Inference FrameworkLarge Language ModelsMultimodal

verl

🧠

A flexible, efficient, and production-ready post-training reinforcement learning framework for LLMs

OtherDeep LearningMultimodal

OmniRoute

AI Gateway / Universal LLM Proxy providing a single OpenAI-compatible endpoint that intelligently routes to 100+ AI providers, with multimodal API support, MCP/A2A protocols, and enterprise-grade resilience.

Model & Inference FrameworkLarge Language ModelsMultimodal

AutoRound

An advanced post-training quantization toolkit for LLMs and VLMs by Intel, leveraging SignRound optimization to support 2–4 bit weight quantization and automatic mixed-precision scheme generation across Intel CPU/GPU, NVIDIA GPU, and Habana Gaudi.

MultimodalLarge Language ModelsTransformers

Harbor

🧠

A Docker Compose-based CLI orchestrator for local LLM stacks — spin up pre-wired inference backends, frontend UIs, RAG, voice, image generation, and more with a single command

Model & Inference FrameworkMultimodalLarge Language Models
Per page

Page 1 / 4 · 39 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.