Model & Inference Framework

21 projects

Skyvern

An AI Agent platform that automates browser workflows using Vision LLMs, extending Playwright with natural language commands for web automation, workflow orchestration, and structured data extraction.

Model & Inference Framework · Multimodal · AI Agents

Peekaboo

A macOS GUI automation tool powered by AI vision: it captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks, typing, and scrolls through natural language or scripted workflows.

Model & Inference Framework · Multimodal · Model Context Protocol

vllm-mlx

A vLLM-style inference server for Apple Silicon with a native MLX backend, exposing both OpenAI and Anthropic compatible APIs in a single process, featuring multimodal unified serving, continuous batching, paged KV cache, and SSD-tiered caching.

Multimodal · Large Language Models · Python

Rapid-MLX

A local AI inference engine for Apple Silicon with an OpenAI-compatible API, supporting multimodal input, tool calling, and smart cloud routing.

AI Agents · Large Language Models · Model Context Protocol

Sparrow

Production-ready structured data extraction system supporting Vision LLMs and pluggable workflow orchestration for invoices, bank statements, financial tables, and more.

Model & Inference Framework · Large Language Models · Multimodal

npcpy

A Python library providing key functional primitives for research in multimodal language models, agentic AI, and knowledge graphs, featuring unified model invocation, multi-agent collaboration and debate, knowledge graph lifecycle management, and multimodal generation.

Model & Inference Framework · Large Language Models · Multimodal

NodeTool

A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.

Model & Inference Framework · Large Language Models · Multimodal

OpenOmniBot

An on-device Android AI assistant powered by a vision-language model (VLM), supporting local model inference and screen-level automated interaction.

Model & Inference Framework · Multimodal · Model Context Protocol

AutoRound

An advanced post-training quantization toolkit for LLMs and VLMs by Intel, leveraging SignRound optimization to support 2–4 bit weight quantization and automatic mixed-precision scheme generation across Intel CPU/GPU, NVIDIA GPU, and Habana Gaudi.

Multimodal · Large Language Models · Transformers

CookHero

An LLM-powered personalized diet management platform featuring RAG hybrid retrieval, multimodal understanding, and nutrition analytics.

Model & Inference Framework · Multimodal · RAG