DISCOVER THE FUTURE OF AI AGENTS

All Projects

74 projects

Skyvern

An AI Agent platform that automates browser workflows using Vision LLMs, extending Playwright with natural language commands for web automation, workflow orchestration, and structured data extraction.

Model & Inference FrameworkMultimodalAI Agents

Second Brain

A local-first agentic framework acting as a personal operating system, leveraging file intelligence, event-driven workflow automation, and LLMs for cross-modal task execution and multi-platform interaction.

OtherMultimodalRAG

Peekaboo

macOS GUI automation tool powered by AI vision — captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks/types/scrolls through natural language or scripted workflows

Model & Inference FrameworkMultimodalModel Context Protocol

vllm-mlx

🧠

A vLLM-style inference server for Apple Silicon with a native MLX backend, exposing both OpenAI and Anthropic compatible APIs in a single process, featuring multimodal unified serving, continuous batching, paged KV cache, and SSD-tiered caching.

MultimodalLarge Language ModelsPython

Pipecat

An open-source Python framework for real-time voice and multimodal conversational AI agents, enabling end-to-end streaming voice interaction via composable Pipeline architecture.

MultimodalMulti-Agent SystemAI Agents

OpenMontage

The first open-source, agentic video production system with 12 structured pipelines and 52 production tools, enabling end-to-end video creation via natural language inside AI coding assistants.

Natural Language ProcessingMultimodalAI Agents

screenpipe

AI memory for your screen. Turns your computer into a personal AI by continuously recording screen and audio to build a searchable, local AI memory system.

Docs, Tutorials & ResourcesRAGMultimodal

Rapid-MLX

A local AI inference engine for Apple Silicon with OpenAI-compatible API, supporting multi-modal, tool calling, and smart cloud routing.

AI AgentsLarge Language ModelsModel Context Protocol

CyberVerse

Open-source digital human agent platform that creates real-time video-callable AI agents from a single photo, with RAG knowledge import, voice cloning, and modular plugin architecture.

Docs, Tutorials & ResourcesMultimodalRAG

Sparrow

Production-ready structured data extraction system supporting Vision LLMs and pluggable workflow orchestration for invoices, bank statements, financial tables, and more.

Model & Inference FrameworkLarge Language ModelsMultimodal
Per page
...

Page 1 / 8 · 74 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.