DISCOVER THE FUTURE OF AI AGENTS

Agent & Tooling

57 projects

Skyvern

An AI Agent platform that automates browser workflows using Vision LLMs, extending Playwright with natural language commands for web automation, workflow orchestration, and structured data extraction.

Model & Inference FrameworkMultimodalAI Agents

Second Brain

A local-first agentic framework acting as a personal operating system, leveraging file intelligence, event-driven workflow automation, and LLMs for cross-modal task execution and multi-platform interaction.

OtherMultimodalRAG

Peekaboo

macOS GUI automation tool powered by AI vision — captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks/types/scrolls through natural language or scripted workflows

Model & Inference FrameworkMultimodalModel Context Protocol

Pipecat

An open-source Python framework for real-time voice and multimodal conversational AI agents, enabling end-to-end streaming voice interaction via composable Pipeline architecture.

MultimodalMulti-Agent SystemAI Agents

OpenMontage

The first open-source, agentic video production system with 12 structured pipelines and 52 production tools, enabling end-to-end video creation via natural language inside AI coding assistants.

Natural Language ProcessingMultimodalAI Agents

screenpipe

AI memory for your screen. Turns your computer into a personal AI by continuously recording screen and audio to build a searchable, local AI memory system.

Docs, Tutorials & ResourcesRAGMultimodal

CyberVerse

Open-source digital human agent platform that creates real-time video-callable AI agents from a single photo, with RAG knowledge import, voice cloning, and modular plugin architecture.

Docs, Tutorials & ResourcesMultimodalRAG

Sparrow

Production-ready structured data extraction system supporting Vision LLMs and pluggable workflow orchestration for invoices, bank statements, financial tables, and more.

Model & Inference FrameworkLarge Language ModelsMultimodal

npcpy

A Python library providing key functional primitives for research in multimodal language models, agentic AI, and knowledge graphs, featuring unified model invocation, multi-agent collaboration and debate, knowledge graph lifecycle management, and multimodal generation.

Model & Inference FrameworkLarge Language ModelsMultimodal

NodeTool

A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.

Model & Inference FrameworkLarge Language ModelsMultimodal
Per page

Page 1 / 6 · 57 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.