Skyvern
An AI Agent platform that automates browser workflows using Vision LLMs, extending Playwright with natural language commands for web automation, workflow orchestration, and structured data extraction.
A local-first agentic framework acting as a personal operating system, leveraging file intelligence, event-driven workflow automation, and LLMs for cross-modal task execution and multi-platform interaction.
A macOS GUI automation tool powered by AI vision: it captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks, typing, and scrolling through natural language or scripted workflows.
An open-source Python framework for real-time voice and multimodal conversational AI agents, enabling end-to-end streaming voice interaction via composable Pipeline architecture.
The first open-source, agentic video production system with 12 structured pipelines and 52 production tools, enabling end-to-end video creation via natural language inside AI coding assistants.
AI memory for your screen. Turns your computer into a personal AI by continuously recording screen and audio to build a searchable, local AI memory system.
Open-source digital human agent platform that creates real-time video-callable AI agents from a single photo, with RAG knowledge import, voice cloning, and modular plugin architecture.
Production-ready structured data extraction system supporting Vision LLMs and pluggable workflow orchestration for invoices, bank statements, financial tables, and more.
A Python library providing key functional primitives for research in multimodal language models, agentic AI, and knowledge graphs, featuring unified model invocation, multi-agent collaboration and debate, knowledge graph lifecycle management, and multimodal generation.
A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.