Peekaboo
A macOS GUI automation tool powered by AI vision: it captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks, typing, and scrolling through natural language or scripted workflows.
A proactive AI desktop companion platform featuring multimodal dialogue, a three-layer memory system, Agent-driven automation, Live2D/VRM/MMD multi-form avatars, and a Steam Workshop UGC ecosystem.
Full computer-use system for AI agents on macOS, exposing 29 MCP tools for structured perception, visual grounding, synthetic input, and self-learning Recipe workflows.
A fully on-device voice AI assistant for macOS Apple Silicon, integrating STT, LLM, TTS, VLM, RAG, and system control with zero cloud dependency.
A computer-control agent driven by vision-language models that lets AI interact with GUIs by observing screenshots and outputting mouse and keyboard operations to complete multi-step tasks.
An AI-powered local automation tool that takes natural-language instructions, understands screen content, and performs operations as a human would, so complex automation workflows require no programming knowledge.
An open-source multimodal AI Agent stack developed by ByteDance, comprising the general Agent TARS framework and the UI-TARS Desktop client. It enables natural language control of computers, browsers, and terminals via Vision-Language Models.