DISCOVER THE FUTURE OF AI AGENTS

All Projects

7 projects

Peekaboo

macOS GUI automation tool powered by AI vision — captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks/types/scrolls through natural language or scripted workflows

Model & Inference FrameworkMultimodalModel Context Protocol

Project N.E.K.O.

A proactive AI desktop companion platform featuring multimodal dialogue, a three-layer memory system, Agent-driven automation, Live2D/VRM/MMD multi-form avatars, and a Steam Workshop UGC ecosystem.

MultimodalAI AgentsElectron

Ghost OS

Full computer-use system for AI agents on macOS, exposing 29 MCP tools for structured perception, visual grounding, synthetic input, and self-learning Recipe workflows.

Docs, Tutorials & ResourcesMultimodalModel Context Protocol

RCLI

A fully on-device voice AI assistant for macOS Apple Silicon, integrating STT, LLM, TTS, VLM, RAG, and system control with zero cloud dependency.

Model & Inference FrameworkLarge Language ModelsMultimodal

ScreenAgent

A computer control agent driven by visual language large models that enables AI to interact with GUIs by observing screenshots and outputting mouse and keyboard operations, completing multi-step tasks.

Agent & ToolingPythonPyTorch

autoMate

An AI-powered local automation tool that uses natural language to make computers work autonomously, understanding screen content and performing operations like humans, without requiring programming knowledge for complex automation workflows.

Agent & ToolingPythonAI Agents

UI-TARS-desktop

An open-source multimodal AI Agent stack developed by ByteDance, comprising the general Agent TARS framework and the UI-TARS Desktop client. It enables natural language control of computers, browsers, and terminals via Vision-Language Models.

Agent & ToolingTypeScriptNode.js
Per page

Page 1 / 1 · 7 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.