DISCOVER THE FUTURE OF AI AGENTS

All Projects

5 projects

Peekaboo

macOS GUI automation tool powered by AI vision — captures screenshots, detects UI elements via multi-provider LLMs, and executes clicks/types/scrolls through natural language or scripted workflows

Model & Inference FrameworkMultimodalModel Context Protocol

OpenMontage

The first open-source, agentic video production system with 12 structured pipelines and 52 production tools, enabling end-to-end video creation via natural language inside AI coding assistants.

Natural Language ProcessingMultimodalAI Agents

NodeTool

A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.

Model & Inference FrameworkLarge Language ModelsMultimodal

RCLI

A fully on-device voice AI assistant for macOS Apple Silicon, integrating STT, LLM, TTS, VLM, RAG, and system control with zero cloud dependency.

Model & Inference FrameworkLarge Language ModelsMultimodal

UI-TARS-desktop

An open-source multimodal AI Agent stack developed by ByteDance, comprising the general Agent TARS framework and the UI-TARS Desktop client. It enables natural language control of computers, browsers, and terminals via Vision-Language Models.

Agent & ToolingTypeScriptNode.js
Per page

Page 1 / 1 · 5 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.