Skyvern
An AI Agent platform that automates browser workflows using Vision LLMs, extending Playwright with natural language commands for web automation, workflow orchestration, and structured data extraction.
SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, focusing on large multimodal models (LMMs) like GPT-4V. It consists of a robust codebase for running web agents on live websites and an innovative framework that utilizes LMMs as generalist web agents.
An open-source multimodal AI Agent stack developed by ByteDance, comprising the general Agent TARS framework and the UI-TARS Desktop client. It enables natural language control of computers, browsers, and terminals via Vision-Language Models.