DISCOVER THE FUTURE OF AI AGENTS

All Projects

15 projects

CyberVerse

Open-source digital human agent platform that creates real-time video-callable AI agents from a single photo, with RAG knowledge import, voice cloning, and modular plugin architecture.

Docs, Tutorials & Resources · Multimodal · RAG

NodeTool

A node-based visual AI workflow and LLM Agent builder with local model support and multimodal orchestration across desktop, web, CLI, and mobile.

Model & Inference Framework · Large Language Models · Multimodal

CookHero

An LLM-powered personalized diet management platform featuring RAG hybrid retrieval, multimodal understanding, and nutrition analytics.

Model & Inference Framework · Multimodal · RAG

AlphaAvatar

A learnable, configurable, and pluggable Omni-Avatar Assistant framework built on LiveKit, featuring real-time interaction, multimodal memory, user persona, and external tool integration.

Docs, Tutorials & Resources · RAG · Multimodal

WiFi DensePose

A production-ready implementation of InvisPose that enables real-time, camera-free full-body tracking through walls using commodity WiFi mesh routers and CSI signals, with advanced analytics like fall detection and multi-person tracking.

Multimodal · Deep Learning · Docker

StreamRAG

A GPT-powered video retrieval and streaming agent that lets developers upload multiple videos, search across their content in real time, generate summarized text answers through RAG, and publish searchable collections on the ChatGPT store.

Agent & Tooling · Python · Flask

4KAgent

An intelligent agent system designed to process and display 4K video content, offering high-quality video processing capabilities.

Agent & Tooling · Python · AI Agents

DeepVideoDiscovery

A video content discovery tool developed by Microsoft that uses deep learning to automatically identify and extract key content from videos, helping users browse and understand video information efficiently.

Agent & Tooling · Python · PyTorch

Nekro Agent

Nekro Agent is an extensible multi-person interactive agent framework with code execution capabilities, featuring a sandbox-driven architecture, a visual interface, and multimodal interaction support across multiple platforms.

Agent & Tooling · Python · Docker

LLaVA-Plus

LLaVA-Plus is a multimodal assistant system that learns to use tools, combining large language models with visual capabilities to enable AI agents to perform general vision tasks.

Model & Inference Framework · Python · PyTorch