Agent Park - Agent Project Navigator

All Projects

2 projects

BullshitBench

✨

A benchmark measuring whether AI models challenge nonsensical prompts rather than confidently answering them, featuring 100 questions across 5 domains with a 3-tier judgment system and multi-judge panel.

PythonLarge Language ModelsCLI

VIEW DETAILS →

Local Deep Research

🧠

A local-first AI research assistant featuring multi-LLM support, 20+ research strategies, multi-search-engine integration, and automated quality scoring for 212K+ academic sources, producing citation-backed PDF/Markdown reports via CLI, Web UI, REST API, or MCP Server.

PythonKnowledge BaseFastAPI

VIEW DETAILS →

Per page

Page 1 / 1 · 2 total

Browse by Filters

Project Type

Filter by Domain

Filter by Product Form

All Projects

BullshitBench

Local Deep Research

STAY UPDATED