PaperFarm
✨An AI Agent-driven automated experiment framework that points at any code repo, autonomously analyzes, designs, runs experiments, and keeps improvements that work
An AI Agent-driven automated experiment framework that points at any code repo, autonomously analyzes, designs, runs experiments, and keeps improvements that work
An all-in-one AI ecosystem browser in the terminal — explore models, benchmarks, coding agents, and provider status via TUI/CLI
A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.
A step-by-step workshop that teaches you how to build your own AI-powered coding assistant, starting from a basic chatbot and progressively adding powerful tools like file reading, shell command execution, and code search.
An open-source deep research agent optimized for research and prediction tasks, achieving 80.8% Avg@8 score on the challenging GAIA benchmark, featuring 256K context window support and up to 600 tool calls per task.
An educational project that teaches you how to build modern AI coding agents from scratch through progressive tutorials, featuring 5 versions from simple bash tools to a complete skills system.
Page 1 / 1 · 6 total
Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.