WebsiteHunt

First interactive reasoning benchmark for AI agents.