HUD (YC W25) develops agentic evaluations for Computer Use Agents (CUAs) that browse the web. Its CUA Evals framework is presented as the first comprehensive evaluation tool designed specifically for CUAs, built to provide the detailed evaluations needed to confirm that AI agents work effectively in real-world scenarios. Backed by Y Combinator, HUD collaborates closely with leading AI labs to provide scalable agent evaluation infrastructure. The team includes international Olympiad medallists and experienced AI startup founders.
HUD is hiring for various roles in AI evaluation and offers remote work opportunities.