HUD is developing agentic evals for Computer Use Agents (CUAs) that browse the web, aiming to provide the detailed evaluations AI agents need to function effectively in real-world scenarios.
HUD (YC W25) develops agentic evaluations for Computer Use Agents (CUAs) that browse the web. The company describes its CUA Evals framework as the first comprehensive evaluation tool designed specifically for CUAs, built to provide the detailed evaluations needed to ensure AI agents function effectively in real-world scenarios. Backed by Y Combinator, HUD collaborates closely with leading AI labs to provide scalable agent evaluation infrastructure. The team includes international Olympiad medallists and experienced AI startup founders.
Stealth Venture Capital Firm
Join a leading AI research lab as an AI Research Engineer, collaborating with top talent in a dynamic environment.
Axelera AI
Join Axelera as an AI Research Engineer to advance data generation and optimization for cutting-edge AI models.
Join our AI Research Team as an AI Research Engineer to bridge cutting-edge AI research with large-scale production.
Join Tiger Analytics as an Agentic AI Engineer to leverage your expertise in Gen AI and Machine Learning for Fortune 500 clients.
Join Pallon as a Research Engineer to develop AI solutions for sewer inspection, contributing to urban sustainability.