RemoteOtter

Staff AI Research Scientist - Evaluation - Remote

Posted 2 days ago
Software Development
Full Time
USA

Overview

As a Staff Research Scientist at Handshake AI, you will drive frontier research on AI evaluation methodologies, focusing on large language models and their interactions with human knowledge.

In Short

  • Lead teams in original research on LLM evaluation and interpretability.
  • Develop frameworks to assess model capabilities and limitations.
  • Collaborate with engineers to create scalable evaluation systems.
  • Pioneer methods for measuring reasoning and trustworthiness in AI.
  • Author code for large-scale experimentation and evaluation workflows.
  • Publish research in top-tier venues.
  • Work cross-functionally to set industry standards for AI evaluation.

Requirements

  • PhD or equivalent in machine learning, computer science, or related fields.
  • 6+ years of experience in a research-first environment.
  • Strong background in LLM research and evaluation methodologies.
  • Proven ability to design and execute evaluation research.
  • Proficiency in Python and PyTorch for model analysis.
  • Experience in benchmark development and systematic assessment.
  • Strong publication record in evaluation or interpretability.
  • Ability to communicate complex insights clearly.

Benefits

  • Equity in a fast-growing company.
  • 401(k) match and competitive compensation.
  • Paid parental leave and fertility benefits.
  • Medical, dental, vision, and mental health support.
  • $2,000 learning stipend for ongoing development.
  • Flexible PTO and holidays.
  • Stipends for home office setup and commuting.

Handshake

Handshake is a leading career platform designed specifically for Gen Z, connecting over 17 million students, alumni, employers, and career educators. With a diverse network that includes nearly 1 million companies ranging from Fortune 500 firms to startups, Handshake facilitates the recruitment of emerging talent. The company is committed to fostering an inclusive culture and values diverse teams, believing they create better products. Handshake operates with a hybrid work model, allowing employees to collaborate in vibrant offices while enjoying the flexibility of remote work. Headquartered in San Francisco, with additional offices in New York, London, and Berlin, Handshake prioritizes employee well-being through comprehensive benefits and a supportive work environment.
