Staff AI Research Scientist - Evaluation - Remote

Posted 39 weeks ago

Software Development

Full Time

USA

AI Research

Large Language Models

Evaluation Methodologies

Interpretability

Python

PyTorch

Benchmark Development

Overview

As a Staff Research Scientist at Handshake AI, you will drive frontier research on AI evaluation methodologies, focusing on large language models and their interactions with human knowledge.

In Short

Lead teams in original research on LLM evaluation and interpretability.
Develop frameworks to assess model capabilities and limitations.
Collaborate with engineers to create scalable evaluation systems.
Pioneer methods for measuring reasoning and trustworthiness in AI.
Author code for large-scale experimentation and evaluation workflows.
Publish research in top-tier venues.
Work cross-functionally to set industry standards for AI evaluation.

Requirements

PhD or equivalent in machine learning, computer science, or related fields.
6+ years of experience in a research-first environment.
Strong background in LLM research and evaluation methodologies.
Proven ability to design and execute evaluation research.
Proficiency in Python and PyTorch for model analysis.
Experience in benchmark development and systematic assessment.
Strong publication record in evaluation or interpretability.
Ability to communicate complex insights clearly.

Benefits

Equity in a fast-growing company.
401(k) match and competitive compensation.
Paid parental leave and fertility benefits.
Medical, dental, vision, and mental health support.
$2,000 learning stipend for ongoing development.
Flexible PTO and holidays.
Stipends for home office setup and commuting.

Handshake

Handshake is a leading career platform designed specifically for Gen Z, connecting over 17 million students, alumni, employers, and career educators. With a diverse network that includes nearly 1 million companies ranging from Fortune 500 firms to startups, Handshake facilitates the recruitment of emerging talent. The company is committed to fostering an inclusive culture and values diverse teams, believing they create better products. Handshake operates with a hybrid work model, allowing employees to collaborate in vibrant offices while enjoying the flexibility of remote work. Headquartered in San Francisco, with additional offices in New York, London, and Berlin, Handshake prioritizes employee well-being through comprehensive benefits and a supportive work environment.
