Remote Otter LogoRemoteOtter

Staff Software Engineer, ML Performance & Systems - Remote

Posted 2 weeks ago
Software Development
Full Time
USA

Overview

Help fal maintain its frontier position on model performance for generative media models. Design and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on maximizing throughput while minimizing latency and resource usage. Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Work closely with our Applied ML team and customers (frontier labs on the media space) and make sure their workloads benefit from our accelerator.

In Short

  • Help fal maintain its frontier position on model performance for generative media models.
  • Design and implement novel approaches to model serving architecture.
  • Focus on maximizing throughput while minimizing latency and resource usage.
  • Develop performance monitoring and profiling tools.
  • Identify bottlenecks and optimization opportunities.
  • Work closely with the Applied ML team and customers.

Requirements

  • Strong foundation in systems programming.
  • Deep understanding of cutting edge ML infrastructure stack.
  • Fundamental view of the underlying hardware.
  • Proficient in Triton or willingness to learn.
  • Familiar with internals of Ring Attention, FA3, FusedMLP implementations.

Benefits

  • Opportunity to work on cutting-edge technology.
  • Collaborate with leading experts in the field.
  • Flexible working environment.

fal

fal

fal is a cutting-edge technology company focused on advancing model performance for generative media models. The company is dedicated to maintaining its leadership position in the industry by designing and implementing innovative model serving architectures that optimize throughput, reduce latency, and minimize resource usage. fal collaborates closely with its Applied ML team and clients in the media sector to ensure that their workloads effectively leverage the company's advanced accelerator technology. With a strong emphasis on performance monitoring and profiling, fal is committed to identifying bottlenecks and exploring optimization opportunities to enhance the efficiency of its systems.

Share This Job!

Save This Job!

Similar Jobs:

Gather logo

Senior Software Engineer - Performance Systems - Remote

Gather

25 weeks ago

Gather is seeking a performance engineer to enhance the efficiency of their virtual office application.

Worldwide
Full-time
Software Development
$160,650 - $203,175/year
Abnormal Security logo

Staff Software Engineer - Distributed Systems Performance Engineering - Remote

Abnormal Security

9 weeks ago

Join Abnormal Security's Advanced Technology Group to optimize cloud systems and drive efficiency in SaaS email products.

USA
Full-time
Software Development
$209,800 - $246,800/year
Pharma Universe logo

Performance Engineer - Software - Remote

Pharma Universe

3 weeks ago

A Performance Engineer to ensure optimal performance, testing, and observability within a product domain.

United Kingdom
Full-time
Software Development

Mitratech

Software Engineer - Performance Management System - Remote

Mitratech

2 weeks ago

Join Mitratech as a Software Engineer to work on a performance management system focused on employee engagement.

Worldwide
Full-time
Software Development

Seeking a Systems Performance Engineer to optimize performance-critical systems for custom hardware in the field of cryptography.

USA
Full-time
DevOps / Sysadmin