RemoteOtter

Software Data Engineer - Remote

Posted 1 week ago
Software Development
Full Time
USA

Overview

In this role, you’ll play a pivotal part in building and optimizing data pipelines that transform large, multi-modal datasets into high-quality training inputs for cutting-edge AI models for drug discovery.

In Short

  • Design and improve data pipelines that process large, multi-modal datasets from a variety of internal and external sources into training datasets for AI models.
  • Evolve our data storage layer to support analytics, schema evolution, reproducibility, and efficient data access.
  • Collaborate with ML engineers to improve the performance and reliability of Python-based data processing workflows.
  • Collaborate on the creation, testing and maintenance of software systems.
  • Review pull requests in adjoining areas.
  • Maintain, and mentor others in, software best practices, including version control, testing and documentation.
  • Communicate work clearly in meetings and company demos, at a level suited to the audience.

Requirements

  • Minimum of 8 years of related experience with a Bachelor’s degree; or 6 years with a Master’s degree; or 3 years with a PhD; or equivalent experience.
  • Proven ability to design flexible, maintainable ETL systems.
  • Experience with data pipeline orchestration tools such as Prefect, Airflow, Argo, Databricks, or Spark.
  • Understanding of the ML model lifecycle; prior work with scientific or ML workflows is a plus.
  • Hands-on experience with multi-terabyte scale data processing.
  • Familiarity with AWS; Kubernetes experience is a bonus.
  • Knowledge of data lake technologies such as Parquet, Iceberg, and AWS Glue.
  • Strong Python software engineering skills.
  • Pragmatic mindset: able to evaluate tradeoffs and find solutions that empower ML researchers to move quickly.
  • Background in bioinformatics or chemistry is a plus.

Benefits

  • Industry-leading, competitive pay.
  • Company-paid healthcare.
  • Flexible spending accounts.
  • Voluntary life insurance.
  • 401(k) matching.
  • Uncapped vacation.
  • State-of-the-art facility in San Diego with an onsite gym and dining.
  • Easy access to great places to live and play.

Iambic Therapeutics

Iambic Therapeutics is a pioneering biotechnology company focused on developing innovative therapies to address unmet medical needs. With a commitment to advancing healthcare through cutting-edge research and development, Iambic Therapeutics leverages the latest scientific discoveries to create effective treatments. The company fosters a collaborative and dynamic work environment, encouraging creativity and excellence among its team members to drive breakthroughs in therapeutic solutions.
