Remote Otter LogoRemoteOtter

Data Infrastructure Engineer - Remote

Posted Yesterday
Software Development
Full Time
USA

Overview

We are seeking a Data Infrastructure Engineer to join our growing team. In this role, you will design, build, and operate distributed data systems that power large-scale ingestion, processing, and transformation of datasets used for AI model training.

In Short

  • Design and maintain distributed ingestion pipelines for structured and unstructured data.
  • Build scalable ETL/ELT workflows for AI/ML model training.
  • Support preprocessing of unstructured assets for training pipelines.
  • Architect pipelines across cloud storage and optimize processing with distributed frameworks.
  • Use infrastructure-as-code to manage environments.
  • Maintain data lineage and governance for AI/ML datasets.
  • Collaborate with ML researchers and engineers.
  • Adapt pipelines to evolving pretraining and evaluation needs.
  • Embrace versatility in problem-solving.
  • Contribute to a culture of fast iteration and collaborative ownership.

Requirements

  • 5+ years of experience in data engineering or distributed systems.
  • Strong programming skills in Python; Scala/Java/C++ is a plus.
  • Solid SQL skills for analytics and transformations.
  • Proficiency with distributed frameworks like Spark or Dask.
  • Familiarity with cloud platforms (AWS/GCP/Azure).
  • Experience with workflow orchestration tools.

Benefits

  • Competitive salary, equity, and benefits package.
  • Opportunity to work with a talented team at the forefront of AI and 3D technology.
  • Flexible work environment, with options for remote and on-site work.
  • Fast professional growth and development opportunities.
  • An inclusive culture that values creativity and collaboration.
  • Unlimited, flexible time off.
Meshy logo

Meshy

Meshy is a pioneering 3D generative AI company based in Silicon Valley, dedicated to unleashing 3D creativity by transforming text and images into stunning 3D models in just minutes. With a global team of 30 experts, including alumni from prestigious institutions like MIT and Harvard, and veterans from leading tech companies such as Nvidia, Microsoft, and Google, Meshy has garnered a user base of 1 million, including notable clients like Supercell, Sega, and Snap. Backed by top venture capital firms like GGV and Sequoia, Meshy is at the forefront of innovation in the 3D asset creation space, catering to both professional artists and hobbyists.

Share This Job!

Save This Job!

Similar Jobs:

Worth AI logo

Data Infrastructure Engineer - Remote

Worth AI

13 weeks ago

Join Worth AI as a Data Infrastructure Engineer to design and maintain scalable data systems that support analytics and product applications.

USA
Full-time
Software Development
Worth AI logo

Data Infrastructure Engineer - Remote

Worth AI

13 weeks ago

Join Worth AI as a Data Infrastructure Engineer to design and maintain scalable data systems in a cloud environment.

FL, USA
Full-time
Software Development
Houzz logo

Data Infrastructure Engineer - Remote

Houzz

15 weeks ago

Join our data infrastructure team as a Data Infrastructure Engineer to build scalable and reliable data systems.

Taiwan
Full-time
Software Development
P 1ai logo

Data Infrastructure Engineer - Remote

P 1ai

15 weeks ago

Join P-1 AI as a Data Infrastructure Engineer to manage and enhance data systems for our innovative AI solutions.

USA, Canada
Full-time
Software Development
Intellectsoft logo

Data Infrastructure Engineer - Remote

Intellectsoft

17 weeks ago

Join Intellectsoft as a Data Infrastructure Engineer to design and optimize data pipelines while collaborating with various teams.

Colombia
Full-time
Software Development