Remote Otter LogoRemoteOtter

Machine Learning Engineer - d-Matrix Model Factory - Remote

Posted 13 weeks ago
Software Development
Full Time
CA, USA

Overview

At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.

In Short

  • Design, build, and optimize machine learning deployment pipelines for large-scale models.
  • Implement and enhance model inference frameworks.
  • Develop automated workflows for model development, experimentation, and deployment.
  • Collaborate with research, architecture, and engineering teams to improve model performance and efficiency.
  • Work with distributed computing frameworks (e.g., PyTorch/XLA, JAX, TensorFlow, Ray) to optimize model parallelism and deployment.
  • Implement scalable KV caching and memory-efficient inference techniques for transformer-based models.
  • Monitor and optimize infrastructure performance across different levels of custom hardware hierarchy.
  • Ensure best practices in ML model versioning, evaluation, and monitoring.

Requirements

  • Strong programming skills in Python and experience with ML frameworks like PyTorch, TensorFlow, or JAX.
  • Hands-on experience with model optimization, quantization, and inference acceleration.
  • Deep understanding of Transformer architectures, attention mechanisms, and distributed inference.
  • Knowledge of quantization (INT8, BF16, FP16) and memory-efficient inference techniques.
  • Solid grasp of software engineering best practices, including CI/CD, containerization (Docker, Kubernetes), and MLOps.
  • Strong problem-solving skills and ability to work in a fast-paced, iterative development environment.

Benefits

  • Work at the intersection of AI software and custom AI hardware, enabling cutting-edge model acceleration.
  • Collaborate with world-class engineers and researchers in a fast-moving AI-driven environment.
  • Freedom to experiment, innovate, and build scalable solutions.
  • Competitive compensation, benefits, and opportunities for career growth.
d-Matrix logo

d-Matrix

d-Matrix is a pioneering company dedicated to harnessing the power of generative AI to drive technological transformation. With a strong emphasis on software and hardware innovation, d-Matrix is committed to pushing the boundaries of what is possible in the tech industry. The company fosters a culture of respect, collaboration, and inclusivity, valuing diverse perspectives to create better solutions. d-Matrix seeks passionate individuals who are eager to tackle challenges and contribute to shaping the future of AI. The company operates in a flexible work environment, offering remote or hybrid options, and is dedicated to maintaining an inclusive workplace where all employees feel empowered to excel.

Share This Job!

Save This Job!

Similar Jobs:

Replicate logo

Machine Learning Engineer - Media Models - Remote

Replicate

30 weeks ago

Join Replicate as a machine learning engineer to enhance AI model deployment and contribute to innovative projects.

United States
Full-time
Software Development
Shield logo

Machine Learning Engineer - Remote

Shield

14 weeks ago

Join Shield as a Machine Learning Engineer to develop and maintain advanced machine learning infrastructure and collaborate with a talented team.

IL
Full-time
Software Development
Influur logo

Machine Learning Engineer - Remote

Influur

14 weeks ago

Join Influur as a Machine Learning Engineer to design and implement impactful ML models in a collaborative environment.

USA
Full-time
Software Development
Influur logo

Machine Learning Engineer - Remote

Influur

14 weeks ago

Join Influur as a Machine Learning Engineer to design and deploy impactful ML models.

USA
Full-time
Software Development
Truelogic logo

Machine Learning Engineer - Remote

Truelogic

14 weeks ago

Join Truelogic as a Machine Learning Engineer to develop and maintain systems for data science applications.

Mexico
Full-time
Software Development