Remote Otter LogoRemoteOtter

Deep Learning Solutions Architect – Inference Optimization - Remote

Posted Yesterday
Software Development
Full Time
UK

Overview

NVIDIA’s Worldwide Field Operations (WWFO) team is seeking a Solution Architect with a deep understanding of neural network inference. The ideal candidate will be proficient using tools such as TRT LLM, vLLM, SGLang or similar, and have strong systems knowledge, enabling customers to fully use the capabilities of the new GB300 NVL72 systems.

In Short

  • Work directly with key customers to understand their technology and provide the best AI solutions.
  • Perform in-depth analysis and optimization on GPU architecture systems.
  • Partner with Engineering, Product, and Sales teams to develop suitable solutions for customers.
  • Enable development and growth of product features through customer feedback.
  • Engage with developers, researchers, data scientists, and IT managers.
  • Demonstrate excellent communication and technical presentation skills.
  • 5+ years of experience with Python/C++ and modern NLP.
  • Experience with key libraries for NLP/LLM training and deployment.
  • Collaborate with various teams in a dynamic environment.
  • Passion for continuous learning and sharing findings across the team.

Requirements

  • MS/PhD or equivalent experience in a relevant field.
  • 5+ years of work or research experience in software development.
  • Understanding of transformer and diffusion model architectures.
  • Familiarity with DevOps tools including Docker and Kubernetes.
  • Experience with HPC systems and data center design.
  • Ability to thrive in dynamic environments.
  • Self-starter with a passion for growth and continuous learning.
  • Experience in running and debugging large-scale deep learning processes.
  • Applied NLP technology in production environments.
  • Strong interpersonal skills to engage with various stakeholders.

Benefits

  • Highly competitive salaries.
  • Comprehensive benefits package.
  • Diverse work environment.
  • Opportunities for growth and development.
  • Work with cutting-edge technology in AI.

N.U

NVIDIA USA

VN01 NVIDIA Vietnam Company Limited is a subsidiary of NVIDIA, a global leader in accelerated computing. The company focuses on pioneering technologies in AI and digital twins, transforming major industries and making a significant impact on society. With a commitment to innovation, NVIDIA Vietnam plays a crucial role in the manufacturing and engineering processes, ensuring high standards of manufacturability and production capabilities in a fast-paced environment. The team collaborates closely with global contract manufacturers and engineering teams to enhance production efficiency and drive continuous improvement.

Share This Job!

Save This Job!

Similar Jobs:

F.N.F

Deep Learning Solutions Architect – Large Scale Inference Optimization - Remote

FR01 NVIDIA France

14 weeks ago

Join NVIDIA as a Deep Learning Solutions Architect to optimize large scale inference and drive AI solutions.

UK, Remote
Full-time
Software Development

phData

Machine Learning Solutions Architect - Remote

phData

12 weeks ago

Join phData as a Machine Learning Solutions Architect, where you'll design and implement data solutions and ensure the successful deployment of machine learning models.

LATAM
Full-time
Software Development

C.S

Generative AI Inference Solutions Architect - Remote

Cerebras Systems

30 weeks ago

Join Cerebras Systems as a Generative AI Inference Solutions Architect to lead technical sales initiatives and drive customer engagement.

Worldwide
Full-time
Sales / Business

phData

Machine Learning Engineer / Solutions Architect - Remote

phData

39 weeks ago

phData is seeking a Machine Learning Engineer to design and implement data solutions in a remote-first environment.

LATAM
Full-time
Software Development

phData

Machine Learning Engineer / Solutions Architect - Remote

phData

39 weeks ago

Join phData as a Machine Learning Engineer to design and implement data solutions in a remote-first environment.

Worldwide
Full-time
Software Development