Remote Otter LogoRemoteOtter

CPU Workload Performance Optimization Engineer - Remote

Posted 14 weeks ago
Software Development
Full Time
USA
$100000 - $500000/year

Overview

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

In Short

  • Conduct competitive analysis for latest CPU products with industry-standard and emerging benchmarks
  • Analyze and optimize CPU workloads, identifying performance bottlenecks and inefficiencies in microarchitecture and software interactions.
  • Reduce workloads for CPU performance modeling and performance correlation
  • Run performance models, EDA frameworks, and/or profiling tools to measure, characterize and predict CPU behavior under various workloads.
  • Collaborate with CPU architects and hardware designers to enhance microarchitectural features and overall processor efficiency.
  • Work closely with software engineers to optimize applications, compilers, supporting libraries, and operating systems for better CPU performance.
  • Drive key workload performance optimization on existing hardware by developing handwritten kernels.
  • Publish performance tuning guidelines and best practices for internal teams and external developers and customers.
  • Stay abreast of industry trends, new workload requirements, and advancements in CPU performance analysis and optimization techniques.

Requirements

  • Bachelor’s, Master’s, or PhD in Computer Engineering, Electrical Engineering, or a related field.
  • At least 5 years of research or industry experience in CPU performance analysis, workload optimization, or microarchitecture design.
  • Strong expertise in microarchitectural performance tuning, profiling, and analysis.
  • Proficiency in performance profiling tools such as perf, VTune, AMDuProf or proprietary CPU performance analysis tools.
  • Deep understanding of CPU microarchitecture concepts, including CPU pipelines, memory hierarchy, and branch prediction.
  • Experience with benchmarking and workload characterization methodologies.
  • Strong programming skills in C/C++, assembly language (with a focus on vector-length agnostic programming), and scripting languages such as Python and Shell.
  • Knowledge of operating system internals, compilers, and performance optimizations at the OS level.
  • Experience with GCC or LLVM compiler development is a plus.
  • Familiarity with cloud computing and HPC workloads is a plus.
  • Excellent problem-solving skills and ability to work across multidisciplinary teams.

Benefits

  • Highly competitive compensation package
  • Equal opportunity employer
  • Comprehensive benefits

Tenstorrent

Tenstorrent

Tenstorrent is a forward-thinking company focused on building strong relationships with investors and the financial community to achieve its business objectives. With a hybrid or remote work model, Tenstorrent operates from key locations including NYC, San Francisco, Austin, and Santa Clara. The company values diverse experience levels and emphasizes collaboration across teams, including finance, legal, and HR, to ensure effective communication and reporting. Tenstorrent is committed to providing a competitive compensation package and fostering an inclusive work environment, while also adhering to U.S. Export Control laws and regulations.

Share This Job!

Save This Job!

Similar Jobs:

Beekeeper

DevOps Engineer - Performance Optimization - Remote

Beekeeper

17 weeks ago

Join Beekeeper as a DevOps Engineer to enhance platform performance and optimize API and database efficiency.

Worldwide
Full-time
DevOps / Sysadmin
ClickHouse logo

C++ Performance Engineer - Remote

ClickHouse

17 weeks ago

Join ClickHouse as a C++ Performance Engineer to optimize the performance of our core database in a remote-first environment.

Worldwide
Full-time
Software Development
ClickHouse logo

C++ Performance Engineer - Remote

ClickHouse

17 weeks ago

Join ClickHouse as a C++ Performance Engineer to optimize the performance of our core database in a remote-first environment.

Worldwide
Full-time
Software Development
ClickHouse logo

C++ Performance Engineer - Remote

ClickHouse

17 weeks ago

Join ClickHouse as a C++ Performance Engineer to optimize the performance of our high-performance database in a fully remote role.

United Kingdom
Full-time
Software Development
ClickHouse logo

C++ Performance Engineer - Remote

ClickHouse

17 weeks ago

Join ClickHouse as a C++ Performance Engineer to optimize the performance of their high-performance database in a remote-first environment.

Worldwide
Full-time
Software Development