Director of Engineering, Machine Learning Performance | Sunnyvale, CA

Detailed Information

  • Location: Sunnyvale, CA

  • Company: Google

our customers and go out of your way to enable our customers to succeed. Google Cloud accelerates organizations' ability to digitally transform their business with the best infrastructure, platform, industry solutions and expertise. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology - all on the cleanest cloud in the industry.

Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems. The US base salary range for this full-time position is $271,000-$399,000 bonus equity benefits. Our salary ranges are determined by role, level, and location. The range

displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Minimum qualifications: 15 years of experience as an Engineering leader. Experience with CPU,

GPU, or TPU architecture. Experience with XLA, MLIR, or similar infrastructure for building high-performance compilers.

Preferred qualifications: Expertise in reasoning and quantifying relative impacts and risks of technical work that could require the involvement of engineers. Technical expertise in systems and software with the leadership skills to influence technical leaders across the company. Ability to quickly ramp up in new subject areas with short notice to guide a few weeks of mission-critical research into the largest potential risks related to infrastructure usage. Knowledge of compiler internals and optimization techniques. - Develop high-performance infrastructure for analyzing, evaluating, and deploying high-performance techniques to ML workloads.

- Performance analysis of LLM training and serving models to identify performance inefficiencies across the software stack on TPUs and GPUs. - Increase efficiency of ML workloads through co-designed algorithmic and system level techniques (e. g. quantization, sparsity). - Develop high-performance tools and techniques for Machine Learning workloads and frameworks through automation, dashboarding, and increased developer velocity. - Participate in hardware-software co-design activities for the upcoming generation of TPUs that are potentially specialized to LLMs.

Requisition #: 137956014348477126pca3lyuhf

View Jobs by Category >>

Related Jobs