About the Role The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Neuron Compiler team constructs a deep learning compiler stack that allows state‑of‑the‑art large language, vision, and multi‑modal models created in TensorFlow, PyTorch, and JAX to run efficiently on the accelerators.Responsibilities Our performance engineers collaborate across compiler, runtime, and framework teams to optimize machine learning workloads for our global customer base. They:Analyze and optimize system‑level performance of machine learning models across the entire technology stack, from frameworks to runtime.Conduct detailed performance analysis and profiling of ML workloads, identifying and resolving bottlenecks in large‑scale ML systems.Work directly with customers to enable and optimize their ML models on AWS accelerators, understanding their specific requirements and use cases.Design and implement compiler optimizations, transforming manual performance improvements into automated compiler passes.Collaborate across teams to develop innovative optimization techniques that enhance AWS Neuron SDK’s performance capabilities.Work in a fast‑paced, startup‑like environment, constantly focusing on the most impactful work.Basic Qualifications3+ years of non‑internship professional software development experience.2+ years of non‑internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience.Experience programming with at least one software programming language.Preferred Qualifications3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.Bachelor's degree in computer science or equivalent.Experience in compiler design for CPU/GPU/Vector engines/ML‑accelerators.Experience with system‑level performance analysis and optimization.Experience with LLVM and/or MLIR.Experience with PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms.Equal Opportunity Statement Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.Company: Amazon Development Centre Canada ULCJob ID: A #J-18808-Ljbffr
Ml Compiler Engineer , Aws Neuron, Annapurna Labs
AMAZON WEB SERVICES (AWS)
toronto, toronto
Published 18 days ago
Report job