Lead advancements in AI as a Senior Performance and Reliability Engineer. Focus on benchmarking and optimizing hardware/software systems to enhance power management and reliability.

In this role, you will contribute to the development of the next-generation AI architecture. Characterize advanced ML hardware performance, while collaborating closely with ML engineers and researchers to drive impactful system-level improvements. Your software solutions will be essential in enhancing reliability and performance across innovative applications.

Key Responsibilities:
• Characterize and optimize advanced ML systems
• Analyze workloads for performance and power impacts
• Develop solutions for enhanced software reliability
• Influence AI architecture design through analysis
• Collaborate with cross-disciplinary engineering teams

Requirements:
• BS, MS, or PhD degree in a related field
• 3+ years in performance engineering/optimization
• Skilled in Python and C/C++ programming
• Experience with ML frameworks and thermal management is a plus
• Excellent communication capabilities are essential

Drive the evolution of AI systems, applying your engineering expertise to ensure robust performance and reliability in a dynamic, innovative environment.
Senior Engineer in AI Performance and Reliability Optimization
CEREBRAS
Canada
Published 27 days ago