Lead advancements in AI as a Senior Performance and Reliability Engineer. Focus on benchmarking and optimizing hardware/software systems to enhance power management and reliability.In this role, you will contribute to the development of the next-generation AI architecture. Characterize advanced ML hardware performance, while collaborating closely with ML engineers and researchers to drive impactful system-level improvements. Your software solutions will be essential in enhancing reliability and performance across innovative applications.Key Responsibilities: • Characterize and optimize advanced ML systems • Analyze workloads for performance and power impacts • Develop solutions for enhanced software reliability • Influence AI architecture design through analysis • Collaborate with cross-disciplinary engineering teamsRequirements: • BS, MS, or PhD degree in a related field • 3+ years in performance engineering/optimization • Skilled in Python and C/C++ programming • Experience with ML frameworks and thermal management is a plus • Excellent communication capabilities are essentialDrive the evolution of AI systems, applying your engineering expertise to ensure robust performance and reliability in a dynamic, innovative environment. #J-18808-Ljbffr
Senior Engineer In Ai Performance And Reliability Optimization
CEREBRAS
winnipeg, winnipeg
Published 28 days ago
Report job