Become a Deployment Engineer focused on revolutionizing AI inference capabilities. Enhance deployment reliability and operational efficiency within sophisticated AI compute infrastructures.In this essential role, you will lead the deployment of AI inference replicas and optimize software rollout across various global datacenters. Utilizing your systems engineering and operational skills, you will develop advanced telemetry solutions and automated pipelines, playing a key part in capacity management. Your work will bridge technical requirements with internal teams to ensure seamless operations.Key Responsibilities:• Deploy and manage AI inference software across multiple datacenters• Operate in rapidly growing heterogeneous environments• Optimize capacity allocation and replica positioning• Enhance telemetry and observability frameworks• Build automated deployment pipelines for agile operationsRequirements:• 2-5 years in on-prem compute infrastructure• Expertise in Python for tooling and automation• Proficient in Linux and command-line utilities• Experience with Docker containers and Kubernetes• Familiarity with telemetry tools like InfluxDB and GrafanaDrive innovation in AI model deployment with your technical expertise and contribute to groundbreaking advancements in the AI landscape.#J-18808-Ljbffr
Dynamic Deployment Engineer For Machine Learning Inference Clusters
CEREBRAS
, , canada, , , canada
Published 27 days ago
Report job