Lead the charge in site reliability engineering focusing on cloud systems and AI-driven observability. Leverage your strong Python scripting and experience with tools like PagerDuty and Moogsoft.In this position, you will utilize your strong understanding of SRE principles and distributed systems. Your role will involve working with Ansible and Git for automation, while also exploring Kubernetes and Docker environments. Engaging with generative AI and event-driven architectures will enhance operational efficiency.Key Responsibilities:• Implement and manage observability with Splunk and Dynatrace• Develop automation scripts using Python and Ansible• Collaborate within distributed cloud systems• Utilize Git and GitHub Actions for version control• Engage with container solutions like KubernetesRequirements:• Excellent scripting skills in Python• Experience with AI/ML observability platforms• Familiarity with Kubernetes and Docker environments• Exposure to ChatOps frameworks is a plus• Knowledge of event-driven architectures desiredTransform site reliability with your innovative skills in cloud and AI technologies.#J-18808-Ljbffr
Innovative Site Reliability Engineer For Cloud And Ai Solutions
THEMESOFT INC.
toronto, toronto
Published 27 days ago
Report job