Job Overview This role will be responsible for leading the design, development, implementation and support of Site Reliability Engineering (SRE) solutions for applications supported by the Cloud organization.ResponsibilitiesLead code and non-functional reviews of all production-bound SRE solutionsDrive transformation by automating existing processes and conducting engineering mindset meetupsManage SRE application assets such as cloud instances and source code repositories and publish technical designsPublish and review implementation plans for SRE solutions bound to production, explore new capabilities and technologies, and document how-to guidesTrack, audit, monitor and implement on technical work streams, acting as a portfolio SME and documenting common components and infrastructureAct as the escalation point in the on-call rotation, supporting maintenance, scheduled work, release deployments, incident and problem managementOwn RCA action items, focus on continuous improvement and technical standards, and drive productivity gains in monitoring, tooling and best practicesMaintain technology currency through patching, certificate renewal and compliance, with a focus on automationEnsure application availability and uptime per SLAs, manage PagerDuty rules and thresholds, Moogsoft situation management, Dynatrace tuning, and coach the SRE teamAssist developers in delivering reliable, high-performing codeHelp build a high-performing diverse team that leverages individual strengthsQualificationsAdvanced knowledge of industry practices, with a focus on SREAdvanced experience in diverse environments (cloud, distributed, mainframe, business workflows, APIs, databases)Excellent communication skills with a direct styleEffective negotiation and stakeholder management skills, with the ability to influence at the director levelHands-on experience with SRE tools and languages (Ansible, Dynatrace Managed, Moogsoft, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Grafana, Catch Point, RedHat OCP)Nice-to-haveComputer Engineering, Computer Science, or related technical degree or experienceExposure to Azure, AWS, Docker, OCP, GitHubExperience working in agile environmentsExposure to Java, Go, Terraform, Spring, TemporalBenefitsA comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, and stock where applicableLeaders who support your development through coaching and managed opportunitiesOpportunity to make a lasting impact in technology transformationWork in a dynamic, collaborative, high-performing teamFlexible work/life balance optionsOpportunities to take on progressively greater accountabilities#J-18808-Ljbffr