The Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment. Key Responsibilities Design and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar. Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting. Ensure compliance and performance optimization across multi-tenant deployments. Conduct code quality analysis and security assessments using Sonar. Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience. Develop and maintain documentation for system configurations and recovery procedures in a classified environment. Required Skills and Qualifications Expertise in RKE2, Kubernetes, Ingress, Kong, Artifactory, Prometheus, Grafana, Splunk, Elastic, and Sonar. Strong background in site reliability engineering and system observability. Experience working in air-gapped environments with a focus on classified data protection. Proficiency in troubleshooting and optimizing complex, multi-tenant infrastructures. Preferred Qualifications SRE or DevOps certifications (e.g., CKAD, CKA). Prior experience with government or defense-related SRE roles. Seniority levelSeniority level Mid-Senior level Employment typeEmployment type Full-time Job functionJob function Engineering and Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Orion Innovation by 2x Get notified about new Site Reliability Engineer jobs inQuebec, Canada . Drone Operations and Ground Equipment System EngineerGreater Montreal Metropolitan Area 3 days ago Senior Site Reliability Engineer- Central PlatformsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsFreelance Software Developer (Python Engineer) - AI TrainerPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython Software Engineer - Ubuntu Hardware Certification TeamStaff Software Engineer, Social Media & Client MarketingGreater Montreal Metropolitan Area 4 days ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.#J-18808-Ljbffr
Senior Site Reliability Engineer
ORION INNOVATION
québec, québec
Published 7 days ago
Report job