The Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment.

Key Responsibilities

Design and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar.
Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting.
Ensure compliance and performance optimization across multi-tenant deployments.
Conduct code quality analysis and security assessments using Sonar.
Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience.
Develop and maintain documentation for system configurations and recovery procedures in a classified environment.

Required Skills and Qualifications

Expertise in RKE2, Kubernetes, Ingress, Kong, Artifactory, Prometheus, Grafana, Splunk, Elastic, and Sonar.
Strong background in site reliability engineering and system observability.
Experience working in air-gapped environments with a focus on classified data protection.
Proficiency in troubleshooting and optimizing complex, multi-tenant infrastructures.

Preferred Qualifications

SRE or DevOps certifications (e.g., CKAD, CKA).
Prior experience with government or defense-related SRE roles.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: IT Services and IT Consulting
Senior Site Reliability Engineer
ORION INNOVATION
Quebec, Quebec
Published 27 days ago