Your Career, our Future—Together.Ready to join something big? At SoundHound AI, we bring voice, generative, and conversational AI together to transform how people interact with products and services. From voice-enabled vehicles to food ordering and customer support, our multilingual, omnichannel technology already impacts hundreds of millions worldwide.The Opportunity This is a high-ownership role with direct influence over infrastructure decisions. The team has a clear roadmap focused on improving reliability, security posture, and operational maturity. The Senior Site Reliability Engineer helps build first-class infrastructure to deliver our best-in-class technology to the world. The infrastructure is large and complex, running in the cloud and on Kubernetes, so there's no shortage of interesting problems.What You'll DoBuild software and systems for cloud infrastructure management and automation (Terraform, Ansible, Oracle Cloud, GCP)Participate in developing frameworks for application deployment, customization, and upgrades (Kubernetes, ArgoCD, Vault, Jenkins)Ensure application and infrastructure security complies with ISO 27001 / SOX / PCIImprove observability, implement and measure key metrics, and define and enforce SLOs/SLAs (Prometheus, Grafana, ELK)Collaborate with engineering, quality engineering, and product management to architect and build highly available, reliable, and secure systemsWhat You'll Bring8 years of experience working with cloud services at scale in a high-volume customer-facing environment with a Bachelor's degree in Computer Science or equivalentWilling to participate in on-call rotationVast experience working in Linux environments, security, and networking with Python, Go, or BashVery experienced with monitoring and alerting tools such as Prometheus, Grafana, ELK stack, and PagerDutyExperience with deployments in cloud technologies and architectures, CI/CD tools, and configuration management such as Ansible, Terraform, and KubernetesProficient with a wide range of relevant server-side technologies such as Consul, Vault, Kafka, MongoDB, PostgreSQL, MySQLPragmatic, problem‑solving approach when designing and implementing solutionsWorkplace & Compensation This role is available throughout Canada. Employees within a 100-kilometer radius of our Toronto office are expected to work from the office on three pre-scheduled “core days” each month to encourage cross-team connection and in-person collaboration.Compensation includes salary, equity, comprehensive healthcare, paid time off, and other benefits. Our recruiting team will provide a specific salary range based on location and years of experience.Benefits Employees are supported with reasonable accommodations for individuals with disabilities throughout the hiring process and employment. We provide comprehensive benefits including healthcare, paid time off, and equity.#J-18808-Ljbffr
Senior Site Reliability Engineer
SOUNDHOUND AI
toronto, toronto
Published 24 days ago
Report job