Grafana Labs is a remote‑first, open‑source powerhouse. With more than 20 M users worldwide, Grafana powers dashboards used by NASA, Microsoft, eBay, JPMorgan Chase and many more. Grafana Labs also helps over 3 000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self‑managed with the Grafana Enterprise Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki) and traces (Grafana Tempo). We’re scaling fast while staying true to our open‑source legacy, global collaborative culture, and passion for meaningful work. Our team thrives in an innovation‑driven environment where transparency, autonomy and trust fuel everything we do. We’d love you to raise your hand for what could be a truly career‑defining opportunity, even if you don’t meet every requirement. This is a remote opportunity and we would be interested in applicants based in Canada, EST timezones at this time. Staff Software Engineer – Grafana Cloud Observability, Kubernetes Monitoring The Opportunity: Grafana Cloud is our composable observability platform that integrates metrics, logs and traces with Grafana. It allows customers to leverage the best open‑source observability software – including Prometheus, Mimir, Loki and Tempo – without the overhead of installing, maintaining and scaling their own stack. The Observability department focuses on enabling developers to understand the health and performance of their applications and infrastructure in any environment. We build and maintain the backend for opinionated applications such as Cloud Provider Observability, Database Observability and Kubernetes Monitoring, including dashboards, alerts, documentation and infrastructure, while working closely with other teams to ensure seamless experiences. We also strive to incorporate OSS contributions into our work by contributing to projects such as Alloy, Prometheus, OpenTelemetry and Beyla. The Observability department provides a core building block for customers using Grafana Cloud. What You’ll Be Doing: In this role, you will bring your passion for observability and software engineering expertise to help us take our infrastructure monitoring capabilities within Grafana Cloud to the next level. Responsibilities include: Designing and implementing high‑quality, scalable integrations for various infrastructure components, applications and data ingestion pipelines. Creating middleware components and libraries that simplify development and maintenance of observability solutions. Representing Grafana Labs in open‑source forums, working groups and events when necessary. Working with product teams, designers and documentation to develop features that align with wider product strategy and customer needs. Leading the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions. Collaborating with Sales, Product and Support teams to deliver a holistic product experience. Taking ownership of the services you run by deploying well‑tested, clean code. Embracing our open‑source culture and contributing to projects that may not directly fall within your team’s scope. As an entirely remote organization, we provide guidance and meet regularly using video calls, so an independent attitude, good communication skills and transparency are a must. We invest heavily in developer productivity. You can use modern AI coding assistants as part of your daily workflow, backed by a company‑funded usage budget so you can iterate quickly without unnecessary friction. We encourage pragmatic AI‑assisted development: faster prototyping, test generation, refactors, documentation and incident follow‑ups—always paired with strong code review and quality standards. You’ll also have access to frontier models. What Makes You a Great Fit: Passion for observability and eagerness to share knowledge through documentation and blog posts. Love to engage with customers and help them out. Excellent communication skills. Relevant open‑source experience, ideally in the observability domain. Willingness to become an active member of the OpenTelemetry and Prometheus communities. Curiosity and a desire to learn new programming languages and frameworks, set up examples and figure out how things work. Good understanding of typical production environments; ideally you have been responsible for operating production services and organizing on‑call. Active mentorship of other team members, identifying areas for focus and improvement. Requirements: Strong 8+ years of experience with at least one major programming language (Python, .NET, Java, Go, Rust, etc.). Demonstrated experience operating high‑scale production systems on Kubernetes, including on‑call participation, incident response and post‑mortem practices. Familiarity with observability tooling (e.g., Grafana). Strong understanding of time‑series data, metrics cardinality challenges, and cost/performance trade‑offs in observability systems. Hands‑on technical leadership experience—setting technical direction, leading project teams, influencing architectural decisions beyond your immediate team. Deep understanding of distributed systems concepts: scalability, consistency, high availability and failure modes. Experience writing clean, maintainable, robust and performant software. Experience delivering projects from start to finish in a self‑driven manner. Excellent problem‑solving and debugging skills. Strong mentoring and leadership skills. Bonus Points For: Operating or scaling Prometheus in high‑cardinality, multi‑tenant environments. Working with OpenTelemetry Collector pipelines or similar telemetry ingestion systems. Certified Kubernetes Administrator (CKA)/Certified Kubernetes Application Developer (CKAD) or other CNCF certifications. Developing Kubernetes operators, controllers or custom resources. Strong understanding of metrics collection, visualization and alerting concepts. Contributing to or maintaining open‑source projects with evidence of successful pull requests and community collaboration. Designing and building observability backends for various systems and applications. Compensation & Rewards: In Canada, the compensation range for this role is CAD 186,368–223,642. Actual compensation may vary based on level, experience and skillset, as assessed throughout the interview process. All roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs’ success. Compensation ranges are country specific. If you are applying from a different location than Canada, your recruiter will discuss your specific market’s defined pay range and benefits at the beginning of the process. Why You’ll Thrive at Grafana Labs: 100% Remote, Global Culture – Bring talent from around the world into a collaborative ecosystem. Scaling Organization – Tackle meaningful work in a high‑growth, ever‑evolving environment. Transparent Communication – Expect open decision‑making and regular company‑wide updates. Innovation‑Driven – Autonomy and support to ship great work and try new things. Open Source Roots – Built on community‑driven values that shape how we work. Empowered Teams – High trust, low ego culture that values outcomes over optics. Career Growth Pathways – Defined opportunities to grow and develop your career. Approachable Leadership – Transparent execs who are involved, visible and human. Passionate People – Join a team of smart, supportive folks who care deeply about what they do. In‑Person Onboarding – Learn all about what we do and how we do it from day 1. Balance is Key – Global annual leave policy of 30 days per annum, with 3 days reserved for Grafana Shutdown Days to allow the team to disconnect. We will comply with local legislation where applicable. Equal Opportunity Employer: We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status and all other characteristics that make us different and unique. We believe that equality and diversity build a strong organization and we’re working hard to make sure that’s the foundation of our organization as we grow. Grafana Labs may utilize AI tools in its recruitment process to assist in matching information provided in CVs to job postings. The recruitment team will continue to review inbound CVs manually to identify alignment with current openings. For information about how your personal data is used once you’ve applied to a job, check out our privacy policy. #J-18808-Ljbffr
Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring | Canada | Remote
GRAFANA LABS
, , canada, , , canada
Published 27 days ago
Report job