Your Impact

You will design and build the infrastructure primitives that define how Chainlink Decentralized Oracle Networks (DONs) scale across internal systems and the decentralized ecosystem.

You will help create the CRE (Kubernetes-based) control plane that enables:
- Deterministic horizontal scaling of DONs
- Safe and repeatable infrastructure expansion
- Improved operational efficiency and scalability

You will develop the core infrastructure components, including Kubernetes Operators and scaling automation, that Product teams will adopt and that might later be distributed to external node operators to improve decentralized scaling.

This is not an operational support role. You will be building the systems that define how Chainlink scales while shaping the reliability, scalability, and decentralization of protocol-level services.

Requirements

- 6–9+ years in SRE / Platform / Infrastructure Engineering
- Proven experience scaling Kubernetes in high-throughput production environments
- Deep knowledge of:
  - Scheduler behavior
  - StatefulSets & persistent workloads
  - Autoscaling strategies (HPA, VPA, KEDA, custom scaling)
  - Resource management & performance tuning
  - Multi-cluster and multi-region architectures
- Experience diagnosing production failures at cluster scale
- Strong Terraform or Crossplane experience
- GitOps workflows (ArgoCD / Flux) experience
- CI/CD reliability experience
- Automation-first mindset
- AWS production experience
- Proficiency in Go (strongly preferred) or an equivalent systems language

Desired Qualifications

- Experience with web3 concepts (e.g., blockchain node lifecycle, forks, reorgs, or RPC issues)
- Experience with oracle systems, token architectures, or decentralized services
- Experience scaling stateful high-availability distributed systems
- Experience building internal platform primitives
- Experience implementing custom autoscaling logic
- Experience designing SLO strategies and error-budget usage
- Experience improving diagnosability and observability frameworks
- Experience working in high-ambiguity environments
- Experience operating blockchain infrastructure in production
- Certified Kubernetes Administrator (CKA)
- Experience contributing to Kubernetes ecosystem projects
- Experience building multi-tenant platform infrastructure
- Experience working in high-security and/or SOC 2/ISO 27001 compliant environments
- Experience with chaos engineering practices or implementation

Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Global Data Privacy Notice for Job Candidates and Applicants

Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you choose to submit, is subject to our Recruiting Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.

Senior Site Reliability Engineer, Node Platform
FRAMEWORK VENTURES
Vancouver, Vancouver
Published 27 days ago