Be a pivotal Site Reliability Engineer focused on improving infrastructure resilience and reliability. Collaborate remotely to drive operational success and enhance system performance in a dynamic environment.

This role allows you to design and operate reliable systems, playing a key role in preventing incidents and responding effectively. By leading initiatives for continuous improvement and managing operational standards, you will ensure our services are robust and capable of scaling as necessary.

Key Responsibilities:
• Drive improvements in system reliability and scalability
• Oversee incident management and automation
• Provide support via on-call rotations for critical services
• Define SLIs, SLOs, and error budgets for decision-making
• Enhance monitoring and observability across systems

Requirements:
• Experience with cloud environments like AWS
• Solid knowledge of chaos engineering tools
• Proven ability to troubleshoot live production systems
• Familiarity with scripting and programming practices
• Self-starter attitude in fast-paced work settings

Utilize your skills to streamline operations and enhance system reliability, ensuring a solid foundation for ongoing technological advancements.

Site Reliability Engineer For Cloud Infrastructure Management

NEWTON
