Lead Site Reliability Engineer in DevOps
ITRiders • toronto, on • Posted May 20, 2026
Position Overview
Become a Lead Site Reliability Engineer to drive DevOps initiatives. Implement industry-leading observability tools and practices to maintain and enhance system performance.
We require an SRE with at least 8 years of experience in observability and infrastructure reliability. Your extensive knowledge of tools such as Dynatrace, ELK, and Splunk will enable you to architect solutions that provide visibility and performance insights. Collaborate with cross-functional teams to resolve issues swiftly and cultivate a culture of operational excellence.
Key Responsibilities:
• Architect observability-as-code solutions using Terraform
• Instrument services for comprehensive application visibility
• Manage incidents and perform root cause analysis
• Automate repetitive operational tasks using best practices
• Lead chaos engineering sessions and postmortems
Requirements:
• 8+ years of relevant experience in SRE or DevOps
We require an SRE with at least 8 years of experience in observability and infrastructure reliability. Your extensive knowledge of tools such as Dynatrace, ELK, and Splunk will enable you to architect solutions that provide visibility and performance insights. Collaborate with cross-functional teams to resolve issues swiftly and cultivate a culture of operational excellence.
Key Responsibilities:
• Architect observability-as-code solutions using Terraform
• Instrument services for comprehensive application visibility
• Manage incidents and perform root cause analysis
• Automate repetitive operational tasks using best practices
• Lead chaos engineering sessions and postmortems
Requirements:
• 8+ years of relevant experience in SRE or DevOps