jobs Logo
Tekgence Inc logo

Site Reliability Engineer

Tekgence Incabout 19 hours ago
Toronto, Ontario, Canada
Senior Level
CONTRACTOR

About the role

Position Summary: We are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in Observability, DevOps, and Reliability Engineering to design, implement, and maintain highly available, scalable, and resilient cloud-native platforms. The ideal candidate will have hands-on experience with modern monitoring and incident management platforms, Kubernetes-based infrastructure, Infrastructure as Code (IaC), and reliability frameworks such as SLI/SLOs.

Required Qualifications: 5+ years of experience in SRE, DevOps, Platform Engineering, or Infrastructure Operations roles Strong experience with observability and monitoring platforms: (Dynatrace, ELK Stack, Splunk, PagerDuty) Proven expertise in implementing and managing SLI/SLO-based reliability programs Hands-on experience with Azure Cloud and Azure-managed services Strong experience with Azure Kubernetes Service (AKS) and container orchestration Expertise in Terraform and Infrastructure as Code (IaC) Experience with Linux systems administration and cloud-native architectures Strong scripting and automation skills using Python, Bash, PowerShell, or similar languages Experience with CI/CD pipelines and DevOps tooling Excellent troubleshooting, analytical, and problem-solving skills

Preferred Qualifications: Experience with cloud-native monitoring and distributed tracing Knowledge of microservices architectures and service mesh technologies Familiarity with GitOps, Kubernetes operators, and platform engineering practices Azure certifications such as: Microsoft Certified: Azure Administrator Associate Microsoft Certified: Azure DevOps Engineer Expert Certifications in Kubernetes, SRE, or cloud technologies are a plus

About Tekgence Inc

IT Services and IT Consulting