Site Reliability Engineer

Vertex Elite LLC · ahuntsic north, qc, Canada

Location
ahuntsic north
Job Type
Full-time
Posted
May 28, 2026

Job Description

Duration: Contract Key Skills: Monitoring / Observability tools - Dynatrace, ELK etc. Platform/ cloud Observability - OpenShift, Prometheus / Azure Cloud etc. Key Responsibilities: Collaborate with various Infrastructure, Applications, platforms, and cloud teams on Observability solutions. Implement monitoring solutions using APM tools and Grafana for visualization - setup, configuration and developing monitoring / alerting solutions. Manage Grafana platform with team-specific dashboards covering various KPIs & data sources, enable with alerts and establish SLOs. Troubleshoot and resolve issues related to Observability solutions - Gaps, challenges and addressing solutions part of Production incidents. Analyze Infrastructure systems, services, and technologies towards monitoring, alerting and Incident response needs. Work in apps, platforms and infra services on resilient infrastructure, scalable, and highly available environment. Collaborate with App and services teams/SMEs to integrat...

Ready to Apply?

Submit your application for Site Reliability Engineer at Vertex Elite LLC

Apply Now