Platform SRE & Observability Engineer (GPU/Kubernetes)

CirrusLabs · ciudad de méxico, ciudad de méxico, Mexico

Location
ciudad de méxico
Job Type
Full-time
Posted
June 05, 2026

Job Description

CirrusLabs is searching for a Platform Site Reliability Engineer (SRE) in Mexico City. The role focuses on supporting the reliability and observability of AI platform environments. Candidates should have hands-on experience in Linux troubleshooting, operational automation, and incident response, particularly in Kubernetes and GPU operations.

Eligible applicants will also be proficient in using tools like Prometheus and Grafana to monitor platform health. This role demands strong collaboration skills and the ability to automate operational tasks.

#J-18808-Ljbffr

Ready to Apply?

Submit your application for Platform SRE & Observability Engineer (GPU/Kubernetes) at CirrusLabs

Apply Now