Intermediate Site Reliability Engineer, Environment Automation
This GitLab SRE role is a strong fit for ITSG readers who like platform operations, production reliability, and automating the lifecycle of cloud and Kubernetes environments rather than doing repetitive manual ops work.
What you would work on:
- Contribute to infrastructure automation with Terraform, Ansible, and Kubernetes for provisioning, upgrades, and operation of GitLab environments.
- Debug production issues across Kubernetes clusters, GitLab components, and cloud services, then help build safeguards to prevent repeat incidents.
- Create and maintain deployment and orchestration tooling including Helm charts, omnibus-gitlab configurations, and multi-tenant workflows.
- Build and refine observability for multi-tenant GitLab environments and participate in on-call incident response with the SRE team.
Good fit if:
- You have SRE or similar production infrastructure experience and want to automate many environments or tenants in parallel.
- You have hands-on Kubernetes production experience plus familiarity with Terraform, Ansible, Git workflows, and infrastructure as code.
- You can work with backend tooling such as Golang and are comfortable improving runbooks, documentation, and repeatable operations.
Curated from GitLab Greenhouse for IT Support Group readers. This is an external listing; use the apply link for the source listing and latest details.