GitLab Inc. is a company based on the GitLab open-source project. GitLab is an integrated product that unifies issues, code review, CI and CD into a single UI. GitLab Inc. offers self-hosted products and SaaS plans for GitLab.com. GitLab is an open-source project with a large community. Over 1700 people worldwide have contributed to GitLab! GitLab Inc. is an...
Site Reliability Engineers (SREs) are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound engineering principles, operational discipline, and mature automation to our environments and the GitLab codebase. GitLab.com is a unique site and it brings unique challenges: it’s the biggest GitLab instance in existence. In fact, it’s one of the largest single-tenancy open-source SaaS sites on the internet.
The Delivery Group consists of two teams: Orchestration and System. This role is within the Orchestration team and focuses on creating and maintaining a deployment and release platform that allows development teams to release their changes to users as quickly and effectively as possible. The role involves collaboration across the Engineering department. Typical projects include designing and improving deployment pipelines, creating Kubernetes environments to support release processes, and improving our release tools to reduce deployment and release workload.
SREs in this role are not part of the typical on-call rotation. Instead, Delivery team members take part in the Release Management rotation to look after deployments and releases. This is a working-hours-only role, and most people spend around 4 months of the year acting as a Release Manager. All other time is project-focused.
What You Need To Have
Demonstrated experience building a platform in a public cloud environment
Experience working with Kubernetes, even better if you've worked with GKE and/or Helm
Experience working with Terraform
Demonstrated coding skills, ideally with Ruby or Go, but we're open to other languages if you're willing to learn
Comfortable working asynchronously with a globally distributed team