Senior Site Reliability Engineer - Runway
Senior Site Reliability Engineer
About the Role
The GitLab Runway team is working on our next-generation platform to rapidly deploy backend services that automatically take advantage of GitLab infrastructure, security, observability, and data access. It’s a platform engineering project in the truest sense. We’re enabling self-service development across the entire GitLab engineering ecosystem to quickly build and deploy services to complement the GitLab product offerings.
We're seeking a Senior Site Reliability Engineer to join our team and help us build, maintain, and optimize our infrastructure. In this role, you'll collaborate with cross-functional teams to ensure our systems are reliable, scalable, and performant.
Key Responsibilities
Design, implement, and maintain infrastructure on both GCP and AWS
Help create and maintain Kubernetes tooling, logging, secrets management and utilities
Build and improve monitoring, alerting, and logging systems
Participate in on-call rotation to address critical issues
Automate manual processes to increase efficiency and reduce errors
Lead incident response, including postmortem analysis
Contribute to capacity planning and cost optimization
Required Qualifications
5+ years of experience in DevOps, SRE, or similar roles
Strong experience with both GCP and AWS cloud platforms
Proficiency with Kubernetes and container orchestration
Solid programming skills in Golang and scripting languages
Experience designing and implementing logging solutions
Demonstrated ability to automate infrastructure operations
Experience with on-call rotations and incident management
Strong troubleshooting and problem-solving skills
Excellent communication skills and ability to work in a team
Comfortable in a fully remote, heavily asynchronous environment across AMER, EMEA, and APAC regions
Preferred Qualifications
Experience with infrastructure as code tools (Terraform, Pulumi)
Knowledge of observability tools (Prometheus, Grafana, etc.)
Secrets management with HashiCorp Vault or OpenBao
Experience with service mesh technologies (Istio, Linkerd)
Background in distributed systems and microservice architectures
Security best practices in cloud-native environments
Experience with GitLab CI/CD pipelines and workflows
Mandatory non-technical skills, experience and characteristics
Willingness and ability to live and promote Gitlab's unique CREDIT Values in one's day to day work and interactions with teammates.
Superior verbal and written communication skills
Cool, collected and composed under pressure
Comfortable and productive working asynchronously across timezones and cultures, at the speed and scale of business.
Enable others to excel
Be a Leader of One
Act Like an Owner with Gitlab's resources.
How GitLab will support you
All remote, asynchronous work environment
Home office support
Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.
Senior Site Reliability Engineer - Runway
Senior Site Reliability Engineer
About the Role
The GitLab Runway team is working on our next-generation platform to rapidly deploy backend services that automatically take advantage of GitLab infrastructure, security, observability, and data access. It’s a platform engineering project in the truest sense. We’re enabling self-service development across the entire GitLab engineering ecosystem to quickly build and deploy services to complement the GitLab product offerings.
We're seeking a Senior Site Reliability Engineer to join our team and help us build, maintain, and optimize our infrastructure. In this role, you'll collaborate with cross-functional teams to ensure our systems are reliable, scalable, and performant.
Key Responsibilities
Design, implement, and maintain infrastructure on both GCP and AWS
Help create and maintain Kubernetes tooling, logging, secrets management and utilities
Build and improve monitoring, alerting, and logging systems
Participate in on-call rotation to address critical issues
Automate manual processes to increase efficiency and reduce errors
Lead incident response, including postmortem analysis
Contribute to capacity planning and cost optimization
Required Qualifications
5+ years of experience in DevOps, SRE, or similar roles
Strong experience with both GCP and AWS cloud platforms
Proficiency with Kubernetes and container orchestration
Solid programming skills in Golang and scripting languages
Experience designing and implementing logging solutions
Demonstrated ability to automate infrastructure operations
Experience with on-call rotations and incident management
Strong troubleshooting and problem-solving skills
Excellent communication skills and ability to work in a team
Comfortable in a fully remote, heavily asynchronous environment across AMER, EMEA, and APAC regions
Preferred Qualifications
Experience with infrastructure as code tools (Terraform, Pulumi)
Knowledge of observability tools (Prometheus, Grafana, etc.)
Secrets management with HashiCorp Vault or OpenBao
Experience with service mesh technologies (Istio, Linkerd)
Background in distributed systems and microservice architectures
Security best practices in cloud-native environments
Experience with GitLab CI/CD pipelines and workflows
Mandatory non-technical skills, experience and characteristics
Willingness and ability to live and promote Gitlab's unique CREDIT Values in one's day to day work and interactions with teammates.
Superior verbal and written communication skills
Cool, collected and composed under pressure
Comfortable and productive working asynchronously across timezones and cultures, at the speed and scale of business.
Enable others to excel
Be a Leader of One
Act Like an Owner with Gitlab's resources.
How GitLab will support you
All remote, asynchronous work environment
Home office support
Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.