Site Reliability Engineer
To see similar active jobs please follow this link: Remote System Administration jobs
We consistently top the charts as one of if not the most used Sports Betting website in the countries we operate in.
With millions of weekly active users, we strive to be the best in industry for our users.
In addition to our DevOps Team we are building a Site Reliability Team whose purpose is to focus on site reliability and security. It will also involved deployment, configuration, and monitoring, as well as the availability, latency, change management, emergency response, and capacity management of services in production.
Responsibilities
Work with a team of DevOps/SRE and DBA professionals
Improve existing infrastructure and processes currently deployed in as well as streamlining processes deploy to new countries in the future
Holistically improve all aspects of our current infrastructure including: reducing costs; streamlining environment provisioning; lowering response times and incorporating the latest techniques and technologies
Monitor and maintain the existing cloud infrastructure via autoscaling, automated alerts, andOpsWork and Grafana dashboards
Take ownership and responsibility for our cloud operation activities
Liaise with external security agencies for annual audits as well as perform our own internal security sweeps
Aid in reconfiguring existing architecture to allow for rapid deployments to new countries
Mentoring less experienced team members
Requirements
4+ years SRE/DevOps experience
Be based in Latin America
Experience independently leading the planning and deployment of a project
Experienced with cloud platforms, especially AWS, including solid knowledge of how to utilize cloud resources to fulfill the demand from other teams and production
Familiar with one program language or script language (Python, Java....)
Experience managing multiple kubernetes clusters in production (virtualization, orchestration, scalability, security, and high availability), skillset such as Helm, Rancher, ArgoCD
Solid networking protocol and cyber security knowledge, especially the TCP / IP stack and HTTP protocol
A strong understanding of cache, including CDN, HTTP cache (CloudFlare, AWS CloudFront)
Experienced with CloudNative Monitoring solution in Large distributed system using observation model(Trace, Metric, Logging), skillset such as Prometheus, Jaeger, Loki, ELK, Grafana
Excellent troubleshooting skills, including Linux OS issue diagnosis and OS parameter optimization
Beneficial
Experience working with other cloud platform is a plus. (GCP, Azure, AliCloud)
Familiar with at least one of infrastructure as Code (Terraform, Cloudformation)
Design and implement CI/CD workflow is a plus (Jenkins, Github Action)
Experience with system automation tools (Ansible, Salt, Chef)
Understanding of modern Micro Services and Service Mesh concepts is a plus(Containers, Istio)
Benefits
Quarterly and flash bonuses
We have core hours of 10am-3pm in a local timezone, but flexible hours outside of this
Top-of-the-line equipment
Referral bonuses
28 days paid annual leave
Annual company retreat
Highly talented, dependable co-workers in a global, multicultural organisation
Payment via DEEL, a world class online wallet system
Our teams are small enough for you to be impactful
Our business is globally established and successful, offering stability and security to our Team Members
Our Mission
Our mission is to be an everyday entertainment platform for everyone
Our Operating Principles
1. Create Value for Users
2. Act in the Long-Term Interests of Sporty
3. Focus on Product Improvements & Innovation
4. Be Responsible
5. Preserve Integrity & Honesty
6. Respect Confidentiality & Privacy
7. Ensure Stability, Security & Scalability
8. Work Hard with Passion & Pride
Interview Process
Online HackerRank Test (Max time of 90 Minutes)
Remote video screening with our Talent Acquisition Team
Remote video interview with 3 x Team Members (45 mins each, not separate days)
24-72 hour feedback loops throughout process
Post Interview Process
Feedback call on successful interview
Offer released followed by contract
ID Check Via Zinc & 2 references from previous employers
Working at Sporty
The top-down mentality at Sporty is high performance based, meaning we trust you to do your job with an emphasis on support to help you achieve, grow and de-block any issues when they're in your way.
Generally employees can choose their own hours, as long as they are collaborating and doing stand-ups etc. The emphasis is really on results.
As we are a highly structured and established company we are able to offer the security and support of a global business with the allure of a startup environment. Sporty is independently managed and financed, meaning we don’t have arbitrary shareholder or VC targets to cater to.
We literally build, spend and make decisions based on the ethos of building THE best platform of its kind. We are truly a tech company to the core and take excellent care of our Team Members.
Site Reliability Engineer
To see similar active jobs please follow this link: Remote System Administration jobs
We consistently top the charts as one of if not the most used Sports Betting website in the countries we operate in.
With millions of weekly active users, we strive to be the best in industry for our users.
In addition to our DevOps Team we are building a Site Reliability Team whose purpose is to focus on site reliability and security. It will also involved deployment, configuration, and monitoring, as well as the availability, latency, change management, emergency response, and capacity management of services in production.
Responsibilities
Work with a team of DevOps/SRE and DBA professionals
Improve existing infrastructure and processes currently deployed in as well as streamlining processes deploy to new countries in the future
Holistically improve all aspects of our current infrastructure including: reducing costs; streamlining environment provisioning; lowering response times and incorporating the latest techniques and technologies
Monitor and maintain the existing cloud infrastructure via autoscaling, automated alerts, andOpsWork and Grafana dashboards
Take ownership and responsibility for our cloud operation activities
Liaise with external security agencies for annual audits as well as perform our own internal security sweeps
Aid in reconfiguring existing architecture to allow for rapid deployments to new countries
Mentoring less experienced team members
Requirements
4+ years SRE/DevOps experience
Be based in Latin America
Experience independently leading the planning and deployment of a project
Experienced with cloud platforms, especially AWS, including solid knowledge of how to utilize cloud resources to fulfill the demand from other teams and production
Familiar with one program language or script language (Python, Java....)
Experience managing multiple kubernetes clusters in production (virtualization, orchestration, scalability, security, and high availability), skillset such as Helm, Rancher, ArgoCD
Solid networking protocol and cyber security knowledge, especially the TCP / IP stack and HTTP protocol
A strong understanding of cache, including CDN, HTTP cache (CloudFlare, AWS CloudFront)
Experienced with CloudNative Monitoring solution in Large distributed system using observation model(Trace, Metric, Logging), skillset such as Prometheus, Jaeger, Loki, ELK, Grafana
Excellent troubleshooting skills, including Linux OS issue diagnosis and OS parameter optimization
Beneficial
Experience working with other cloud platform is a plus. (GCP, Azure, AliCloud)
Familiar with at least one of infrastructure as Code (Terraform, Cloudformation)
Design and implement CI/CD workflow is a plus (Jenkins, Github Action)
Experience with system automation tools (Ansible, Salt, Chef)
Understanding of modern Micro Services and Service Mesh concepts is a plus(Containers, Istio)
Benefits
Quarterly and flash bonuses
We have core hours of 10am-3pm in a local timezone, but flexible hours outside of this
Top-of-the-line equipment
Referral bonuses
28 days paid annual leave
Annual company retreat
Highly talented, dependable co-workers in a global, multicultural organisation
Payment via DEEL, a world class online wallet system
Our teams are small enough for you to be impactful
Our business is globally established and successful, offering stability and security to our Team Members
Our Mission
Our mission is to be an everyday entertainment platform for everyone
Our Operating Principles
1. Create Value for Users
2. Act in the Long-Term Interests of Sporty
3. Focus on Product Improvements & Innovation
4. Be Responsible
5. Preserve Integrity & Honesty
6. Respect Confidentiality & Privacy
7. Ensure Stability, Security & Scalability
8. Work Hard with Passion & Pride
Interview Process
Online HackerRank Test (Max time of 90 Minutes)
Remote video screening with our Talent Acquisition Team
Remote video interview with 3 x Team Members (45 mins each, not separate days)
24-72 hour feedback loops throughout process
Post Interview Process
Feedback call on successful interview
Offer released followed by contract
ID Check Via Zinc & 2 references from previous employers
Working at Sporty
The top-down mentality at Sporty is high performance based, meaning we trust you to do your job with an emphasis on support to help you achieve, grow and de-block any issues when they're in your way.
Generally employees can choose their own hours, as long as they are collaborating and doing stand-ups etc. The emphasis is really on results.
As we are a highly structured and established company we are able to offer the security and support of a global business with the allure of a startup environment. Sporty is independently managed and financed, meaning we don’t have arbitrary shareholder or VC targets to cater to.
We literally build, spend and make decisions based on the ethos of building THE best platform of its kind. We are truly a tech company to the core and take excellent care of our Team Members.