Site Reliability Engineer
About the Role:
We are seeking a Site Reliability Engineer to help build a reliable web experience for our users. We believe that moving fast is our competitive advantage and that it enables us to better serve our users. We also know that the faster we move, the more likely we are to break things.
You Will:
Actively seek and identify opportunities to improve the availability and performance of the system by applying lessons learned from monitoring and observation.
Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
Implement SRE practices ensuring availability, scalability and observability of production systems with a strong focus on excellent customer experience
Apply an understanding of Infrastructure as Code
Manage incidents and emergency response, track outages, ensure data integrity, and engineer releases to promote rapid deployments
Handle emergency response either by being on-call or by reacting to symptoms surfaced by monitoring, escalating when needed
Implement monitoring, logging, alerting, and SLO reporting
Identify Service Level Indicators (SLIs) that will align the team to meet the availability and performance objectives.
Run blameless root cause analyses (RCAs) on incidents and outages, aggressively looking for answers that will prevent recurrence.
Demonstrate strong technical skills and expertise in at least one object-oriented programming language
Independently handle complex technical tasks in projects.
You Have:
3+ years as a software engineer, shipping production code.
1+ years of experience as a Site Reliability Engineer or Production Support Engineer
Bachelor's degree in Computer Science, Engineering, or related field, or relevant years of work experience
Proficiency with relational databases (PostgreSQL, MySQL, SQL Server, etc.)
Proficiency in SQL scripting
Proficiency developing in one or more languages such as Java, Kotlin, Python, and/or others
Proficiency in Git or other VCS
Good debugging and troubleshooting skills
Strong technical competency, with a data-driven analytical approach towards solving complex challenges
Our Benefits (there are more but here are some highlights):
Competitive salary & equity compensation for full-time roles
Unlimited PTO, company holidays, and quarterly mental health days
Comprehensive health benefits including medical, dental & vision, and parental leave
Employee Stock Purchase Program (ESPP)
Employee discounts on hims & hers & Apostrophe online products
401k benefits with employer matching contribution
Offsite team retreats