Senior Staff Operations Engineer
The Community You Will Join:
Airbnb is a company with a mission to create a world where anyone can belong anywhere, achieved through a unified team adhering to core values. The BizTech department plays a crucial role in this mission by providing reliable internal systems, innovative products, and technical support, fostering an empowered and inclusive progress. They also create technical breakthroughs and strategies that redefine the concept of belonging anywhere, delivering value to both the business and its people.
The Global Operations arm of BizTech manages production services in the corporate environment. A Senior Staff Operations Engineer within this team focuses on technology strategy for observability architecture, operational efficacy, and automation. They work closely with fellow Operations team members and BizTech engineering teams to develop solutions, anticipate, and resolve issues. Their role requires experience in Site Reliability Engineering and Observability development.
The Difference You Will Make:
Your scope includes driving projects spanning multiple products and platforms. The work you lead will create customer and community value, balance for present and future, and ensure world class work product.
You’re the owner of the BizTech Observability platform vision, strategy, and roadmap, leveraging your infrastructure and software architecture expertise. You will ensure execution of the strategy is equally important and successful, where you will plan and prioritize effectively, instill team and individual accountability, collaborate and navigate conflict. You will collaborate with various BizTech engineering teams to establish and maintain service level objectives and indicators, contributing to the overall efficiency and security of our services. You will lead the planning of operations architecture for the next 1-3 years, connecting disparate systems in production to improve compatibility and stability.
You're tasked with identifying and rectifying persistent issues reported, through scalable automated solutions, thereby enhancing operational performance and productivity. You're also responsible for spearheading the development and upkeep of testing and monitoring tools, ensuring the continuous operation of all automation platforms.
You're in charge of the quality and reliability of BizTech services, which includes verifying post mortems, conducting root-cause analysis, and implementing corrective actions.
A Typical Day:
Lead technical strategy and discussions, collaborating with Operations peers and cross-functional BizTech teams to build operations solutions for observability and automation.
Work as a team, stay on top of tasks, engagements, and interactions with colleagues. Active participation and collaboration is a recipe for success.
Work in a model of sprints, project tasks that involve coding, testing, designing, documenting, and reviewing operational readiness.
Dedicate a portion of the day to core Operations tasks, which involve addressing requests and issues that our users have identified and reported via tickets. Strive to comprehend the requests at hand, identify patterns, and resolve them with solutions that can make handling these types of problems more efficient.
Being part of an on-call rotation could mean that you are called upon to address and lead resolution of high-severity incidents related to production services, taking on a double role as an incident commander and operations engineer.
Your Expertise:
15+ years combination of IT Operations, Site Reliability, Observability, Infrastructure, Software Development, AIOps, architecture.
Proven experience with Software Development Lifecycles including infrastructure as code, configuration management, distributed version control system, and continuous delivery technology processes
Proficient in complex corporate infrastructure environments, particularly in Cloud technologies like AWS, with a focus on automation and observability. This includes designing logging pipelines, monitoring and alerting frameworks, open telemetry, and tracing tools, as well as CI/CD pipelines
Have good understanding of networking (e.g. Cisco, Palo Alto), systems (e.g. Chef, Terraform, Jenkins, Kubernetes), applications, and SaaS technologies
Proven expertise in leading across various teams and organizations on large-scale, technically complex, and ambiguous projects that proactively address the needs of the Business and ensure successful implementation.
Coding proficiency in Python or Go
Primary developer for API integrations, and event driven architecture (AWS Lambda/SQS architecture).
This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.
Our Commitment To Inclusion & Belonging:
Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply.
We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: reasonableaccommodations@airbnb.com. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process.
We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.
About the job
Apply for this position
Senior Staff Operations Engineer
The Community You Will Join:
Airbnb is a company with a mission to create a world where anyone can belong anywhere, achieved through a unified team adhering to core values. The BizTech department plays a crucial role in this mission by providing reliable internal systems, innovative products, and technical support, fostering an empowered and inclusive progress. They also create technical breakthroughs and strategies that redefine the concept of belonging anywhere, delivering value to both the business and its people.
The Global Operations arm of BizTech manages production services in the corporate environment. A Senior Staff Operations Engineer within this team focuses on technology strategy for observability architecture, operational efficacy, and automation. They work closely with fellow Operations team members and BizTech engineering teams to develop solutions, anticipate, and resolve issues. Their role requires experience in Site Reliability Engineering and Observability development.
The Difference You Will Make:
Your scope includes driving projects spanning multiple products and platforms. The work you lead will create customer and community value, balance for present and future, and ensure world class work product.
You’re the owner of the BizTech Observability platform vision, strategy, and roadmap, leveraging your infrastructure and software architecture expertise. You will ensure execution of the strategy is equally important and successful, where you will plan and prioritize effectively, instill team and individual accountability, collaborate and navigate conflict. You will collaborate with various BizTech engineering teams to establish and maintain service level objectives and indicators, contributing to the overall efficiency and security of our services. You will lead the planning of operations architecture for the next 1-3 years, connecting disparate systems in production to improve compatibility and stability.
You're tasked with identifying and rectifying persistent issues reported, through scalable automated solutions, thereby enhancing operational performance and productivity. You're also responsible for spearheading the development and upkeep of testing and monitoring tools, ensuring the continuous operation of all automation platforms.
You're in charge of the quality and reliability of BizTech services, which includes verifying post mortems, conducting root-cause analysis, and implementing corrective actions.
A Typical Day:
Lead technical strategy and discussions, collaborating with Operations peers and cross-functional BizTech teams to build operations solutions for observability and automation.
Work as a team, stay on top of tasks, engagements, and interactions with colleagues. Active participation and collaboration is a recipe for success.
Work in a model of sprints, project tasks that involve coding, testing, designing, documenting, and reviewing operational readiness.
Dedicate a portion of the day to core Operations tasks, which involve addressing requests and issues that our users have identified and reported via tickets. Strive to comprehend the requests at hand, identify patterns, and resolve them with solutions that can make handling these types of problems more efficient.
Being part of an on-call rotation could mean that you are called upon to address and lead resolution of high-severity incidents related to production services, taking on a double role as an incident commander and operations engineer.
Your Expertise:
15+ years combination of IT Operations, Site Reliability, Observability, Infrastructure, Software Development, AIOps, architecture.
Proven experience with Software Development Lifecycles including infrastructure as code, configuration management, distributed version control system, and continuous delivery technology processes
Proficient in complex corporate infrastructure environments, particularly in Cloud technologies like AWS, with a focus on automation and observability. This includes designing logging pipelines, monitoring and alerting frameworks, open telemetry, and tracing tools, as well as CI/CD pipelines
Have good understanding of networking (e.g. Cisco, Palo Alto), systems (e.g. Chef, Terraform, Jenkins, Kubernetes), applications, and SaaS technologies
Proven expertise in leading across various teams and organizations on large-scale, technically complex, and ambiguous projects that proactively address the needs of the Business and ensure successful implementation.
Coding proficiency in Python or Go
Primary developer for API integrations, and event driven architecture (AWS Lambda/SQS architecture).
This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.
Our Commitment To Inclusion & Belonging:
Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply.
We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: reasonableaccommodations@airbnb.com. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process.
We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.