Senior Software Engineer - Data Platform
The Data Platform team is looking to hire a Software Engineer who is excited about building Reddit’s next generation Data Warehouse Platform to support a rapidly growing business with an ever expanding number of use cases. A subset of current focuses include:
Managing BigQuery + Airflow infrastructure for the entire company
Designing clear interfaces and self-service tooling to enable both technical and non technical users to interact with the DW platform
Optimizing cost efficiency within the warehouse and building robust solutions for cost attribution
Building opinionated guardrails to drive improvements in data quality, cost efficiency, and data governance
Developing APIs and controllers to maintain the state of our IAM, compute, and storage resources
Building software automation to simplify the ingestion of data into the system and the consumption of downstream data and metadata
If you have a passion for building and maintaining high quality code, want to improve how Reddit makes strategic decisions at the company level, and are excited about applying engineering best practices to one of the most powerful corpus of data in the world, then this is the team for you!
In your day-to-day, you can expect to:
Collaborate effectively with a team of proficient software engineers to develop and maintain the fundamental platform that powers the cutting-edge Reddit's data warehouse infrastructure
Engage in the complete data lifecycle at Reddit, participating in the development process and working with one of the world's most extensive and data-rich datasets.
Design, Build and Deliver end-to-end data solutions to improve the reliability, scalability, latency and efficiency of Reddit’s Data Platform
Implement automation for key elements of the development process, including data quality, managing alerts and handling critical infrastructure operations.
Collaborate and Share on-call responsibilities, including incident management, with the Data Warehouse team
Guide and support fellow engineers within the team by serving as a mentor, while actively contributing to the sharing of knowledge through training sessions and comprehensive documentation
Who you might be:
4+ years of software engineering experience in a production setting writing clean, maintainable, and well-tested code
Proficient in object-oriented programming languages like Python and Scala, experienced writing code in Go, and having expertise in SQL languages like BigQuery, SparkSQL or Postgres
Demonstrated expertise in designing and implementing large-scale systems, diligently monitoring project progress, and showcasing proactive leadership as a self-starter on diverse projects
Experience working with cloud services, GCP products, terraform, airflow, Kubernetes, CI/CD, and working with modern cloud-based infrastructure
Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context
Benefits:
Comprehensive Healthcare Benefits
401k Matching
Workspace benefits for your home office
Personal & Professional development funds
Family Planning Support
Flexible Vacation (please use them!) & Reddit Global Wellness Days
4+ months paid Parental Leave
Paid Volunteer time off
#LI-remote, #LI-JS5
About the job
Apply for this position
Senior Software Engineer - Data Platform
The Data Platform team is looking to hire a Software Engineer who is excited about building Reddit’s next generation Data Warehouse Platform to support a rapidly growing business with an ever expanding number of use cases. A subset of current focuses include:
Managing BigQuery + Airflow infrastructure for the entire company
Designing clear interfaces and self-service tooling to enable both technical and non technical users to interact with the DW platform
Optimizing cost efficiency within the warehouse and building robust solutions for cost attribution
Building opinionated guardrails to drive improvements in data quality, cost efficiency, and data governance
Developing APIs and controllers to maintain the state of our IAM, compute, and storage resources
Building software automation to simplify the ingestion of data into the system and the consumption of downstream data and metadata
If you have a passion for building and maintaining high quality code, want to improve how Reddit makes strategic decisions at the company level, and are excited about applying engineering best practices to one of the most powerful corpus of data in the world, then this is the team for you!
In your day-to-day, you can expect to:
Collaborate effectively with a team of proficient software engineers to develop and maintain the fundamental platform that powers the cutting-edge Reddit's data warehouse infrastructure
Engage in the complete data lifecycle at Reddit, participating in the development process and working with one of the world's most extensive and data-rich datasets.
Design, Build and Deliver end-to-end data solutions to improve the reliability, scalability, latency and efficiency of Reddit’s Data Platform
Implement automation for key elements of the development process, including data quality, managing alerts and handling critical infrastructure operations.
Collaborate and Share on-call responsibilities, including incident management, with the Data Warehouse team
Guide and support fellow engineers within the team by serving as a mentor, while actively contributing to the sharing of knowledge through training sessions and comprehensive documentation
Who you might be:
4+ years of software engineering experience in a production setting writing clean, maintainable, and well-tested code
Proficient in object-oriented programming languages like Python and Scala, experienced writing code in Go, and having expertise in SQL languages like BigQuery, SparkSQL or Postgres
Demonstrated expertise in designing and implementing large-scale systems, diligently monitoring project progress, and showcasing proactive leadership as a self-starter on diverse projects
Experience working with cloud services, GCP products, terraform, airflow, Kubernetes, CI/CD, and working with modern cloud-based infrastructure
Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context
Benefits:
Comprehensive Healthcare Benefits
401k Matching
Workspace benefits for your home office
Personal & Professional development funds
Family Planning Support
Flexible Vacation (please use them!) & Reddit Global Wellness Days
4+ months paid Parental Leave
Paid Volunteer time off
#LI-remote, #LI-JS5