Senior Software Engineer - Data Products
The Data Snapshot team is looking to hire a Senior Software Engineer who is excited to solve large-scale batch and streaming data challenges.
Our community of users generates over 100B analytics events per day, each of which is ingested by the Data Infrastructure team into a data warehouse that sees 55,000+ daily queries. We use this data to enable both batch and streaming data usage at the company. The team also owns our Streaming Platform, which is built using Kafka.
As a senior engineer, you will help develop our Snapshot product used to deliver high-quality data to partners. You will partner with teams around Reddit to create and execute a strategy to ensure data quality and consistency at scale.
In your day-to-day, you can expect to:
Refine and maintain our data infrastructure technologies to support batch and real-time processing of data from hundreds of millions of users.
Own the tools we use to ingest and store data and to improve data quality.
Design, build, and deliver end-to-end data solutions to improve the reliability, scalability, latency, and efficiency of Reddit’s Data Platform.
Implement automation for key elements of the development process, including data quality, managing alerts and handling critical infrastructure operations.
Guide and support fellow engineers within the team by serving as a mentor, while actively contributing to the sharing of knowledge through training sessions and comprehensive documentation.
Who you might be:
4+ years of coding experience in a production setting writing clean, maintainable, and well-tested code.
Excellent communication skills to collaborate with stakeholders in engineering, data science, machine learning, and product.
Experience with programming languages such as Scala, Go, Java, or Python, with expertise in SQL dialects such as BigQuery, SparkSQL, or Postgres.
Experience working with Terraform, Helm, Kafka, Flink, CDC, Airflow, Prometheus, Docker, Kubernetes, and CI/CD.
Degree in Computer Science or equivalent experience.
Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context.
Benefits:
Comprehensive Healthcare Benefits
401k Matching
Workspace benefits for your home office
Personal & Professional development funds
Family Planning Support
Flexible Vacation (please use it!) & Reddit Global Wellness Days
4+ months paid Parental Leave
Paid Volunteer time off
#LI-CK2 #LI-Remote