Senior Data Engineer (MLOps)
Who We Are
Massive Rocket is a high-growth Braze & Snowflake agency that has made significant strides in connecting digital marketing teams with product and engineering units. Founded just 5 years ago, we have experienced swift growth and are now at a crucial juncture, aspiring to reach $100M in revenue. Our focus is on delivering human experiences at scale, leveraging the latest in web, mobile, cloud, data, and AI technologies. We pride ourselves on innovation and the delivery of cutting-edge digital solutions.
Every role at Massive Rocket is Entrepreneurial - Successful people at Massive Rocket will not only think about their role but understand the roles around them, their goals and contribute to the success and growth of their team, customers and partners.
What We Offer
π Fast-moving environment β you will never stop learning and growing
β€οΈ Supportive and positive work culture with an emphasis on our values
π International presence β work with team members in Europe, the US, and around the globe
πͺ 100% remote forever
π΄ Flexible Vacation Policy
π§πΌββοΈ Career progression paths and opportunities for promotion/advancement
π Organised team events and outings
What weβre looking for
Massive Rocket, a global Martech agency specializing in Braze and Snowflake, is looking for a talented Data Engineer to join our growing team. We work with clients across the U.S., U.K., and European Union, delivering cutting-edge marketing technology solutions.
We are seeking a highly skilled and motivated Data Engineer to join our growing team. As a key member of our engineering organization, you will be resigning and implementing robust, scalable, and efficient data systems that power analytics, machine learning models, and business insights.. You will work closely with our engineering and product teams to deliver cutting edge AI and data solutions.
Responsibilities
i) Data Architecture & Development:
- Design and implement scalable, secure, and high-performance data lake and data warehouse solutions.
- Leverage best practices in schema design, partitioning, and optimisation for efficient storage and retrieval.
- Build and maintain data models to support analytics and machine learning workflows.
ii) Pipeline Orchestration:
- Develop, monitor, and optimize ETL/ELT workflows using Apache Airflow.
- Ensure data pipelines are robust, error-tolerant, and scalable for real-time and batch processing.
iii) Data Scraping & Unstructured Data Processing:
- Develop and maintain scalable web scraping solutions to collect data from diverse sources, including APIs, websites, and other unstructured data sources.
- Extract, clean, and transform unstructured data such as text, images, and log files into structured formats suitable for analysis.
- Use tools and frameworks like BeautifulSoup, Scrapy, or Selenium for web scraping, and natural language processing (NLP) techniques for text processing.
iv) Cloud Integration:
- Design and implement cloud-native data solutions with Microsoft Azure.
- Optimize costs and performance of cloud-based data solutions.
v) Infrastructure as Code (IaC):
- Use Terraform to automate the provisioning and management of cloud infrastructure.
- Define reusable and modular Terraform configurations to support scalable deployment of resources.
vi) MLOps:
- Collaborate with data scientists and machine learning engineers to operationalise machine learning models.
- Implement CI/CD pipelines for machine learning workflows, ensuring efficient model deployment and monitoring.
vii) Containerisation and Orchestration:
- Utilize Kubernetes and containerisation technologies (e.g., Docker) to deploy scalable, fault-tolerant data processing systems.
- Manage infrastructure and resource allocation for containerised data applications.
viii) Collaboration: Collaborate effectively with developers and other stakeholders to understand their needs and provide appropriate platform solutions.
ix) Documentation: Maintain comprehensive documentation of platform architecture, processes, and procedures.
Required Skills and Qualifications:
- 5+ years of experience in data engineering or a related field.
- Strong expertise in data pipeline orchestration tools such as Apache Airflow.
- Proven track record of designing and implementing data lakes and warehouses (experience with Azure is a plus).
- Solid understanding of MLOps practices, including model training, deployment, and monitoring.
- Proficiency in programming languages such as Python & SQL.
- Experience with distributed computing frameworks such as Spark.
- Familiarity with version control systems (e.g., Git) and CI/CD pipelines.
- Collaboration and Communication: Effective communicator and team player, comfortable working with cross-functional teams to deliver high-quality solutions.
- Agency experience: Experience working in an agency setting with clients
- English C1 Level: strong communication skills with professional level of proficiency in english
Bonus Skills and Experiences:
- Demonstrated experience with Terraform for infrastructure provisioning and management.
- Hands-on experience with Kubernetes and containerised environments.
- Experience in the healthcare or medical industry.
- Familiarity with compliance standards like HIPAA.
Desired Qualities:
- Innovative Problem-Solver: A creative thinker who can efficiently solve complex problems and adapt to new technologies and changing product requirements.
- Quality Advocate: Passion for quality and a dedication to understanding the userβs perspective and how it impacts the product's overall experience.
- Effective Communicator: Strong interpersonal and communication skills, with the ability to articulate issues, solutions, and concepts to technical and non-technical stakeholders alike.
- Leadership Potential: While direct leadership experience is not mandatory, the aptitude to mentor others and lead by example in software engineering practices is highly valued.
During the process, please be ready to provide:
β’ Valid work visa - Massive Rocket does not provide sponsorship at the moment.
β’ Proof of identification: ID card, passport, Utility bill (Gas, Water, Electricity)
β’ 2 references - Name, Relationship, Contact details (Email, Mobile)
β’ Contractors Only: proof of incorporation and insurance
Note: Please ensure that your qualifications closely match the criteria outlined in the job description. Applications not meeting the specified criteria may not be processed or considered for this position.
Senior Data Engineer (MLOps)
Who We Are
Massive Rocket is a high-growth Braze & Snowflake agency that has made significant strides in connecting digital marketing teams with product and engineering units. Founded just 5 years ago, we have experienced swift growth and are now at a crucial juncture, aspiring to reach $100M in revenue. Our focus is on delivering human experiences at scale, leveraging the latest in web, mobile, cloud, data, and AI technologies. We pride ourselves on innovation and the delivery of cutting-edge digital solutions.
Every role at Massive Rocket is Entrepreneurial - Successful people at Massive Rocket will not only think about their role but understand the roles around them, their goals and contribute to the success and growth of their team, customers and partners.
What We Offer
π Fast-moving environment β you will never stop learning and growing
β€οΈ Supportive and positive work culture with an emphasis on our values
π International presence β work with team members in Europe, the US, and around the globe
πͺ 100% remote forever
π΄ Flexible Vacation Policy
π§πΌββοΈ Career progression paths and opportunities for promotion/advancement
π Organised team events and outings
What weβre looking for
Massive Rocket, a global Martech agency specializing in Braze and Snowflake, is looking for a talented Data Engineer to join our growing team. We work with clients across the U.S., U.K., and European Union, delivering cutting-edge marketing technology solutions.
We are seeking a highly skilled and motivated Data Engineer to join our growing team. As a key member of our engineering organization, you will be resigning and implementing robust, scalable, and efficient data systems that power analytics, machine learning models, and business insights.. You will work closely with our engineering and product teams to deliver cutting edge AI and data solutions.
Responsibilities
i) Data Architecture & Development:
- Design and implement scalable, secure, and high-performance data lake and data warehouse solutions.
- Leverage best practices in schema design, partitioning, and optimisation for efficient storage and retrieval.
- Build and maintain data models to support analytics and machine learning workflows.
ii) Pipeline Orchestration:
- Develop, monitor, and optimize ETL/ELT workflows using Apache Airflow.
- Ensure data pipelines are robust, error-tolerant, and scalable for real-time and batch processing.
iii) Data Scraping & Unstructured Data Processing:
- Develop and maintain scalable web scraping solutions to collect data from diverse sources, including APIs, websites, and other unstructured data sources.
- Extract, clean, and transform unstructured data such as text, images, and log files into structured formats suitable for analysis.
- Use tools and frameworks like BeautifulSoup, Scrapy, or Selenium for web scraping, and natural language processing (NLP) techniques for text processing.
iv) Cloud Integration:
- Design and implement cloud-native data solutions with Microsoft Azure.
- Optimize costs and performance of cloud-based data solutions.
v) Infrastructure as Code (IaC):
- Use Terraform to automate the provisioning and management of cloud infrastructure.
- Define reusable and modular Terraform configurations to support scalable deployment of resources.
vi) MLOps:
- Collaborate with data scientists and machine learning engineers to operationalise machine learning models.
- Implement CI/CD pipelines for machine learning workflows, ensuring efficient model deployment and monitoring.
vii) Containerisation and Orchestration:
- Utilize Kubernetes and containerisation technologies (e.g., Docker) to deploy scalable, fault-tolerant data processing systems.
- Manage infrastructure and resource allocation for containerised data applications.
viii) Collaboration: Collaborate effectively with developers and other stakeholders to understand their needs and provide appropriate platform solutions.
ix) Documentation: Maintain comprehensive documentation of platform architecture, processes, and procedures.
Required Skills and Qualifications:
- 5+ years of experience in data engineering or a related field.
- Strong expertise in data pipeline orchestration tools such as Apache Airflow.
- Proven track record of designing and implementing data lakes and warehouses (experience with Azure is a plus).
- Solid understanding of MLOps practices, including model training, deployment, and monitoring.
- Proficiency in programming languages such as Python & SQL.
- Experience with distributed computing frameworks such as Spark.
- Familiarity with version control systems (e.g., Git) and CI/CD pipelines.
- Collaboration and Communication: Effective communicator and team player, comfortable working with cross-functional teams to deliver high-quality solutions.
- Agency experience: Experience working in an agency setting with clients
- English C1 Level: strong communication skills with professional level of proficiency in english
Bonus Skills and Experiences:
- Demonstrated experience with Terraform for infrastructure provisioning and management.
- Hands-on experience with Kubernetes and containerised environments.
- Experience in the healthcare or medical industry.
- Familiarity with compliance standards like HIPAA.
Desired Qualities:
- Innovative Problem-Solver: A creative thinker who can efficiently solve complex problems and adapt to new technologies and changing product requirements.
- Quality Advocate: Passion for quality and a dedication to understanding the userβs perspective and how it impacts the product's overall experience.
- Effective Communicator: Strong interpersonal and communication skills, with the ability to articulate issues, solutions, and concepts to technical and non-technical stakeholders alike.
- Leadership Potential: While direct leadership experience is not mandatory, the aptitude to mentor others and lead by example in software engineering practices is highly valued.
During the process, please be ready to provide:
β’ Valid work visa - Massive Rocket does not provide sponsorship at the moment.
β’ Proof of identification: ID card, passport, Utility bill (Gas, Water, Electricity)
β’ 2 references - Name, Relationship, Contact details (Email, Mobile)
β’ Contractors Only: proof of incorporation and insurance
Note: Please ensure that your qualifications closely match the criteria outlined in the job description. Applications not meeting the specified criteria may not be processed or considered for this position.