Staff Data Pipeline Engineer
at Mozilla Corporation
Team:
Core Product-Pocket
Locations:
Remote US, Remote Canada, Remote Toronto/Vancouver Area, Remote San Francisco Bay Area

The company

Pocket empowers people to discover, organize, consume, and share content that matters to them. Our apps and platform are essential ways that tens of millions of people discover and consume content on the web. Pocket is the Web, curated: for you and by you.

The opportunity

For content recommendations, everything starts with data. Pocket’s Data Products team builds systems that combine machine learning with editorial expertise to surface high-quality content from across the internet. Ensuring data privacy when collecting, distributing, validating, and securing data at scale is no small task and every engineer on our team plays a vital role in shaping each user’s experience.

We are looking for a Lead Data Pipeline Engineer to own the design and development of data pipeline applications for complex, extensible, and highly scalable cloud-based data platforms. Are you passionate about building intuitive data models? Do you excel at taking vague requirements and crystallizing them into scalable data solutions? We invite you to apply!

People who excel on our team thrive in a small, dynamic environments. We cover many areas including machine learning, product engineering, machine learning operations, and data modeling, among others.

Who you are

  • Enjoy working on small, dynamic teams.
  • Understand Data Lifecycle and concepts such as lineage, governance, privacy, retention, anonymity, etc.
  • Conceptually familiar with AWS cloud resources (S3, EC2, RDS etc).
  • A trusted authority in distributed data processing patterns.
  • Highly proficient in at least one of Java, Python or Scala.
  • Comfortable with complex SQL
  • Experience designing, building, and maintaining data lakes.

What you'll do

  • Build and maintain data pipeline applications
  • Design, create and maintain the data platform data model at the conceptual, logical, and physical levels.
  • Establish data security, quality, load, transport and performance models.
  • Research, design, document and modify data pipeline software specifications throughout the production life cycle.
  • Develop and maintain stakeholder documentation and operations procedures, programs, security, etc. and assist in eliminating
  • redundancy and automating manual processes.
  • Assist in developing standards and criteria for the successful implementation of new systems.
  • Perform code reviews and mentor other engineers.

Bonus experience

  • Cloud warehouses: Snowflake, BigQuery, Redshift
  • Feature stores: Sagemaker, Databricks, Vertex
  • Orchestrators: Airflow, Prefect
  • Compute frameworks: AWS Glue, Spark, Hadoop, Athena
  • Streaming data: Kinesis, Kafka
  • Data modeling: DBT

Commitment to diversity, equity, inclusion, and belonging

Mozilla understands that valuing diverse creative practices and forms of knowledge are crucial to and enrich the company’s core mission. We encourage applications from everyone, including members of all equity-seeking communities, such as (but certainly not limited to) women, racialized and Indigenous persons, persons with disabilities, persons of all sexual orientations, gender identities and expressions.

We will ensure that qualified individuals with disabilities are provided reasonable accommodations to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment, as appropriate. Please contact us at hiringaccommodation@mozilla.com to request accommodation.

We are an equal opportunity employer. We do not discriminate on the basis of race (including hairstyle and texture), religion (including religious grooming and dress practices), gender, gender identity, gender expression, color, national origin, pregnancy, ancestry, domestic partner status, disability, sexual orientation, age, genetic predisposition, medical condition, marital status, citizenship status, military or veteran status, or any other basis covered by applicable laws. Mozilla will not tolerate discrimination or harassment based on any of these characteristics or any other unlawful behavior, conduct, or purpose.

Group: C

#LI-REMOTE

Why Mozilla?

At Mozilla, we’re serving humanity—by maintaining a safe, open internet—while also helping the individual humans employed here to reach their personal and professional goals. With a relatively small team serving hundreds of millions of people, a culture of exploration, and a commitment to mentorship, opportunities abound to learn and grow at Mozilla.


Our values drive our actions

  • Purpose is built into our work, with our mission driving every decision
  • We challenge assumptions, the status quo, ourselves, and each other
  • We are transparent: in our code, our business partnerships, and our everyday interactions
  • We seek out people from diverse backgrounds and with perspectives different from our own
  • We pair purpose with performance and put people ahead of profit

Our impact is global

  • 700+ paid staff from over 30 countries
  • Thousands of volunteer contributors across six continents
  • 9 global offices: Mountain View, San Francisco, Portland, Vancouver, Toronto, Paris, London, Berlin, and Beijing
  • Hundreds of home offices globally

Our benefits are world-class

  • Flexible work environment (nearly half of Mozillians work remotely)
  • Industry-leading paid parental leave (up to 26 weeks of fully paid leave for childbearing parents and up to 12 weeks for non-childbearing parents)
  • Reimbursement for professional development (up to $3,000/year)
  • A work setup including the latest hardware and software of your choice