Remote Otter LogoRemoteOtter

Lead Data Engineer - Privacy-Preserving ML Pipelines - Remote

Posted 12 hours ago
Software Development
Full Time
Switzerland

Overview

Decentriq is the rising leader in data-clean-room technology. With Decentriq, advertisers, retailers, and publishers securely collaborate on 1st-party data for optimal audience targeting and campaign measurement. Headquartered in Zürich, Decentriq is trusted by renowned institutions in the DACH market and beyond, such as RTL Ad Alliance, Publicis Media, and PostFinance.

Our analytics & ML pipelines are the heartbeat of this platform. Built in Python (pandas) and Apache Spark, they run either in Databricks workspaces or on our own Spark clusters deployed inside confidential-computing enclaves. We are looking for a Senior Data Engineer (≥ 80 %, start as soon as possible) to take end-to-end ownership of these pipelines, raise their resilience to the next level, and push the boundaries of privacy-preserving machine-learning for AdTech. The role can be fully remote (± 4 h CET) or based in our Zürich/Berlin office.

Would you like to help us make the advertising industry ready for the 1st-party era? Then we’d love to hear from you!

In Short

  • Own, Design & Operate Data Pipelines – Take full responsibility for all pandas- and Spark-based pipelines, from development through production and monitoring.
  • Advance our ML Models – Improve and productionise models for AdTech use-cases such as look-a-like modelling, audience expansion, and campaign measurement.
  • Engineer for the Invisible – Build extra-robust validation at the data source, exhaustive test coverage, and self-healing jobs to guarantee reliability.
  • Collaborate Cross-Functionally – Work closely with data scientists, backend engineers (Rust), and product teams to ship features end-to-end.
  • AI-Powered Productivity – Leverage LLM-based code assistants, design generators, and test-automation tools to move faster and raise the quality bar.
  • Drive Continuous Improvement – Profile, benchmark, and tune Spark workloads, introduce best practices in orchestration & observability, and keep our tech stack future-proof.

Requirements

  • (Must have) Bachelor/Master/PhD in Computer Science, Data Engineering, or a related field and 5+ years of professional experience.
  • (Must have) Expert-level Python plus solid hands-on experience with pandas, PySpark/Scala Spark, and distributed-data processing.
  • (Must have) Proven track record building resilient, production-grade data pipelines with rigorous data-quality and validation checks.
  • (Must have) Working knowledge of ML lifecycle and model serving; familiarity with techniques for audience segmentation or look-a-like modelling is a big plus.
  • (Plus) Experience running workloads in Databricks, Spark on Kubernetes, or other cloud/on-prem big-data platforms.
  • (Plus) Exposure to confidential computing, secure enclaves, homomorphic encryption, or similar privacy-preserving tech.
  • (Plus) Rust proficiency (we use it for backend services and compute-heavy client-side modules).
  • (Plus) Data-platform skills: operating Spark clusters, job schedulers, or orchestration frameworks (Airflow, Dagster, custom schedulers).

Benefits

  • Join Decentriq's Engineering team as an individual contributor and earn growing responsibilities.
  • Being able to create, shape, and benefit from a young company.
  • An amazing and fun team that is distributed all over Europe.
  • Competitive salary.
  • A lot of opportunities for self-development.

No need for a formal motivational letter. Just send your CV along with a few bullet points about why you’re excited to work with us. We look forward to your application!

Decentriq logo

Decentriq

Decentriq is a pioneering company that specializes in secure data collaboration through its innovative platform based on Confidential Computing technology. This platform enables organizations to share sensitive data without exposing the raw data, making it ideal for privacy-conscious industries such as media, healthcare, banking, and the public sector. With headquarters in Zurich and a presence in cities like Berlin, Barcelona, and Budapest, Decentriq is committed to redefining data collaboration on a global scale. The company fosters a positive and enthusiastic work environment, encouraging team members to engage in meaningful projects while benefiting from mentorship and professional development opportunities.

Share This Job!

Save This Job!

Similar Jobs:

Airbnb logo

Privacy Engineering Lead - Remote

Airbnb

9 weeks ago

Join Airbnb's Privacy Engineering team to lead initiatives that enhance data privacy and compliance in AI/ML systems.

USA
Full-time
Software Development
$204,000 - $259,000 USD
Superside logo

Lead Data/ML Engineer - Remote

Superside

16 weeks ago

SuperAds is looking for a Lead Data/ML Engineer to design scalable data pipelines and integrate machine learning models.

Worldwide
Full-time
Data Analysis
ImagineX Consulting logo

Data Engineering & AI/ML Lead - Remote

ImagineX Consulting

5 weeks ago

Lead the Data Engineering & AI/ML practice at ImagineX, driving innovation and collaboration while mentoring a dynamic team.

GA, USA
Full-time
Software Development

Y.A

Data Pipeline Engineer - Remote

YipitData (Alternative)

5 weeks ago

Join YipitData as a Data Pipeline Engineer to build and maintain data pipelines in a dynamic environment.

India
Full-time
Data Analysis
Yipitdatajobs logo

Data Pipeline Engineer - Remote

Yipitdatajobs

6 weeks ago

YipitData is looking for a Data Pipeline Engineer to build and maintain data pipelines as part of a new team in India.

India
Full-time
Data Analysis