Lead Data Engineer | People in AI
Lead Data Engineer

  • Permanent
  • 200,000 - 300,000
  • Remote

Lead Data Engineer, Behavioral AI Infrastructure
Compensation: $250,000 + equity
Location: US Remote or Hybrid (East Coast preferred)

Join a pioneering machine learning company that transforms messy, hard-to-use behavioral data into powerful, privacy-safe insights for some of the world’s most data-driven enterprises.

This stealth scale-up is reshaping how third-party data fuels predictive models by converting fragmented, high-liability consumer data into compact, ML-ready features without exposing any raw information. With over $100M in non-dilutive funding and strategic partnerships in place, the company is approaching unicorn status and scaling fast.

The Opportunity
This is a high-impact leadership role for a systems-minded data engineer who wants to shape the technical backbone of privacy-first behavioral AI. You'll architect and evolve critical data workflows—balancing scale, performance, cost, and reliability—while enabling ML teams to move fast with confidence.

What You’ll Do

  • Own end-to-end data infrastructure powering behavioral ML systems
  • Design and optimize pipelines for batch and streaming ingestion, transformation, and feature generation
  • Define schemas, data contracts, lineage, and orchestration best practices
  • Collaborate across engineering and ML teams to build scalable abstractions
  • Lead by example in coding, architecture, and a reliability-first mindset
  • Balance pragmatic delivery with long-term platform quality

What You’ll Bring

  • Proven experience building robust, modular data workflows at scale
  • Strong command of architectural tradeoffs: batch vs stream, push vs pull, eventual consistency, etc.
  • Ability to influence engineering standards and data conventions in evolving environments
  • Deep expertise in tools like Delta Lake, Spark (Scala and/or Python), and Airflow
  • Pragmatic mindset: knowing when to polish and when to ship
  • Clear systems thinking and comfort navigating ambiguity

Tech Stack

  • Delta Lake
  • Apache Spark (Scala & Python)
  • Airflow

Why Join?

  • Work with massive, real-world behavioral datasets
  • Help redefine how machine learning leverages third-party data—ethically and effectively
  • Collaborate with top-tier talent in data engineering, AI research, and product
  • Be part of a mission-driven team using data for positive, privacy-respecting impact
  • Shape foundational infrastructure as the company scales 10x

About People In AI
We’re a specialized recruitment partner helping exceptional technical talent connect with ambitious AI-driven companies. Our focus is on meaningful work, ethical innovation, and long-term fit.
