Free cookie consent management tool by TermsFeed SWE, Research Data Infrastructure | People in AI
Image
img
img

SWE, Research Data Infrastructure

  • Permanent
  • $500,000 - $800,000
  • New York, United States, New York City
Image

Senior Software Engineer, Research Data Infrastructure

New York City, 4 days per week onsite

Compensation: Not disclosed

AI-driven institutional investment platform

We are working with a highly regarded institutional investment firm building a specialist AI research and engineering group focused on applying advanced machine learning to live investment decision-making.

This is not a typical data engineering role and it is not an internal analytics function. The team builds AI systems that operate in production, support real investment workflows, and are held accountable to measurable outcomes. You would join a small, high-caliber group of research scientists, machine learning engineers, platform engineers, and investment specialists working at the intersection of AI, markets, distributed systems, and large-scale experimentation.

The role is best suited to a strong software engineer who has built production-grade distributed systems, data platforms, or ML infrastructure, and wants to work directly with research teams on hard, ambiguous problems.

What you’ll work on

You will build the core research data infrastructure that powers AI models, experimentation systems, and production investment workflows.

This includes designing and scaling systems for high-throughput data movement, processing, storage, orchestration, observability, and reproducibility. You will partner closely with research scientists to turn early-stage research needs into reliable infrastructure, while also modernizing existing systems and building new platforms where the current tooling does not yet exist.

The work will span pipelines, orchestration layers, monitoring, introspection, data quality checks, developer tooling, researcher tooling, and infrastructure that helps research move from idea to production more quickly and safely.

What we’re looking for

We are looking for someone with strong software engineering fundamentals and real production ownership.

You should have experience building distributed systems, data platforms, orchestration frameworks, or infrastructure that supports machine learning, scientific research, or complex experimentation. Strong Python and systems design ability are important, as is evidence that you have owned architecture, scalability, reliability, observability, and operational maturity.

This is a hands-on role. You should be comfortable designing the system, writing the code, debugging production issues, and improving the platform over time. You should also be comfortable working with research scientists where the requirements may be ambiguous and the best solution needs to be shaped collaboratively.

Tech stack and environment

The environment includes technologies such as Python, Spark, Kafka, Flink, Ray, Airflow, Databricks, Snowflake, Kubernetes, and similar distributed data systems.

You do not need to have used every tool listed, but you should have strong depth across distributed systems, high-throughput data infrastructure, workflow orchestration, batch or streaming systems, monitoring, schema design, data quality, and failure recovery.

The team also uses AI-native development workflows, coding agents, and automated engineering systems as part of how software is built.

Why this role stands out

This is a rare opportunity to work on infrastructure that sits directly behind advanced AI research and live investment systems. The team is applying modern machine learning and reasoning models to financial markets, with a strong focus on causality, diagnostics, reliability, and explainability.

The culture is fast-moving, direct, and highly collaborative. Engineers and scientists work closely together, with real ownership from paper to production. This will suit someone who wants to be close to the research, close to the systems, and close to the business impact.

Location

This is an onsite role based in New York City, with an expectation of being in the office four days per week.

People In AI partners with leading AI, machine learning, and data-driven companies to help candidates understand the real scope, team context, and technical expectations behind each opportunity. We aim to make the process clear, direct, and useful, so you can make an informed decision about whether the role is genuinely aligned with your goals.

Share job:
Decor
Image

Upload resume

Boost your career with expert recruitment solutions!

Your resume will be confidentially submitted to our team, who will be in touch if we have a match for your job search

Upload resume
Image
A woman sitting outdoors working on a laptop.