Back to positions

Senior Staff Machine Learning Engineer, Data & Eval

Remote role Full-time Open position

Job Description:

  • In this Senior Staff role, you will set technical direction and lead execution for ML evaluation and the end-to-end data flywheel powering CSxAI products (e.g., assistive agents, issue resolution, and tooling).
  • Your work will define how we measure quality, how we turn feedback into learning signals, and how we continuously improve models and products safely and efficiently.
  • You will partner closely with product, engineering, design, operations to build evaluation systems that are trusted, scalable, and actionable - connecting offline metrics to online outcomes.
  • Work with large scale structured and unstructured data; explore, experiment, build and continuously improve Machine Learning models and pipelines for Airbnb product, business and operational use cases.
  • Work collaboratively with cross-functional partners including product managers, operations and data scientists, to identify opportunities for business impact; understand, refine, and prioritize requirements for machine learning, and drive engineering decisions.
  • Hands-on develop, productionize, and operate Machine Learning models and pipelines at scale, including both batch and real-time use cases.
  • Leverage third-party and in-house Machine Learning tools & infrastructure to develop reusable, highly differentiating and high-performing Machine Learning systems, enable fast model development, low-latency serving and ease of model quality upkeep.

Requirements:

  • Educational Background: PhD in Computer Science, Mathematics, Statistics, or related technical field (or equivalent practical experience).
  • Industry Experience: 10+ years building, testing, and shipping ML/AI systems end-to-end; including 2+ years of experience with GenAI/LLM systems in production.
  • Leadership Experience: 5+ years leading large, ambiguous technical initiatives as a senior IC, influencing roadmap and engineering/science direction across teams.
  • Technical Proficiency:
  • Deep expertise in evaluation methodology (offline/online alignment, metric design, human-in-the-loop evaluation, A/B testing, power analysis, regression testing).
  • Hands-on experience with GenAI systems, including orchestration, retrieval, tool calling, memory, etc.
  • Experience building data pipelines and quality systems (labeling workflows, dataset curation, versioning, monitoring, and governance).
  • Solid ML fundamentals and best practices (model selection, training/serving, monitoring, reliability, and model lifecycle management).

Benefits:

  • This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.

Apply tot his job Apply To this Job

Further positions

Staff Machine Learning Engineer - AI Platform

Remote role Full-time

Senior Computer Vision /Machine Learning Engineer II (Indianapolis)

Remote role Full-time

Principal Machine Learning & Data Engineer

Remote role Full-time

Senior Machine Learning Engineer - ML Planner

Remote role Full-time

Principal ML Engineer, Machine Learning Platform and Systems Architecture

Remote role Full-time

Machine Learning Engineer | MLOps & Scalable Systems

Remote role Full-time

Senior Machine Learning Engineer, Perception - Autonomous Driving

Remote role Full-time

Staff / Principal Machine Learning Engineer, Serving - USA

Remote role Full-time

Senior Staff Machine Learning Engineer, GenAI Platform

Remote role Full-time

Senior Machine Learning Engineer, Shopping AI

Remote role Full-time

Video Editor - Gaming Trailers / Campaigns

Remote role Full-time

Client Operations Specialist - Renewals | Remote, USA

Remote role Full-time

Meetings Manager

Remote role Full-time

Manager of Agronomy

Remote role Full-time

Remote Customer Experience Specialist – Work From Home Customer Support Representative at arenaflex

Remote role Full-time

Experienced Customer Service Representative - APPRAISER at arenaflex

Remote role Full-time

Sr. Director, Data & Integrations Engineering

Remote role Full-time

Collaborating Psychiatrist

Remote role Full-time

Data Scientist (AI & Synthetic Intelligence)

Remote role Full-time

Experienced Full Stack Remote Chat Support Specialist – Web & Cloud Application Development

Remote role Full-time