Why Harvey
At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 700+ customers in 58+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.
Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes. We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.
At Harvey, the future of professional services is being written today — and we’re just getting started.
Role Overview
At Harvey, we’re building the AI platform for the world’s top legal and professional services teams. As we scale, our data team sits at the heart of this mission—turning raw data and research into robust, intelligent systems that power reasoning at scale. Our Data Team powers Harvey’s ability to understand and leverage both public and private data at scale — building the infrastructure that ingests, transforms, and retrieves millions of documents to make our AI systems smarter every day.
We’re looking for a Director of Engineering, Data to lead this function into its next chapter. You’ll shape the strategy, architecture, and team behind the systems that make advanced reasoning possible. The Data team owns end-to-end retrieval-augmented generation (RAG) stacks across complex domains — including Case Laws, Legislation, and Tax codes across 50+ international jurisdictions. As generation and reasoning improve, retrieval quality has become the new frontier. Solving it at scale is one of Harvey’s top priorities.
If you’re excited by large-scale data engineering, complex information retrieval, and building the backbone of cutting-edge AI systems, we’d love to talk.
What You'll Do
Lead and scale the Data organization from a single high-performing team into multiple specialized teams.
Partner closely with leadership to define the strategic roadmap for Harvey’s data ecosystem and ensure it scales with our global growth
Own and evolve Harvey’s end-to-end data architecture — from ingestion to transformation, storage, retrieval, and delivery — ensuring performance, reliability, and scalability to power LLMs and downstream applications.
Design and oversee large-scale data ingestion pipelines that aggregate, normalize, and maintain data from thousands of heterogeneous, publicly available legal and regulatory sources across global jurisdictions.
Integrate private and partner data sources, ensuring robust access controls, lineage tracking, and compliance with security and privacy requirements.
Evaluate and implement data infrastructure technologies to support large-scale document processing, embedding generation, vector storage, and retrieval optimization.
Collaborate closely with the Applied AI team to drive experimentation and model improvements that directly enhance AI quality and differentiation across Harvey’s products.
Drive the development of end-to-end research experiences that weave together our retrieval, reasoning, and UX layers — transforming AI insights into intuitive, lawyer-friendly workflows that redefine how professionals engage with complex information.
Partner cross-functionally with Product Engineering, Applied AI, Research, and Platform teams to deliver high-quality, production-ready systems.
What You Have
You have 10+ years of experience in data engineering, data architecture, or platform engineering, with 5+ years of leading high-performance teams.
You’ve led data or ML infrastructure teams through scale — from startup to multi-team org.
You have a proven track record of building and scaling distributed data systems handling large, complex, and heterogeneous datasets.
You bring depth in backend, data infrastructure, or information retrieval, with a strong appreciation for applied AI.
You value clarity, craftsmanship, and high trust as the foundations of great engineering.
Compensation Range
$320,000 - $360,000 USD
Please find our CA applicant privacy notice here.
#LI-KV1
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations@harvey.ai