About the Team
OpenAI is building the infrastructure foundation for the next generation of AI. The Data Center Engineering team defines the strategy, reference architectures, technical requirements, and delivery standards for the large-scale data centers that support OpenAI research, products, and infrastructure partners.
As a Data Center Infrastructure Mechanical Engineer, you will help design, validate, and scale the cooling and mechanical systems that make high-density AI compute possible. You will work across thermal architecture, equipment development, manufacturing validation, construction, commissioning, deployment, and operations, partnering with research, hardware engineering, data center engineering, supply chain, EHS, operations, and external delivery partners.
About The Role
We are seeking a senior mechanical infrastructure engineer to lead the development of reliable, efficient, safe, and scalable thermal architectures for high-density, liquid-cooled AI data centers. This role is ideal for someone who can translate evolving compute and rack-level thermal requirements into practical infrastructure designs, evaluate complex equipment and vendor solutions, and drive technical decisions across facilities, hardware, controls, telemetry, testing, commissioning, and operations.
The ideal candidate has deep hands-on experience with mission-critical mechanical systems at data center or comparable critical infrastructure scale, including chilled water plants, condenser water systems, cooling towers, dry coolers, pumps, heat exchangers, CDUs, manifolds, CRAHs, air handlers, filtration, water treatment, controls, liquid distribution, and high-density rack cooling interfaces.
Key Responsibilities
Define mechanical and cooling infrastructure requirements and reference architectures for AI data center campuses, including heat rejection, chilled water, condenser water, liquid cooling distribution, air handling, containment, filtration, water treatment, controls, and metering.
Review and develop basis-of-design documents, design narratives, P&IDs, mechanical schedules, equipment specifications, thermal capacity models, hydraulic models, CFD analysis, controls sequences, and commissioning requirements.
Evaluate mechanical architectures for high-density compute, including direct-to-chip liquid cooling, CDU topology, facility water interfaces, manifolds, quick disconnects, hoses, heat exchangers, pumping strategies, leak detection, serviceability, and failure isolation.
Partner with electrical, controls, hardware, networking, construction, and operations teams to ensure cooling systems support liquid-cooled GPU rack deployments and reliable facility operation.
Develop technical specifications and acceptance criteria for chillers, cooling towers, dry coolers, evaporative coolers, pumps, heat exchangers, CDUs, manifolds, valves, filters, water treatment systems, CRAHs, air handlers, VFDs, controls panels, sensors, and monitoring devices.
Lead technical evaluation of equipment vendors, manufacturers, design consultants, commissioning agents, contractors, and testing laboratories; review submittals, P&IDs, control diagrams, performance curves, test reports, certifications, and quality documentation.
Drive factory acceptance testing, site acceptance testing, witness testing, pressure testing, leak testing, thermal performance testing, reliability testing, interoperability testing, and integrated systems testing for critical equipment and high-density rack deployments.
Help design and operate a hardware and data center infrastructure R&D laboratory used to validate new cooling equipment, liquid-cooled GPU rack designs, operating envelopes, fault scenarios, telemetry, and facility-hardware interactions.
Collaborate with hardware manufacturers to evaluate L10 and L11 test procedures, yield, throughput, reliability, serviceability, and readiness for large-scale deployment.
Define telemetry and controls requirements for mechanical and supporting electrical systems, including temperatures, flow rates, differential pressure, pump status, valve position, chiller status, CDU status, coolant quality, leak detection, alarms, control states, and rack-level thermal signals.
Analyze lab data, operational incidents, thermal excursions, flow excursions, pressure excursions, leak events, contamination events, nuisance alarms, controls alarms, and rack-level thermal failures to improve designs, procedures, vendor quality, and reliability models.
Create clear engineering documentation, standards, test procedures, commissioning scripts, operating procedures, decision records, risk registers, and executive-ready technical summaries.
Provide senior technical escalation support during design reviews, construction, manufacturing validation, commissioning, startup, production deployment, vendor testing, and operational events.
Mentor partner teams and raise the technical bar for mechanical design, thermal validation, safety, water quality, documentation, reliability, and operational rigor.
Required Qualifications
Extensive senior-level experience in data center mechanical engineering, mission-critical facilities, high-performance computing infrastructure, thermal systems, or comparable critical infrastructure environments, typically gained through 10+ years of relevant work or equivalent depth of experience.
Deep expertise in mechanical design, commissioning, operations, maintenance, reliability, and lifecycle management at large campus or multi-site scale.
Hands-on experience with chilled water systems, condenser water systems, process cooling water, cooling towers, dry coolers, evaporative cooling, pumps, heat exchangers, CDUs, manifolds, CRAHs, air handlers, filtration, water treatment, valves, piping, controls, and monitoring systems.
Experience designing, testing, certifying, commissioning, or operating equipment used in high-density compute or mission-critical environments, including liquid cooling systems, CDUs, high-flow hydronic systems, heat exchangers, rack manifolds, quick disconnects, and thermal monitoring systems.
Experience reviewing and producing technical design packages, equipment specifications, design narratives, sequence-of-operations, CFD or hydraulic studies, test plans, commissioning scripts, operating procedures, and acceptance criteria.
Proven ability to evaluate and manage external engineering firms, equipment manufacturers, construction partners, commissioning providers, testing laboratories, and strategic suppliers.
Strong understanding of product safety, agency certification, mechanical code compliance, pressure safety, environmental requirements, AHJ coordination, utility requirements, water quality programs, and standards relevant to critical mechanical infrastructure.
Experience with factory acceptance testing, site acceptance testing, integrated systems testing, leak testing, pressure testing, thermal performance testing, failure-mode testing, root-cause analysis, and corrective-action management.
Ability to use data, models, field observations, and engineering judgment to make decisions under ambiguity and communicate risk clearly to technical and non-technical stakeholders.
Bachelor's degree in Mechanical Engineering, Electrical Engineering, Systems Engineering, or a related technical field, or equivalent practical experience.
Preferred Qualifications
Advanced degree in mechanical engineering, thermal sciences, fluid mechanics, systems engineering, or a related discipline.
Professional Engineer license, Chartered Engineer status, or comparable professional certification.
Direct experience with AI infrastructure, hyperscale data centers, HPC clusters, GPU rack deployments, direct-to-chip liquid cooling, immersion cooling, or high-density rack thermal design.
Experience with high-density liquid cooling architectures, CDU qualification, facility water interface specifications, coolant chemistry, leak detection, quick disconnects, rack manifolds, cold plates, and thermal controls for accelerated compute.
Experience qualifying new mechanical or thermal products from concept through prototype, certification, manufacturing ramp, field deployment, and lifecycle support.
Experience evaluating L10 and L11 manufacturing test validation procedures, yield, throughput, burn-in, reliability screens, and test coverage for liquid-cooled GPU racks or similar complex systems.
Experience with BMS, EPMS, DCIM, SCADA, PLCs, chiller controls, pump controls, CDU controls, data historians, digital twins, CFD model validation, and reliability analytics.
Experience managing global, multi-site equipment manufacturing programs or data center programs across matrixed organizations and partner ecosystems.
Familiarity with sustainability, energy efficiency, PUE, WUE, CUE, water stewardship, grid constraints, demand response, renewable energy integration, heat reuse, and low-carbon infrastructure design considerations.
Work Environment and Travel
This role may require periodic travel to data center campuses, manufacturing partners, equipment suppliers, laboratories, construction sites, and commissioning activities.
The engineer should be comfortable working in office, lab, manufacturing, construction, and operating data center environments, including environments that require PPE, safety briefings, lockout/tagout discipline, water or chemical handling discipline, and coordination with site operations.
Work may include time-sensitive technical escalations during commissioning, startup, production deployment, vendor testing, leak events, thermal events, water treatment events, or operational incidents.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.