Why This Job is Featured on The SaaS Jobs
This Technical Program Manager role sits at the intersection of platform reliability and customer trust—an area that becomes more visible as SaaS products serve developers and enterprise workloads. The remit around major incident lifecycle, SLAs, and playbooks signals an organisation treating operational maturity as a product capability, not just an internal function, particularly relevant for AI infrastructure delivered as a service.
For SaaS career development, incident leadership is a durable skill set: it builds fluency in cloud operations, cross-team dependency management, and the mechanics of reducing downtime through post-incident learning loops. Experience coordinating P1–P4 events, tightening monitoring and triage, and translating technical impact for non-technical stakeholders transfers cleanly across B2B SaaS environments where reliability, security, and transparency shape renewals and expansion.
This position fits professionals who prefer structured ownership—running clear processes, tracking actions, and facilitating alignment across engineering, security, and IT without needing direct authority. It will particularly suit program managers who enjoy being close to technical detail, can stay objective under pressure, and want their work to influence how a platform scales operationally across a global customer base.
The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.
Job Description
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Join us on our mission and shape the future!
Why this role?
We’re seeking an experienced Technical Program Manager to join Cohere’s Engineering Program Management team. We need someone with curiosity, drive, independence, and leadership, who has hands-on experience managing projects for enterprise-grade software or enterprise-focused machine learning solutions.
In return, you’ll have the unique opportunity to shape Cohere’s operations and collaborate with leading minds in the LLM space. You will have the chance to make extremely high-impact contributions to our fast-growing company, product, and culture.
This role is open to candidates based on the East Coast, or those who are flexible to travel.
As a Technical Program Manager, you will:
Lead and Manage: Own the end-to-end lifecycle of all major incidents within Cohere’s environment, ensuring effective communication, escalation, and resolution.
Communicate: Deliver clear, timely, and objective updates across engineering, leadership, and non-technical teams for major incidents. Specifically, lead all P1-P4 incidents throughout their lifecycle and ensure each incident is managed within its respective SLA.
Optimize: Break down complex challenges into actionable strategies, aligning engineering with all relevant stakeholders.
Plan: Coordinate across all engineering teams to ensure global coverage for all of Cohere’s customers.
Assist in developing and maintaining incident playbooks for common or anticipated incident scenarios.
Work with engineering managers to enhance monitoring capabilities and our triage process to mitigate future incidents. You will also work closely with our Security, IT, and Engineering teams to ensure issues are prioritized and resolved.
Execute: Deliver post-mortem updates after an incident with clear actions.
Problem-solve: Proactively resolve issues, coordinate dependencies, and prioritize impacts to quality and timelines.
You may be a good fit if:
You have 5+ years of experience as an Incident Technical/Engineering Program Manager, with technical expertise and experience, including exposure to SaaS/cloud environments.
You have a strong understanding of incident management tools such as Incident.io, PagerDuty, ServiceNow, Rootly, Atlassian, or equivalent.
You have hands-on experience with creating incident management programs from 0-1 and have successfully led incident management programs within enterprise-level environments.
You’re detail-oriented, self-organized, and collaborative—excelling at note-taking, action tracking, and enabling teams.
You have strong communication skills (written and verbal) and can simplify complex issues for multiple stakeholders, both internal and external.
You have a bias for action, balancing execution focus with diplomacy and accountability.
You’ve shipped technical products/services across cross-functional teams (including remote/global stakeholders).
If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Full-Time Employees at Cohere enjoy these Perks:
🤝 An open and inclusive culture and work environment
🧑‍💻 Work closely with a team on the cutting edge of AI research
🍽 Weekly lunch stipend, in-office lunches & snacks
🦷 Full health and dental benefits, including a separate budget to take care of your mental health
🐣 100% Parental Leave top-up for up to 6 months
🎨 Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
🏙 Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
✈️ 6 weeks of vacation (30 working days!)