Why This Job is Featured on The SaaS Jobs
In enterprise SaaS—especially platforms embedded in customer experience workflows—availability and trust are product features. This Incident Manager remit sits at the intersection of Support, Engineering, and Product, where P0 escalations become moments that define renewal risk and platform credibility. The emphasis on war-room leadership, monitoring visibility, and executive-ready updates reflects the reality of selling and operating SaaS for large brands: incidents are both technical events and customer outcomes.
For a SaaS career, this role builds durable operating muscle in incident lifecycle ownership: triage discipline, stakeholder alignment, and turning root-cause findings into prevention work that informs roadmaps. Experience translating logs, APM signals, and system behavior into clear decisions is highly portable across SaaS environments, as is the practice of running RCAs that improve reliability over time. The exposure to cross-functional governance also strengthens program management and customer-facing leadership in technical contexts.
This role is best suited to professionals who prefer structured crisis leadership and crisp communication under ambiguity, and who enjoy influencing without direct authority. It aligns with later-career operators who want to be the escalation “control tower” for complex, customer-impacting issues, while staying close to the technical details that drive resolution.
The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.
Job Description
About the Company:
Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Airlines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher quality customer experiences.
Want to be part of the AI revolution and transform how the world’s largest global brands do business? Join us!
About the Role
An Incident Manager plays a critical role in ensuring the effective and efficient handling of incidents within an organisation. They are responsible for managing the entire lifecycle of incidents, from identification and logging to resolution and post-incident analysis. Their primary goal is to minimize the impact of incidents on business operations and ensure that services are restored as quickly as possible.
\n
Responsibilities- Crisis Ownership & Command: Act as the single, authoritative point of contact for all P0/Critical customer escalations, owning the entire resolution lifecycle from initial triage and mobilization to final post-mortem delivery.
- Executive Alignment & Communication: Design and deliver clear, concise and compelling status updates to C-level and senior executive audiences (both internal and external), focusing relentlessly on mitigation steps, recovery progress, and clear, organized plans for resolution.
- Cross-Functional Mobilization: Lead and govern internal war rooms, mobilizing and coordinating resources across Engineering, Product, Support (L1/L2), and Commercial teams to ensure rapid problem-solving and aggressive delivery against service restoration targets.
- Technical Deep Dive & Drive: Possess a detailed understanding of Netomi’s systems and monitoring tools (e.g., Datadog, logging platforms) to rapidly determine issue scope and direct technical teams towards the root cause. You possess the authority and influence to secure internal commitment and drive focused action until the technical issue is resolved.
- Root Cause & Prevention: Drive rigorous Root Cause Analysis (RCA) processes, translating complex technical findings into clear, actionable steps for Product and Engineering roadmaps to prevent future recurrences and enhance long-term platform stability.
- Internal Force & Influence: Proactively utilize superior influence, persistence, and problem-solving skills to overcome internal resource, priority, or technical roadblocks to drive urgency and secure internal commitment needed to resolve critical incidents.
- Process Refinement: Continuously refine and improve internal standard operating procedures (SOPs) for incident management and escalation workflows to optimize the support team's performance under pressure, ensuring every critical incident improves future response times.
Requirements- 12-15 years of demonstrated experience in a high-pressure, customer-facing technical support, strategic account management, or program management role focused specifically on critical issue resolution (P0/P1 incidents).
- Expert-level proficiency in leveraging monitoring tools (e.g., Datadog, Splunk, or similar logging/APM platforms) to quickly identify and understand technical issues.
- Demonstrated ability to maintain composure, confidence, and executive presence while leading intense, real-time crisis calls with senior client leaders and internal stakeholders.
- Exceptional executive communication and influencing skills; the ability to articulate clear, organized recovery plans and technical risk to both technical and non-technical audiences.
- Proven track record of success in developing, implementing, and refining incident management and escalation processes.
- Strong technical acumen—comfortable discussing complex API integrations, data flows, systems architecture, and conversational AI model training with customer technical teams.
- Proficiency with support systems and tools such as Zendesk, Datadog, or similar incident monitoring and ticketing platforms.
- Bachelor's degree in Business, Computer Science, or a related field; MBA or advanced degree is a plus.
\n
Netomi is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics.