Why This Job is Featured on The SaaS Jobs
### Why this Role is Featured on The SaaS Jobs
Production engineering sits at the center of SaaS trust: customers judge a cloud product less by feature lists and more by uptime, latency, and how safely changes ship. This Staff Software Engineer role is notable because it is explicitly anchored in SLO-driven reliability, incident learning, and post-release health verification, all core mechanics of operating a large, always-on SaaS platform.
For a SaaS career, the work maps to durable operating skills that transfer across product-led and enterprise SaaS environments. Building detection and reliability tooling, shaping service lifecycle practices, and partnering with application engineers on performance targets develops a systems view that becomes increasingly valuable as companies scale usage and complexity. The emphasis on automation and sustainable scaling also aligns with how mature SaaS organizations balance delivery velocity with production risk.
The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.
Job Description
Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for building big, moving fast, and taking technology — and careers — to the next level.
The Production Engineering Team at Snowflake is responsible for driving the reliability tools and processes that ensure Snowflake consistently delivers a top-tier experience for its customers. This includes championing Service Level Objectives (SLOs) across all of Engineering, building the infrastructure necessary for rapid detection of reliability issues, and deeply engaging in system health verification after releases. We think about production reliability end-to-end: how do we proactively prevent issues, quickly detect and diagnose problems when they arise, and efficiently resolve them to minimize impact. We drive the culture of learning from every incident.
RESPONSIBILITIES:
Lead the improvement of the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
Drive scaling systems sustainably by automation; Drive changes that improve reliability and velocity.
Establish and practice low noise incident response rotations and blameless postmortems to prevent problem recurrence.
Write and review code. Develop documentation and capacity plans, and debug the hardest problems on large distributed systems.
Collaborate with software engineers to establish, maintain, and optimize functional and performance SLOs.
Participate in a 24x1 on-call rotation.
MINIMAL QUALIFICATIONS:
Bachelor's degree in Computer Science, a related technical field involving software engineering, or equivalent practical experience.
Proficient in at least one modern programming language, preferably Golang.
Systematic problem-solving methods, effective communication skills.
PREFERRED QUALIFICATIONS:
10+ years industry experience designing, building and supporting large scale systems in production.
Experience in modern observability tools and production monitoring practices.
Experience with capacity and load testing of the distributed applications
Experience with containers and container orchestration systems such as Kubernetes
Experience in deploying, managing, and operating scalable and fault tolerant Linux infrastructure.
Experience with the SLO-driven reliability management processes.
Hands on experience with one of more public cloud providers (AWS, Azure, or GCP)
Ability to spot systematic issues, define roadmaps and guide other engineers to resolve them.
Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.
How do you want to make your impact?
For jobs located in the United States, please visit the job posting on the Snowflake Careers Site for salary and benefits information: careers.snowflake.com