We are looking for a skilled and passionate Senior Site Reliability Engineer (SRE), based on the East Coast of the United States to join the Cloud Platform team, which empowers DataSnipper's growth through a secure and scalable enterprise cloud platform.
As a Senior SRE at DataSnipper, you will set the strategic direction for our cloud infrastructure on Microsoft Azure. You will define target-state architectures and roadmaps, lead enterprise-scale landing zone design and governance, and partner with product, SRE, security, and data teams to deliver multi-tenant, multi-region, secure-by-default solutions. You'll standardize patterns, automate with Infrastructure as Code, and guide migrations and modernizations, turning best practices into measurable reliability, security, and cost outcomes.
About DataSnipper:
DataSnipper is the driving force behind an intelligent automation platform that's transforming the world of audit and finance.
Founded in 2017, DataSnipper has skyrocketed to unicorn status, achieving a valuation of $1 billion following a successful funding round led by Index Ventures. With over 500,000 users across 160+ countries and offices in Amsterdam, New York, Kuala Lumpur, Tokyo, and Mexico City, DataSnipper is shaking things up - and we're not stopping there!
What you will do:
• Define and own the cloud infrastructure strategy, reference architectures, and platform roadmaps for Azure across compute, networking, identity, data, security, and observability
• Design and implement an enterprise-scale Azure Landing Zone (management groups, subscriptions, RBAC, Azure Policy) and governance for multi-tenant SaaS and regulated customers
• Architect highly available, multi-region solutions leveraging services such as AKS/Container Apps, App Service, Azure DB for PostgreSQL, Redis, Service Bus/Event Grid, Front Door/Traffic Manager, and CDN
• Enable secure private connectivity patterns (Private Link, VNet integration, Azure Firewall/WAF, ExpressRoute/VPN) and champion zero-trust principles with Entra ID and Managed Identity
• Establish platform engineering "golden paths" and reusable accelerators: Terraform modules, environment bootstrapping, and CI/CD templates in GitHub Actions
• Drive well-architected reviews for mission-critical workloads; translate findings into actionable improvements for reliability, security, performance, and cost optimization with measurable SLOs/SLIs
• Implement end-to-end observability using Azure Monitor, Log Analytics, Application Insights, and (where applicable) Prometheus/Grafana; automate proactive detection and post-incident
improvement plans
• Partner with Security to implement least-privilege access, PIM, Defender for Cloud, Key Vault, secret rotation, and compliance controls (e.g., SOC 2, ISO 27001)
• Define and validate DR/BCP strategies (RTO/RPO), including zone-redundancy, geo-replication, backups, and failover testing
• Mentor and coach engineering teams; lead architecture reviews, threat modeling, technical workshops, and author clear documentation and reference architectures
• Evaluate and guide adoption of new Azure capabilities; collaborate with partners and vendors to enhance our platform
What you will bring:
• 7+ years in cloud architecture or platform engineering, with deep hands-on expertise in Microsoft Azure and experience setting cloud strategy and roadmaps
• Proven track record designing multi-tenant, multi-region SaaS architectures and enterprise-scale Azure Landing Zones with strong governance and policy
• Expertise across Azure services: AKS/Container Apps, App Service, VMSS; VNet/vWAN, Private Link, Azure Firewall, App Gateway/WAF, Front Door; Entra ID (Azure AD), RBAC, Managed Identity, PIM; Storage, Azure SQL DB; Service Bus/Event Grid; Key Vault; Defender for Cloud; Azure Monitor/Log Analytics/App Insights
• Strong DevOps/SRE practices: CI/CD (GitHub Actions), GitOps, blue/green and canary deployments, infrastructure testing, and progressive delivery
• Hands-on with Infrastructure as Code (Terraform and/or Bicep; ARM), policy-as-code, and environment bootstrapping at scale
• Solid grasp of networking and hybrid connectivity (ExpressRoute, VPN), security-by-design, and zero trust
• FinOps mindset with demonstrable cost optimization, tagging/chargeback, budgets/alerts, and rightsizing
• Strong communication and stakeholder management skills; ability to influence across product, SRE, security, and leadership
• Proficiency in scripting/coding (PowerShell and one of Python/C#/Go)
• Nice to have: Azure Solutions Architect Expert (AZ-305), Azure DevOps Engineer Expert (AZ-400), CKA/CKAD; experience in regulated environments (SOC 2, ISO 27001, HIPAA, GDPR); contributions to public docs/reference architectures
What We Offer:
Flexible paid time off.
Remote work
Next steps:
1 hour live coding
1 hour of system design
Apply and let's disrupt the auditing world together! 🚀