Why This Job is Featured on The SaaS Jobs
Modern SaaS products increasingly rely on internal platforms that standardise how teams ship, observe, and secure software. This Platform Engineering DevOps role stands out for its focus on modernisation themes that are shaping the current SaaS infrastructure stack—containerisation, service mesh, microservices, and serverless—alongside an explicit push toward automation and resilience.
From a SaaS career perspective, the remit maps closely to skills that travel well across subscription businesses: building repeatable delivery systems (CI/CD and self-service tooling), codifying infrastructure with IaC, and establishing observability standards that support production-grade reliability. The inclusion of AIOps-style approaches (predictive alerting, anomaly detection, automated response) also reflects where many SaaS engineering organisations are heading as estates grow and operational complexity rises.
The role is best suited to an engineer who prefers hands-on ownership across the delivery lifecycle—designing platforms, improving operational feedback loops, and participating in incident response and post-incident learning. It fits someone in the early-to-mid stage of a DevOps/SRE path who wants breadth across cloud-native patterns and the discipline of building systems that other engineers depend on.
The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.
Job Description
DevOps Engineer – Platform Engineering
Location: Bengaluru
Employment Type: Full-time
Team: Platform Engineering
About the Role
We are looking for a passionate and hands-on DevOps Engineer to join our Platform Engineering team and accelerate our platform modernization journey. This role is ideal for engineers who thrive in automation-heavy environments and want to contribute to the evolution of scalable, secure and intelligent infrastructure platforms. You will work on next-gen cloud-native platforms, implement AI-powered automation and help build self-healing, resilient systems that support our applications at scale.
Key Responsibilities
- Manage platform modernization initiatives including containerization, service mesh adoption and migration to microservices and serverless infrastructure.
- Design and implement robust CI/CD pipelines and self-service DevOps platforms to streamline software delivery across environments.
- Develop and manage Infrastructure as a Code (IaaC) using tools like Terraform or CloudFormation for scalable and repeatable deployments.
- Automate infrastructure provisioning, configuration management and operations using tools like Ansible, Chef or Puppet.
- Leverage AI/ML-driven automation for predictive alerting, anomaly detection, auto-scaling and intelligent incident response.
- Build intelligent, self-healing systems focused on resilience, cost optimization and performance forecasting in public cloud environments.
- Drive observability by establishing standards for monitoring, logging and alerting using tools such as Prometheus, Grafana, OpenTelemetry and AIOps platforms.
- Embed security and compliance into DevOps workflows by adopting DevSecOps practices throughout the software development lifecycle.
- Evaluate emerging technologies and methodologies to improve system reliability, developer experience and platform scalability.
- Participate in SRE on-call rotations, production support and post-incident reviews to continuously improve system resilience.
- Build internal tools and automation solutions to enhance platform observability and operational efficiency.
- Identify and resolve performance bottlenecks and lead root cause analysis efforts for critical incidents.
- Collaborate cross-functionally with engineering, architecture and security teams to drive best practices and architectural alignment.
- Support disaster recovery planning, backup strategy implementation and compliance initiatives (e.g., SOC2, ISO).
- Mentor junior engineers, promote knowledge sharing and foster a culture of engineering excellence.
- 2–6 years of experience in DevOps, SRE or platform engineering roles with a software engineering mindset.
- Hands-on expertise in Kubernetes, Docker and service mesh architectures (Istio, Linkerd).
- Expertise in CI/CD tools such as Jenkins, ArgoCD, Spinnaker or similar tools for automating and managing deployment workflows.
- Experience with observability stacks (Prometheus, Grafana, ELK, Loki or Datadog) for monitoring, logging and alerting.
- Good understanding of AIOps and ML-driven automation, including anomaly detection, intelligent alerting and predictive incident response
- Strong problem-solving and debugging skills, particularly in complex, production-grade distributed systems
- Expertise in Infrastructure as a Code (IaaC) using tools like Terraform or Pulumi and proficiency in configuration management with Ansible, Puppet or similar tools.
- Familiarity with event-driven architectures using tools like Kafka or cloud-native pub/sub messaging systems.
- Good understanding of cloud cost optimization and efficiency practices through automation and resource management.
- Experience integrating security scanning and compliance checks into CI/CD pipelines using tools like Trivy, Snyk or arnica.
- Work on impactful infrastructure and DevOps challenges at scale.
- Build infrastructure that enables fast, reliable and responsible deployment of AI solutions.
- Be part of a culture that champions engineering excellence, ownership and continuous learning.
- Help shape the future of DevOps and AI integration in a fast-moving, innovation-focused environment.
- Collaborate with architects and DevOps leaders on strategic initiatives.
- Be part of a team building intelligent, resilient platforms using cutting-edge DevOps and AI technologies.