About Clay
Clay is a creative tool for growth. Our mission is to help businesses grow — without huge investments in tooling or manual labor. We’re already helping over 100,000 people grow their business with Clay. From local pizza shops to enterprises like Anthropic and Notion, our tool lets you instantly translate any idea that you have for growing your company into reality.
We believe that modern GTM teams win by finding GTM alpha—a unique competitive edge powered by data, experimentation, and automation. Clay is the platform they use to uncover hidden signals, build custom plays, and launch faster than their competitors. We’re looking for sharp, low-ego people to help teams find their GTM alpha.
Why is Clay the best place to work in New York?
Customers love the product (100K+ users and growing)
We’re growing a lot (6x YoY last year, and 10x YoY the two years before that)
Incredible culture (our customers keep applying to work here)
Well-resourced (raised a Series B expansion in January 2025 from investors like Sequoia and Meritech)
Read more about why people love working at Clay here and explore our wall of love to learn more about the product.
Senior Site Reliability Engineer @ Clay
In this role, you’ll join our growing infrastructure team in building and fine-tuning our infrastructure to keep our services running smoothly. We’re looking for someone who’s excited about automation and continuous improvement. While your main focus will be on infrastructure, coding skills are a must. As a growing startup, we all jump in where needed, so you’ll need to be comfortable taking on a variety of roles.
What You’ll Do
Architect, design, implement, and manage robust, scalable, and secure infrastructure solutions.
Develop, maintain, and enforce best practices for CI/CD, infrastructure as code, and automation.
Oversee the management and optimization of cloud infrastructure, ensuring high availability, performance, and cost-efficiency.
Implement monitoring, logging, and alerting solutions to maintain system health and quickly resolve issues.
Lead incident response efforts, troubleshooting and resolving complex issues in a timely manner.
Participate in an oncall rotation.
Work with teams across the company to ensure we achieve the right balance of developer velocity, reliability and performance, and cost efficiency.
What You’ll Bring
5+ years of experience
Experience with containerization and orchestration tools
Strong understanding of CI/CD concepts and tools
Knowledge of infrastructure automation tools
Experience with oncall and incident response
Proficiency in one or more programming languages
Familiarity with our stack or ability to learn unfamiliar technologies quickly:
Aurora Postgres RDS, Elasticache Redis, Docker + ECS, Lambda, OpenSearch
Terraform and Atlantis
CircleCI, Netlify, Playwright
Cloudwatch, Datadog, Mezmo
Typescript, Python