Why This Job is Featured on The SaaS Jobs
This Staff Data Engineer role sits at the core of a SaaS company’s operating system: a centralized data platform that feeds customer facing products and informs product decisions. In SaaS, where usage signals, risk, and customer outcomes are continuously measured, platform data engineering becomes a product enabler rather than a back office function. The remit described points to foundational work that influences multiple teams and downstream experiences.
For a long-term SaaS career, this kind of platform ownership builds durable skills in designing reliable, scalable pipelines across batch and streaming, setting data quality standards, and balancing performance with cost. The emphasis on instrumentation, experimentation, and rapid MVP delivery aligns with how modern SaaS organizations iterate, making the role a strong bridge between pure infrastructure engineering and product analytics needs.
The position is best suited to a senior individual contributor who prefers ambiguous, high-leverage problems and can set technical direction without heavy oversight. It will appeal to engineers who enjoy mentoring, partnering across product and engineering, and being accountable for production systems that must stay dependable as product scope expands.
The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.
Job Description
About the team/role
We're seeking an experienced Staff Data Engineer to join our Data Platform team. This team plays a crucial role in our mission by developing and maintaining scalable data platforms for fair and safe hiring decisions.
As a Staff Data Engineer on the Data Platform team, you'll work on Checkr’s centralized data platform, critical for the company's vision. The centralized data platform is the heart of all key customer facing products. You will work on high-impact projects that directly contribute to the next generation products.
What you’ll do:
- Architect, design, lead and build end-to-end performant, reliable, scalable data platform.
- Be an independent individual contributor who can solve problems and deliver high-quality solutions without oversight and a high level of ownership.
- Mentor, guide and work with junior engineers to deliver complex and next-generation of features.
- Bring a customer-centric, product-oriented mindset to the table - collaborate with customers and internal stakeholders to resolve product ambiguities and ship impactful features
- Partner with engineering, product, design, and other stakeholders in designing and architecting new features
- Experimentation mindset - autonomy and empowerment to validate a customer need, get team buy-in, and ship a rapid MVP
- Quality mindset - you insist on quality as a critical pillar of your software deliverables
- Analytical mindset - instrument and deploy new product experiments with a data-driven approach
- Architect, lead and build a performant, reliable, scalable data platform.
- Monitor, investigate, triage, and resolve production issues as they arise for services owned by the team
- Create and maintain data pipelines and foundational datasets to support product/business needs.
- Design and build database architectures with massive and complex data, balancing with computational load and cost
- Develop audits for data quality at scale, implementing alerting as necessary
- Create scalable dashboards and reports to support business objectives and enable data-driven decision-making
- Troubleshoot and resolve complex issues in production environments
What you bring:
- 10+ years of designing, implementing and delivering highly scalable and performant data platform.
- Experience building large-scale (100s of Terabytes and Petabytes) data processing pipelines - batch and stream
- Experience with ETL/ELT, stream and batch processing of data at scale
- Expert level proficiency in PySpark, Python, and SQL
- Expertise in data modeling, relational databases, NoSQL (such as MongoDB) data stores
- Experience with big data technologies such as Kafka, Spark, Iceberg, Datalake, and AWS stack (EKS, EMR, Serverless, Glue, Athena, S3, etc.)
- An understanding of Graph and Vector data stores (preferred)
- Knowledge of security best practices and data privacy concerns
- Strong problem-solving skills and attention to detail
- Experience/knowledge of data processing platforms such as Databricks or Snowflake.
What you get:
- A fast-paced and collaborative environment
- Learning and development allowance
- Competitive cash and equity compensation and opportunity for advancement
- 100% medical, dental, and vision coverage
- Up to $25K reimbursement for fertility, adoption, and parental planning services
- Flexible PTO policy
- Monthly wellness stipend
#LI-TD1