Join Algolia’s Data Engineering team. We gather data across all company domains and build and maintain the infrastructure and services that power internal analytics for analysts, data scientists, product managers, and engineers. We are the internal eyes of the company, a critical, central team for Algolia and its business. We aim to build a state-of-the-art data platform and keep modernizing it by staying alert to new technologies. This role is with Algolia and is based in France; you can work from our Paris office or fully remotely.
Scale:
- 3000+ TB data lake and warehouse, growing fast
- 60 Airflow DAGs
- 800+ dbt models supported
- 50+ data sources across clouds, APIs, formats, internal and third-party systems
What's ahead:
We have started a transition from Redshift to Databricks to modernize the platform and scale for the future. The foundation is new. There is still a lot to build and many interesting challenges.
What you will do:
- Be a key contributor in a mature data engineering team composed of senior engineers
- Design, build, and operate reliable batch and streaming pipelines
- Improve orchestration, testing, observability, and cost efficiency
- Interact with many stakeholders across Product, Engineering, and Analytics
- Take strong ownership of what you build and maintain
- Share your expertise with other technical teams
Must haves:
- 8+ years of experience in data engineering
- Expertise with cloud platforms (AWS, GCP, or Azure)
- Expertise with orchestration systems (Airflow, Dagster, or similar)
- Expertise with data lakes and warehouses (Databricks, Snowflake, BigQuery, or Redshift)
- Strong Spark and SQL skills
- Familiarity with infrastructure tools (Terraform, Docker)
- Familiarity with coding best practices (Python, unit testing, CI)
- Awareness of and interest in data engineering and modern development practices
- Motivation to build a state-of-the-art platform
- Motivation to work in a team-oriented culture
- Excellent communication skills
Nice to have:
- Familiarity with dbt or similar frameworks
- Familiarity with BI tools (ThoughtSpot, Hex)
We’re looking for someone who can live our values:
- GRIT - Problem-solving ability and perseverance in an ever-changing and growing environment
- TRUST - Willingness to trust our co-workers and to take ownership
- CANDOR - Ability to receive and give constructive feedback
- CARE - Genuine care about other team members, our clients, and the decisions we make in the company
- HUMILITY - Aptitude for learning from others, putting ego aside
Team’s current stack:
- AWS infrastructure
- S3 data lake
- Databricks SQL Warehouse and Workflows
- Airflow (MWAA)
- Kafka for real-time data
- AWS Glue, EMR, Kinesis
- Redshift and Athena (being replaced by Databricks)