About GitGuardian
GitGuardian is a global post-Series B cybersecurity scale-up, with team members in France, Europe and in the USA.
Among our early investors who saw our market value proposition, are the co-founder of GitHub, Scott Chacon, along with Solomon Hykes, Docker's co-founder. American and European top-tier VC firms have also invested in GitGuardian.
GitGuardian leads the way in Non-Human Identity security, offering end-to-end solutions from secrets detection in code, productivity tools and environments to strong remediation, observability and proactive prevention of leaks. Our solutions are already used by more than 600K developers worldwide!
About your team and your mission
As our Lead Data Engineer, you will lead and grow a small department (2 people) of passionate and highly-skilled engineers who are pivotal to GitGuardian's growth and success. Our team's scope is uniquely broad, encompassing the entire data lifecycle—from ingestion and transformation to delivering critical business insights and enhancing our product with data-driven features. You will work with massive and growing data, including over 40 billion code files and more than 600,000 users, to solve compelling challenges. You'll be at the crossroads of Business Intelligence and Product development, with a direct impact on our company's trajectory.
Your main responsibilities will be to:
- Lead the architectural design and strategic development of our data platform, ensuring its scalability, reliability, and performance to meet the demands of a high-growth environment.
- Manage the team's roadmap, projects, and priorities, ensuring alignment with company objectives and effective communication with multiple teams and interfaces.
- Act as a key partner to stakeholders across Sales, Marketing, and Product, translating complex business questions into technical requirements and data-driven solutions.
- Define and own the data modeling strategy in our Snowflake data warehouse, transforming raw data into clean, business-ready tables that serve as the single source of truth.
- Develop and optimize robust ELT pipelines, leveraging tools like Fivetran and Airbyte for standard sources and building custom connectors where needed, to ingest data from a wide variety of sources including PostgreSQL, Elasticsearch, and third-party APIs (e.g., Hubspot, GitLab, Notion).
- Architect and implement sophisticated data models in our Snowflake data warehouse, transforming raw data into clean, business-ready tables that serve as the single source of truth.
- Power our customer-facing analytics, developing and exposing data within the GitGuardian product to provide our users with valuable insights.
- Develop innovative, data-powered applications and services, such as an application to identify the most dangerous secret leaks, or a service capable of scanning billions of files at scale, which directly enhance our product offering.
- Mentor, guide, and grow the data engineering team, fostering a culture of technical excellence, innovation, and continuous improvement.
Technical Environment
- Core Languages: Python, SQL
- Data Warehouse: Snowflake
- Orchestration & Deployment: Dagster, Kubernetes, Docker, Terraform
- Data Ingestion: Fivetran, Airbyte, custom scripts
- Data Visualization: Metabase
- Data Sources: PostgreSQL, Elasticsearch, various APIs
About you
If you think you match at least 70% of these criteria, please apply!
Here's what we consider essential for success in this role:
- 5+ years experience in data engineering, with a proven track record of building and managing complex data platforms in a cloud environment.
- Expert-level proficiency in Python and SQL; you are comfortable writing clean, efficient, and production-ready code.
- A comprehensive understanding of modern data architectures, including data warehousing, data modeling, and ETL/ELT best practices.
- Hands-on experience with our core technologies or their equivalents (e.g., Airflow instead of Dagster; Redshift or BigQuery instead of Snowflake).
- A proven ability to bridge the gap between business and technology, with a strong business sense and the skill to translate complex requirements into impactful technical solutions.
- Excellent communication and project management skills, with the ability to articulate technical concepts and manage priorities with both technical and non-technical audiences.
The following skills would strengthen your application but aren't required:
- Prior experience in a leadership, mentorship, or management role is a strong plus, but we welcome applications from senior engineers ready to take their first step into a leadership position.
- Previous experience in a high-growth B2B SaaS or cybersecurity company.
- Experience with containerization and orchestration technologies (Docker, Kubernetes).
- Knowledge of Infrastructure as Code (e.g., Terraform).
The interview process
1. Video call with a recruiter
To discover your professional project and evaluate if there could be a mutual match.
2. Interview with your future manager (60 minutes)
Purpose: To dive deep into your leadership philosophy, project management experience, and strategic vision for a Data function. We want to understand how you approach stakeholder management, translate business needs into a technical roadmap, and foster team growth.
Skills Assessed: Leadership and mentorship philosophy, project and stakeholder management, strategic data architecture, business acumen, and ownership.
3. Technical interview (60 minutes)
Purpose: You will work through a real-world system design or data architecture challenge. The goal is to see how you decompose complex problems, evaluate trade-offs, guide technical discussions, and articulate a clear technical strategy.
Skills Assessed: System design, technical decision-making and trade-offs, collaborative problem-solving, code review standards, and your ability to mentor and elevate the team's technical capabilities. Your hands-on proficiency in Python and SQL will be evaluated within this context.
4. Final interview with an Executive Manager (30 minutes)
Purpose: To ensure your vision and working style align with GitGuardian's broader company strategy and values.
Skills Assessed: Your high-level business acumen, strategic thinking, and cultural alignment. We want to understand your potential long-term impact on the company.
Benefits
- 💰 Package that includes stock-options
- 🍜 Lunch voucher (Swile)
- 🏥 Non-charged health insurance for children (Sidecare / Generali)
- 💻 Up to €300 to improve your home office set-up
- 🌴 Yearly holiday allowance
- 🤝 Referral bonus of 4000€ for any new Guardian we might hire thanks to you
- 🎡 Team building: monthly budget dedicated to each employee that you can spend as you wish, with colleagues (latest examples to date: Michelin star restaurant, karaoke, stand-up show, kitesurfing week-end, ...)
And also...
- 🏡 Remote policy: hybrid (2 days/week at the office) or full-remote (but 3 days/month at the office)
- 📈 Opportunities for career development in the long term
More about GitGuardian!
Products
- Understand how GitGuardian works in this short video!
- Want to go even further? Check out our public roadmap!
- Check out the State of Secrets Sprawl Report to understand our mission and the industry.
- Our solutions are already used by hundreds of thousands of developers in all industries and GitGuardian platform is the n°1 app on the GitHub marketplace 🔥
Clients
- GitGuardian helps organizations find exposed sensitive information that could often lead to tens of millions of dollars in potential damage.
- More than 70% of our customers are in the United States.
- Many F500 companies use GitGuardian's platform.
People
- The Guardians are knowledgeable, committed, serious, aligned with the company’s mission, and true team players: always willing to help each other grow our skill sets!
- The team is diverse and we hail from more than 20 different countries.
- We are also agile, remote-friendly, and fun people to work with.
- You will get trust & autonomy on your perimeter with a very transparent internal communication and a strong impact on the company development.