This position is no longer available.

Site Reliability Engineer

Permanent contract
Paris
Salary: Not specified
No remote work

Fintecture

The position

Job description

We are building a microservice ecosystem of applications serving a multitude of payment and data services for end users.

This is a great opportunity to help Fintecture build a new Site Reliability Engineering - Operations team in EMEA. The team will be responsible for detecting and mitigating customer-impacting incidents at Fintecture and for building solutions that support the reliability and availability of Fintecture's services.

You will be the first member of a newly created team that implements this function in EMEA and partners with our strong Engineering and Product organizations. The ideal candidate will have experience in technical operations roles (ideally SRE) and programming skills.

You enjoy active participation in incident response, supporting and troubleshooting large scale distributed systems and partnering with teams to improve reliability.

Responsibilities

  • Discover problems within distributed cloud native applications using logs, telemetry and alerting.
  • Mitigate urgent problems and work with teammates to solve underlying issues.
  • Write code to automate the mitigations and improve tools.
  • Teach others how to detect and fix repeat problems.
  • Provide and institute proven practices around reliability, remediations, and troubleshooting.
  • Build vital and efficient tooling that lowers the barrier to entry for engineering teams to plug in and enjoy the benefits of reliability.
  • Manage the run team and organize tickets.

We offer:

  • To be a key member of a fast growing and ambitious startup
  • Startup environment with work flexibility
  • Career development
  • Integration in a fast-growing company
  • Remote work possible

Preferred experience

  • Experience in a Software, Infrastructure, Systems, and/or Site Reliability Engineering role.
  • A successful track record of troubleshooting distributed systems during service incidents while remaining level-headed.
  • Knowledge of Kubernetes, Networking, Databases and messaging system…
  • Experience in monitoring large-scale SaaS-type products or services.
  • Experience in a software development environment.
  • A strong curiosity for the unknown and not stopping until you have a solid understanding.
  • An understanding of what makes up the incident lifecycle.
  • Customer-first mindset.
  • Knowledge of GCP is a plus.

Recruitment process

  • Short call with a tech talent recruiter (30 min)
  • Technical test (take-home, 5 to 7 days)
  • Debrief with the Lead DevOps
  • Call with Head of Engineering
  • Call with CTO
  • Call with HR
