Note! Apply link will take you to an external website.

Senior System Reliability Engineer
HungerStation, United Arab Emirates

Experience

1 Year

Salary

0 - 0

Job Type

Full Time Job

Job Shift

Morning Job

Job Category

Pharma & Biotechnology

Traveling

Career Level

Non-Managerial

Telecommute

Qualification

Bachelor's Degree

Total Vacancies

1 Job

Posted on

Feb 23, 2021

Last Date

Mar 23, 2021

Location(s)

Dubai, United Arab Emirates

Job Description

Purpose:

System Reliability Engineers (also known as Site Reliability Engineers) are responsible for the keeping all user-facing services (most notably HungerStation.com) and many other HungerStation production systems running smoothly 24/7/365. SREs are a blend of operations gear-heads and software crafters that apply sound engineering principles, operational discipline and mature automation, specializing in systems, whether it be networking, the Linux kernel, or even a specific interest in scaling, algorithms, or distributed systems.

Responsibilities:

Be on a PagerDuty on-call rotation to respond to HungerStation.com availability incidents.
Use your on-call shift to prevent incidents from ever happening.
Manage our infrastructure with Terraform and Kubernetes and other similar tools.
Make monitoring and alerting trigger based on symptoms and not on outages.
Document every action so your learnings turn into repeatable actions and then into automation.
Improve the deployment process of all services to make it as boring as possible.
Design, build and maintain core infrastructure pieces that allow HungerStation scaling to support tens of thousands of concurrent users.
Debug production issues across services and levels of the stack.
Plan the growth of HungerStation's infrastructure.

Requirements

4- 6 years of relevant experience
Bachelor Degree in a relevant field is required
Master’s degree in a relevant field is preferred

You may be a fit to this role if you:

Think about systems - edge cases, failure modes, behaviors, specific implementations.
Know your way around Linux and the Unix Shell.
Know what is the use of config management systems like Terraform, Ansible, Chef . etc.
Have strong programming skills - Ruby and/or Go.
Have an urge to collaborate and communicate asynchronously.
Have an urge to document all the things so you don't need to learn the same thing twice.
Have a proactive, go-for-it attitude. When you see something broken, you can't help but fix it.
Have an urge for delivering quickly and iterating fast.
Share our values, and work in accordance with those values.
Have experience with Docker, Kubernetes, Prometheus and other cloud-native tools.

Job Specification

Job Rewards and Benefits

More Jobs like this Job

Senior System Reliability Engineer Jobs in Dubai United Arab Emirates
Jobs in this company

HungerStation

Information Technology and Services - Riyadh, Saudi Arabia