Senior System Reliability Engineer
HungerStation, United Arab Emirates

Experience: 1 Year
Salary: 0 - 0
Traveling: No
Telecommute: No
Qualification: Bachelor's Degree
Total Vacancies: 1 Job
Posted on: Feb 23, 2021
Last Date: Mar 23, 2021

Job Description

Purpose:

System Reliability Engineers (also known as Site Reliability Engineers) are responsible for keeping all user-facing services (most notably HungerStation.com) and many other HungerStation production systems running smoothly 24/7/365. SREs are a blend of operations gear-heads and software crafters who apply sound engineering principles, operational discipline and mature automation, specializing in systems, whether it be networking, the Linux kernel, or even a specific interest in scaling, algorithms, or distributed systems.


Responsibilities:

  • Be on a PagerDuty on-call rotation to respond to HungerStation.com availability incidents.
  • Use your on-call shift to prevent incidents from ever happening.
  • Manage our infrastructure with Terraform and Kubernetes and other similar tools.
  • Make monitoring and alerting trigger on symptoms, not on outages.
  • Document every action so your learnings turn into repeatable actions and then into automation.
  • Improve the deployment process of all services to make it as boring as possible.
  • Design, build and maintain core infrastructure pieces that allow HungerStation to scale to support tens of thousands of concurrent users.
  • Debug production issues across services and levels of the stack.
  • Plan the growth of HungerStation's infrastructure.

Requirements

  • 4-6 years of relevant experience
  • Bachelor's degree in a relevant field is required
  • Master’s degree in a relevant field is preferred

You may be a fit for this role if you:

  • Think about systems - edge cases, failure modes, behaviors, specific implementations.
  • Know your way around Linux and the Unix Shell.
  • Know what config management systems like Terraform, Ansible, Chef, etc. are used for.
  • Have strong programming skills - Ruby and/or Go.
  • Have an urge to collaborate and communicate asynchronously.
  • Have an urge to document all the things so you don't need to learn the same thing twice.
  • Have a proactive, go-for-it attitude. When you see something broken, you can't help but fix it.
  • Have an urge to deliver quickly and iterate fast.
  • Share our values, and work in accordance with those values.
  • Have experience with Docker, Kubernetes, Prometheus and other cloud-native tools.

HungerStation

Information Technology and Services - Riyadh, Saudi Arabia