Senior Site Reliability Engineer Full-time Job

3 weeks ago   Engineering   Dubai   78 views Reference: 37461
Job Details

Responsibilities Will Include

Assist in the design and continuously improve our team’s processes, tools and solutions used to build, deploy, monitor, maintain and scale production systems

Assist in the design and improve our monitoring, alerting and remediation solutions with focus on proactively identifying and addressing production issues

Collaborate with platform, support and dev teams for events such as production releases, change management and incident management

Participate in the on-call rotation for critical system alerts

Work in shifts in order to cover an extended time frame including evenings and weekends

Investigate and lead efforts to remediate critical operational production issues

Would be great if you brought this to the role

Minimum Requirements

Excellent communication skills (writing and speaking) in English

In-depth understanding of production management principles for distributed systems

3-5 years experience working with Infrastructure as Code and cloud provisioning tools

3-5 years experience working in operations teams managing production environments

3-5 years experience of utilizing and writing in languages such as Bash, Python, JavaScript and/or Go, or equivalents

3-5 years experience of general Linux experience

Hands-on experience with AWS, experience with Azure is a significant plus

Company Description
A complete suite of trusted products to build anything in web3.