Job Description
As the Site Reliability Engineer (SRE), you will ensure our customers get the best quality of service and uptime we can give them. You will identify where we can expect IT failures, and how we can tolerate them, both in our own systems and in those we depend upon. You will be responsible for the availability, performance, monitoring, incident response, and general service management of the platforms and services that our company runs and owns.
You will work closely with developers and infrastructure engineers to build and operate services and systems that can handle failures, and to ensure operational deliverables are met. Your role includes ensuring systems remain available, reliable, and efficient as they scale, and involves every aspect of the company's site. You will also design and automate tools and processes for scalability and performance, as well as identify and predict failures. This is an opportunity to work for a leading company in the online gambling industry.
Qualifications
Essential skills:
- Some experience in IT Service Management (ITIL) and an understanding of which parts apply in an agile DevOps environment
- A desire to learn new technologies and apply them where appropriate to improve the quality of our software and processes
- Experience with AWS services
- Experience with Docker/Kubernetes
- Experience with Jenkins
- Infrastructure as Code (ideally Terraform)
- UNIX/Linux systems administration background
- Understanding of TCP/IP network stacks and experience troubleshooting them
- Good experience with HTTP web technologies (Apache/Nginx) and highly available, scalable web architecture
- Good understanding of database administration (ideally MS SQL, AWS RDS and Elasticsearch)
- Experience in at least one configuration management solution (preferably Ansible)
- Experience in using monitoring tools (Splunk/New Relic/Elasticsearch/AWS CloudWatch)
- Programming skills (Python, Bash, Java)
- You love to automate everything!!
- Ensuring compliance with responsible gaming protocols;
- Any other duties as requested by the company.
Attributes:
- Proven experience of automating and deploying software to production;
- Ability to find creative solutions and implement them in a fast-paced work environment;
- Technical analysis experience translating business requirements into functionality;
- Ability to work as part of a team (but take individual ownership and responsibility);
- Excellent written communication skills (articulate and polite);
- Good verbal communication skills;
- Excellent attention to detail;
- A calm manner and an ability to handle difficult situations;
- Ability to work independently and meet deadlines.
Personal Attributes:
A confident individual with a friendly personality, drive, energy, and enthusiasm, together with a strong personal commitment to delivering high standards in a timely, efficient and friendly manner, who works in alignment with the Rank Interactive policies, procedures and values.
Additional Information
Requirements:
- Degree or equivalent qualification desirable (ideally in Computer Science, Mathematics, Engineering or a similar discipline);
- Passionate about quality;
- Excellent verbal and written communication skills and able to interact professionally with a diverse group of people;
- Experience implementing scalable software systems and platforms;
- Exceptional problem-solving skills;