Senior Site Reliability Engineer

Формат: Remote Повна зайнятість
Leading international iGaming company is looking for a Senior Site Reliability Engineer to join their team. The company is recognized for platform stability, strong engineering culture, and top-notch security standards, operating among the top slot products providers.

Key Requirements:
– Hands-on experience with Kubernetes (deployment, scaling, troubleshooting)
– Proficiency with FluxCD / ArgoCD and Helm
– Willingness to participate in 24/7 support
– Strong knowledge of AWS, Terraform, Docker
– Experience building and maintaining CI/CD pipelines
– Familiarity with monitoring tools (Prometheus, Grafana, Datadog) and logging (ELK Stack, AWS CloudWatch)
– Solid understanding of networking and security principles
– Scripting in Python, NodeJS, or Go
– Experience with Git and incident management tools (PagerDuty, Opsgenie, etc.)

Responsibilities:
– Ensure platform stability: monitor alerts, perform system checks, escalate incidents
– Participate in on-call rotations and handle critical incidents
– Deploy and maintain EKS/K8s clusters with Terraform and Helm/Flux
– Automate infrastructure processes
– Introduce new technologies and improve monitoring/logging
– Collaborate closely with other tech teams
– Conduct RCA and prevent repeated issues
– Maintain internal documentation and processes
– Define follow-up tasks based on technical investigations

Offer:
– Highly competitive salary
– Fully remote work
– Strong technical team and mature development culture
– Medical insurance (employee + partner), and more…

Send your CV: [email protected]
➡️ Telegram: @Apercon

Apply for this position

Allowed Type(s): .pdf, .doc, .docx