The Opportunity :
Our infrastructure team in Bangkok is searching for an experienced System Engineer to help us build and manage our infrastructure. We are looking for somebody with hands-on experience in infrastructure system lifecycle management, writing scripts and tools to development to interface with the system management APIs exposed by OpenStack, Ceph, Kubernetes, etc.
Responsibilities / Key skills :
- Design and development of automation toolsets to help drive efficiency in Agoda’s IT infrastructure (bare metal deployment, software installation / patching, monitoring and remediation, etc.).
- Manage incidents and daily operational tasks on production and development environments, occasionally outside of business hours
- Provide expert advice and guidance to other infrastructure team staff and software developers; can effectively mentor less experiences staff.
- Lead and manage implementation projects from end to end, working across multiple team and departments.
- Conducts performance tuning and troubleshooting investigations, working across the entire organization
- Coordinate datacenter operations tasks with remote DCOE staff (server / rack / row / cage provisioning, rolling replacements, power & temperature management, etc.)
Experience / Requirements :
At least 5 years of IT operations experience in LARGE heterogeneous environmentsMust have Kubernetes experience , understanding of complex Kubernetes architectures and can effortlessly design and manage scalable, resilient, and efficient containerized environments.Competent in one or more common scripting / automation languages : Python (mandatory), Go (mandatory) , YAML, ruby, JavaScript, bash, PowerShell, ansible playbook.Expert in Grafana dashboard development and query languages (SQL, JQL, Elasticsearch, PromQL).Excellent troubleshooting skills, deep dive analysis, capable to break down issues into testable hypotheses and develop tools to assist during troubleshooting. Can troubleshoot “full stack” issues.Preferred / Advantage
Some experience in CI / CD, preferably form a DevOps background, gitlab (in dept is advantage)Able to work under pressure and deliver projects on time.High sense of ownership. Actively looks for lingering problems and proactively fixes them.Good knowledge of networking architecture within complex e-commerce environmentsGood English skills, strong analytical skills, eager to learn new things.Self-motivated, approachable and adaptable, with good communication skills (working language is English).