Senior Site Reliability Engineer
Grab
- Beijing
- Permanent
- Full-time
- Provide automated solutions for manual tasks and challenges we're facing.
- Manage cloud infrastructure using Infrastructure as Code solutions.
- Get involved in deep diagnosis of incidents, and engage with multiple accomplished engineering teams on resolutions.
- Support different teams in making long-term infrastructure decisions, and provide suggestions for infrastructure optimizations.
- Drive operational excellence practices and work with engineering teams to address reliability, stability and scalability challenges.
- Lead junior engineers to complete projects and help them grow.
- Mentor other engineers, define our technical culture, and help build a fast-growing team.
- Bachelor's degree in Computer Science, Software Engineering, Information Technology or related technical field involving coding.
- Preferably at least 4 years of relevant experience in this role.
- Solid experience with algorithms, data structures, complexity analysis and software design.
- Experience in one or more of the following: Go, Python, C, C++, Java, Perl or Ruby.
- Experience using service monitoring, logging, and alerting environments and tools.
- Experience in system troubleshooting in a Linux environment.
- Solid experience using Linux commands and shell scripts, with the ability to automate routine tasks.
- Solid experience with automation and provisioning tools (e.g. Jenkins, Ansible/Chef/SaltStack/Puppet).
- Solid experience clarifying vague problems and breaking them down into goals and workable solutions.
- Accountable, takes ownership, and is open to learning new technologies.
- Proficiency in verbal and written English.
- Experience in Golang.
- Experience with cloud-based large-scale infrastructure from cloud providers such as AWS, Azure or Google Cloud Platform. Certification preferred.
- Experience with containerization technologies (e.g. Docker) and container orchestration platforms (e.g. Kubernetes). Certification preferred.
- Experience building high-throughput streaming services, and knowledge of stream processing frameworks such as Flink.
- Experience contributing to open source projects, and experience with performance analysis and debugging tools.
- We have your back with Term Life Insurance and comprehensive Medical Insurance.
- With GrabFlex, create a benefits package that suits your needs and aspirations.
- Celebrate moments that matter in life with loved ones through Parental and Birthday leave, and give back to your communities through Love-all-Serve-all (LASA) volunteering leave.
- We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through life's challenges.
- Balancing personal commitments and life's demands is made easier with our FlexWork arrangements, such as differentiated hours.