Site Reliability Engineer (SRE)

Robert Half

  • Shanghai
  • Permanent
  • Full-time
  • 15 days ago
  • Apply easily
Job Description:We're seeking a proactive Site Reliability Engineer to ensure the reliability, scalability, and performance of our cloud-based applications. You'll bridge development and operations, automating systems, monitoring infrastructure, and resolving critical incidents to deliver exceptional user experiences.Key ResponsibilitiesSystem Reliability & Performance
  • Design and maintain scalable, fault-tolerant systems for AWS/Alicloud.
  • Implement monitoring, alerting, and automation tools (Prometheus, Grafana, K8s).
  • Optimize infrastructure for high availability and minimal latency.
Incident Response & Automation
  • Lead incident response, root cause analysis, and post-mortem documentation.
Collaboration & DevOps Culture
  • Partner with Development teams to embed reliability practices.
  • Advocate for automation, observability, and performance engineering.
Capacity Planning & Risk Management
  • Forecast resource needs and plan for scaling infrastructure.
  • Identify and mitigate risks to ensure service stability.
Preferred Skills and Qualifications
  • Education: Bachelor's in Computer Science/Engineering or equivalent.
  • Cloud Proficiency: AWS/Alicloud with experience in containerization (Docker, K8s, ACK).
  • OS Knowledge: Solid understanding of operating systems (Linux preferred) and networking fundamentals.
  • Automation Tools: Terraform, Ansible, Jenkins, Git, Harness.
  • Programming: Python, Go, Java, Javascript/Typescript or similar scripting languages.
  • Monitoring: Prometheus, Grafana, ELK Stack, etc.
  • Problem-Solving: Strong debugging skills and ability to work under pressure.
  • Communication: Fluent English (written/verbal) for cross-team collaboration.
Preferred Experience
  • SRE/DevOps experience in high-traffic environments (preferred).
  • AWS Certified DevOps Engineer, CKA, or equivalent Certifications (preferred AWS Certified DevOps Engineer, CKA, or equivalent).
  • Experience with microservices architecture and large-scale distributed systems (preferred).
By clicking 'apply', you give your express consent that Robert Half may use your personal information to process your job application and to contact you from time to time for future employment opportunities. For further information on how Robert Half processes your personal information and how to access and correct your information, please read the Robert Half privacy notice https://www.roberthalf.cn/en/privacy-statement. Please do not submit any sensitive personal data to us in your resume (such as government ID numbers, ethnicity, gender, religion, marital status or trade union membership) as we do not collect your sensitive personal data at this time.点击"申请",即表示您明确同意 Robert Half 可以使用您的个人信息来处理您的工作申请,并不时与您联系以获得未来的就业机会。 如需进一步了解 Robert Half 如何处理您的个人信息以及如何访问和更正您的信息,请阅读 Robert Half 隐私声明 。请不要在您的简历中向我们提交任何敏感的个人数据(例如身份证号码、种族、性别、宗教、婚姻状况或工会会员身份),因为我们此时不收集您的敏感个人数据。

Robert Half