
Senior Consultant Specialist
- Guangzhou, Guangdong
- Contract
- Full-time
- Design, build, and maintain scalable and reliable systems.
- Monitor system performance and troubleshoot issues to ensure high availability.
- Develop and implement automation tools and frameworks for deployment and monitoring.
- Collaborate with software engineering teams to improve service reliability and performance.
- Create and maintain documentation for systems and processes.
- Participate in on-call rotations and incident response.
- Conduct post-mortem analyses to identify root causes of outages and implement preventive measures.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Proven experience in a Site Reliability Engineer or similar role.
- Strong knowledge of cloud services (AWS, Google Cloud, , etc.).
- Proficiency in scripting and programming languages (e.g., Python, Go, Bash).
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Understanding of networking concepts and protocols.
- Excellent problem-solving skills and a proactive attitude.
- Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI).
- Knowledge of configuration management tools (e.g., Ansible, Chef, Puppet).