Senior Site Reliability Engineer (SRE)
Activision
- Shanghai
- Permanent
- Full-time
- Native speaker/fluent Mandarin
- Good English speaking and communication skills
- 4-5 years of relevant work experience, including in a high-volume or critical production service environment
- Experience working at scale: thousands of servers running a high-volume or critical production service
- Interest in automation and scripting
- Comfortable with at least one scripting language, e.g. Python, Bash, Perl or Go
- Experience with at least one major database, e.g. MySQL, Cassandra, Redis or Vitess
- Interest in fundamental technologies, e.g. TCP/IP, Linux/Unix internals
- Experience with configuration management systems, e.g. Ansible, Puppet, Terraform
- An investigative approach and excitement about learning new technologies
- Experience in communicating within and across teams
- Willingness to participate in an on-call support rotation as required
- Experience working with container orchestration, e.g. Kubernetes, Helm, Argo
- Strong analytical / troubleshooting skills
- Excellent written and verbal communication skills
- Ability to spend up to 1 week per month on call
- Experience working with public cloud providers and cloud technologies, e.g. AWS, GCP
- Experience with monitoring and metrics systems, e.g. Sensu, Zabbix, Graphite, ELK
- Background in Software Engineering
- Experience scripting in Python or Go
- Experience writing Kubernetes (K8s) operators