
Senior Production Support Engineer
- Shanghai
- Permanent
- Full-time
- Monitor, troubleshoot, and resolve production incidents for local and global banking applications in a timely manner to minimize downtime.
- Provide L1 and L2 support, including initial triage, diagnostics, and resolution, and collaborate with application support teams and vendors for partial L3 support to address complex issues.
- Act as the primary point of contact between local teams in China and global/regional system teams, including SRE and DevOps teams, to ensure seamless incident resolution and system alignment.
- Coordinate with global teams to manage incidents affecting distributed banking systems, ensuring consistency in processes and standards.
- Work closely with application support teams to resolve escalated issues and implement fixes for production systems.
- Engage with the bank's operational resilience project team to align on initiatives for system robustness, disaster recovery, and regulatory compliance.
- Collaborate with internal IT/tech center staff and external vendors to manage service-level agreements (SLAs) and ensure effective incident resolution.
- Lead post-incident root cause analysis (RCA) and coordinate with problem management teams to identify and implement preventive measures.
- Drive initiatives to reduce recurring incidents and improve system stability.
- Oversee monitoring systems (e.g., Splunk, Nagios) to proactively detect issues and analyze performance metrics.
- Provide regular reports to senior management on system health, incident trends, and SLA adherence.
- Enhance support processes, tools, and documentation to improve operational efficiency and response times.
- Collaborate with SRE and DevOps teams to integrate automation and resilience practices into production support workflows.
- Ensure compliance with China's regulatory requirements (e.g., Cybersecurity Law, data localization) and global banking standards.
- Work with security teams to protect sensitive financial data during incident resolution.
- Manage and mentor a team of production support engineers, fostering a culture of collaboration, accountability, and technical excellence.
- Ensure team readiness for on-call support and efficient incident handling.
- Education:
- Experience:
Proven experience supporting complex banking applications in a global banking environment.
Experience in L1/L2 support and coordination with application teams/vendors.
- Technical Skills:
Monitoring Tools: Proficiency in Splunk, Nagios, Zabbix, or similar for real-time system monitoring.
Scripting: Basic scripting skills in Bash, Python, or PowerShell for automating support tasks.
Database: Familiarity with SQL (e.g., MySQL, Oracle) for querying and troubleshooting database issues.
Networking: Understanding of TCP/IP, DNS, and firewalls for diagnosing connectivity issues.
Incident Management: Experience with Jira, ServiceNow, or Remedy for tracking and resolving incidents.
Banking Systems: Knowledge of banking applications and regulatory compliance in China.
- Communication Skills:
Ability to communicate technical issues clearly to non-technical stakeholders, including bank operations and compliance teams.
- Soft Skills:
Proactive mindset with a commitment to driving operational excellence and process improvement.
- Additional Requirements:
Ability to work across time zones to coordinate with global and regional teams.
Strong understanding of banking systems and compliance with local and global regulations.
You'll achieve more when you join HSBC.
www.hsbc.com.cn/careers
HSBC is committed to building a culture where all employees are valued, respected and opinions count. We take pride in providing a workplace that fosters continuous professional development, flexible working and opportunities to grow within an inclusive and diverse environment. Personal data held by the Bank relating to employment applications will be used in accordance with our Privacy Statement, which is available on our website.
Issued by HSBC Bank (China) Company Limited