Location
Shenzhen
Job Type
Full-time
Posted
May 22, 2026
Job Description
Some careers have more impact than others.
If you’re looking for a career where you can make a real impression, join HSBC and discover how valued you’ll be.
We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering (Model Hosting/Inference Optimisation).
Business: CTO Platforms (AI Platforms)
Location: Shenzhen / Guangzhou
Req ID: 44990
Principal responsibilities
- Design, build, and operate scalable, reliable model hosting platforms for LLMs, embeddings, and STT/TTS across heterogeneous hardware.
- Drive inference optimisation for latency, throughput, and cost (quantisation, KV-cache optimisation, dynamic/continuous batching).
- Evaluate, integrate, and tailor inference frameworks (e.g., vLLM, TensorRT-LLM,...
Ready to Apply?
Submit your application for Associate Director, Software Engineering (Model Hosting/Inference Optimisation) at HSBC Global Services Limited
Apply Now