多模态算法工程师(模型优化方向)/ Multimodal Algorithm Engineer (Model Optimization)
Advanced Micro Devices View all jobs
- Beijing
- Permanent
- Full-time
- Optimize training strategies, fine-tuning, and alignment for multimodal models (VLM / VLA / WAM) on AMD platforms.
- Enhance action prediction, world state modeling, and long-horizon planning capabilities of WAM/VLA models for embodied intelligence scenarios (e.g., robotics, simulation-based interaction).
- Design and implement model optimization techniques including quantization (PTQ/QAT), pruning, knowledge distillation, operator fusion, and KV cache optimization to improve inference latency, throughput, and energy efficiency.
- Collaborate closely with compiler, driver, and system software teams to deeply integrate models into AMD’s software stack.
- Stay at the forefront of research in World Models, action generation, and multimodal agents—and explore novel architectures for AMD’s heterogeneous compute platforms.
- Master’s or PhD in Computer Science, Artificial Intelligence, Robotics, Electrical Engineering, or a related field.
- Hands-on experience with VLMs, VLAs, or WAMs (World Action Models)—especially in robotics decision-making, simulated environment training, or action sequence generation—is highly preferred.
- Proficiency in PyTorch; familiarity with multimodal and embodied AI frameworks.
- Familiarity with simulation platforms such as Isaac Gym, LIBERO, MuJoCo, or RoboTwin.
- Strong software engineering skills and ability to deliver full-cycle solutions—from research prototyping to production deployment.
- Contributions to open-source projects in multimodal agents, world models, or robotics (e.g., OpenVLA, DROID, ACT).
- Publication record in top-tier conferences (e.g., CVPR, ICRA, CoRL, NeurIPS, ICLR) in multimodal learning or embodied AI is a strong advantage.
- Experience with AMD ROCm ecosystem or heterogeneous computing performance tuning.
- Access to cutting-edge AMD compute resources.
- Unique opportunity to shape full-stack co-design across algorithms, compilers, and hardware.
- A collaborative, globally distributed team of world-class AI systems and robotics researchers.