💼 Full-Time Position

AI Inference & Compression Engineer

🏢
Beijing Foreign Enterprise Management Consultants Co.,Ltd.
📍 singapore, singapore, Singapore
📍
Location
singapore, Singapore
📅
Posted
June 27, 2026
Type
Full-Time
🎯

Full-Time Opportunity: This is a permanent, full-time position with a competitive package and real career growth potential.

Job Description

On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as AI Inference & Compression Engineer.

Key Responsibilities
  • LLM Inference Acceleration. Research and develop advanced compression algorithms to accelerate LLM serving. Focus on KV cache optimization, model quantization, and resolving memory bandwidth bottlenecks during autoregressive decoding.
  • Classical Codec Development. Design and implement advanced video compression algorithms, focusing on improving Rate–Distortion performance, optimizing entropy coding, and enhancing quantization design for real-world applications.
  • AI-Based Media Coding. Develop and optimize AI-based video coding components, including AI-based loop filters, optical flow, and intelligent rate control.
  • Model Deployment & Fusion. Bridge the gap between AI research and production. Optimize deep learning models ...