Senior ML Inference Engineer — Model Efficiency

Cohere · montreal (administrative region), qc, Canada

Location
montreal (administrative region)
Job Type
Full-time
Posted
June 04, 2026

Job Description

A leading AI technology company is seeking a Member of Technical Staff to enhance model efficiency. This role involves improving performance metrics, optimizing bottlenecks, and collaborating with various teams. The ideal candidate has 5+ years in high-performance coding, strong skills in C++ or Python, and familiarity with large language models. Competitive perks include a flexible work environment, health benefits, and generous vacation time.
#J-18808-Ljbffr

Ready to Apply?

Submit your application for Senior ML Inference Engineer — Model Efficiency at Cohere

Apply Now