AI Alignment Engineer: RLHF & Reward Modeling

Odixcity Consulting · Remote, Remote, South-Africa

Location
Remote
Job Type
Full-time
Posted
June 17, 2026

Job Description

Odixcity Consulting is hiring an RLHF Specialist to enhance and align AI models using reinforcement learning methodologies. This role involves designing feedback pipelines, generating high-quality preference data, and collaborating with machine learning engineers. Candidates should have at least 2 years of experience in relevant fields, strong Python skills, and familiarity with deep learning frameworks. The position is remote, allowing for global collaboration on cutting-edge AI technologies.
#J-18808-Ljbffr

Ready to Apply?

Submit your application for AI Alignment Engineer: RLHF & Reward Modeling at Odixcity Consulting

Apply Now