💼 Full-Time Position

RLHF Architect: Align AI with Human Feedback (Remote)

🏢

Odixcity Consulting

📍 , , spain, , , spain, Spain

📍

Location

, , spain, Spain

📅

Posted

June 06, 2026

⏰

Type

Full-Time

🎯

Full-Time Opportunity: This is a permanent, full-time position with a competitive package and real career growth potential.

Job Description

                        A global AI consulting firm is seeking an RLHF Specialist to enhance AI models using Reinforcement Learning from Human Feedback methodologies. This remote role involves generating preference data, designing model tests, and collaborating with teams to improve ML outcomes. Candidates must have a minimum of 2 years in relevant fields, strong Python proficiency, and experience with deep learning frameworks. Ideal for those passionate about AI alignment and optimization with a flexible work environment.
#J-18808-Ljbffr
                    

Job Details

Job Type Full-Time

Location , , spain, , , spain

Country Spain

Posted June 06, 2026

Deadline July 16, 2026

Experience As specified