Research Engineer - Post-training

San Francisco, New York or Seattle

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As a Research Engineer in post-training, you’ll help develop novel techniques and datasets to maximize model performance for real-world applications, leveraging data and compute at scale. You’ll enable our models to complete engineering, code review, and software design tasks in large, real-world codebases while incorporating cutting-edge reinforcement learning (RL) methods.

What you might work on:

Research and develop innovative post-training techniques and reinforcement learning strategies to enable models to autonomously generate, debug, and optimize software
Build dynamic reward systems and feedback pipelines to align model outputs with human-like decision-making in software development
Scale up synthetic dataset generation and evaluations to drive iterative improvements in autonomous coding and problem-solving tasks
Improve model capabilities for generating substantial, high-quality, functional code
Design scalable approaches for evaluations and synthetic dataset generation that align with reinforcement learning objectives
Explore and implement novel methods to align AI behavior with human intent, ensuring reliability and performance in high-stakes environments

What we’re looking for:

Strong experience deploying and fine-tuning LLMs for real-world applications.
Strong general software engineering skills
Thorough knowledge of the deep learning literature
Expertise in reinforcement learning techniques such as actor-critic, self-play or self-evaluation, or RLHF
Ability to come up with and evaluate novel research ideas
Obsession with details, reliability, and good testing to ensure data quality and integrity
Willingness to dive deeply into a large ML codebase to debug
Passion for building systems that redefine software engineering through fully autonomous AI

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

Our culture:

Integrity. Words and actions should be aligned
Hands-on. At Magic, everyone is building
Teamwork. We move as one team, not N individuals
Focus. Safely deploy AGI. Everything else is noise
Quality. Magic should feel like magic

Compensation, benefits and perks (US):

Annual salary range: $100K - $550K
Equity is a significant part of total compensation, in addition to salary
401(k) plan with 6% salary matching
Generous health, dental and vision insurance for you and your dependents
Unlimited paid time off
Visa sponsorship and relocation stipend to bring you to SF, if possible
A small, fast-paced, highly focused team