Get more other jobs in your inbox
Verified daily — no ghost listings.
About This RoleAI processing…
Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.
Key Responsibilities
- 1Improve our understanding of RL, what it takes to handle longer horizon tasks, and train with less compute
- 2Train graders to improve performance on coding tasks with non-verifiable reward
- 3Improve the quality and difficulty of datapoints we use for training our models
- 4Realtime RL for coding agents
Requirements
- You have a deep background in RL and strong machine learning fundamentals
- You are truth seeking, aiming to learn more about the science than proving your ideas are correct.
- You should expect significantly more scope and autonomy than in other research labs.
Perks & BenefitsTypical for this role
Apply to This Job in Minutes
Generate ATS-optimized resume + cover letter + interview prep with Jobease.ca AI. Complete your application faster.
75% of AI Resumes Get Rejected
Beat the ATS with Jobease.ca's AI Resume Builder. Optimized for real hiring systems.
Build My ResumeProfile Match
Loading…Checking your profile against this job…
Job Overview
Share This Job
Track All Your Applications
Never lose track again. Jobease.ca organizes every application, interview, and follow-up.
Organize My Search