Principal Machine Learning Researcher Job at Alldus, San Jose, CA

ZDJXUWZNRGNkV0pGWWtROWM4TU9mL25XR0E9PQ==
  • Alldus
  • San Jose, CA

Job Description

Principal / Director, AI Research – Reinforcement Learning for LLMs

We're hiring a Principal or Director-level AI Researcher with deep expertise in Reinforcement Learning and LLM post-training to join our growing AI research group. This is a research-first role, with a mandate to push the frontier of model alignment, safety, and performance — working with foundation models in real-world, high-stakes environments.

You won’t be handed toy problems or legacy systems. Instead, you'll lead applied research efforts focused on tuning, aligning, and optimizing large models for privacy, security, and interpretability - in one of the few spaces where LLMs have both massive scale and measurable consequences.

What You’ll Work On:

This role centers on building and refining intelligent agents that interact with sensitive data and complex access controls, using modern reinforcement learning and post-training techniques:

  • Post-training of LLMs using RL: Design and run experiments with methods like PPO, DPO, RLAIF, and other fine-tuning strategies to align model behavior with security and privacy goals
  • RL for Self-Correction & Redaction: Enable models to iteratively improve their predictions on document classification, redaction, and identity resolution through self-rewarded feedback loops
  • Model Alignment & Safety: Contribute to the development of our “LLM Firewall” — filtering prompts/responses to prevent jailbreaking, data leakage, and adversarial exploits
  • Inference Stack & Optimization: Collaborate with engineers optimizing our in-house inference stack to make LLaMA-class models performant at scale

What We’re Looking For:

  • Demonstrated expertise in Reinforcement Learning applied to language models or decision-making agents
  • Strong understanding of post-training methodologies (e.g., RLHF, DPO, preference modeling, rejection sampling, offline RL)
  • Solid background in LLMs , token-level reasoning , and language modeling internals
  • Publication record or research contributions in top-tier venues (NeurIPS, ICLR, ICML, ACL, etc.) preferred
  • Ability to work independently and iterate quickly — experience in scrappy, high-output research environments a plus
  • Industry experience is not required — we care more about the depth of your research thinking and experimentation rigor

Why This Role:

  • Join a company with massive real-world data , impactful use cases, and a mature infrastructure
  • Avoid the grind of infra-focused roles — we’ve already solved those problems
  • Shape the next phase of LLM alignment , self-correcting models , and AI safety at inference time
  • Work on problems with technical depth and direct product impact

Job Tags

Similar Jobs

Professional Placement Services

Recruitment Consultant Job at Professional Placement Services

 ...Evaluate applicants by discussing job requirements and applicant qualifications Manage the interview process to ensure a positive experience for candidates Follow up and strategize with managers to determine their recruiting effectiveness Coordinate interviews... 

bet365

Senior UI-UX Designer Job at bet365

 ...You will work within the Product Design team in the Design and UX department, who are responsible for the strategic design, visual direction and development of our product. With a new focus on the US market, we are looking to craft innovative mobile app experiences that... 

Clarity Recruiting

Executive Administrative Assistant Job at Clarity Recruiting

 ...Administrative Assistant to join our team on a contract basis . This is an exciting opportunity to support a group of exceptional investment professionals while gaining exposure to a fast-paced, high-performing environment that values curiosity, collaboration, and... 

Gamett & King CPAs

Tax Accountant - Staff Job at Gamett & King CPAs

 ...Tax Accountant - Staff **Location:** Henderson, NV **Type:** Full-Time Overview As a Staff Accountant, you will prepare individual and business tax returns, assist in tax planning engagements, and support client compliance needs under the supervision of senior... 

Ernest

Procurement Specialist Job at Ernest

Merchandiser (Procurement Specialist) Location : Avondale, AZ | Full-Time | On-Site Competitive Base | Benefits | Growth-Oriented Culture Be the Buyer Behind the Brand At Ernest, our merchandisers (we call them merchandisers because, well, they do a lot more...