Research Intern – Reinforcement Learning (RL) - Onsite

LevelAI • Bay Area, California • 1m ago

Why This Job is Featured on The SaaS Jobs

This Research Intern role sits at the intersection of SaaS and applied AI, where reinforcement learning is being tied directly to customer experience workflows. The listing points to a productized platform spanning conversation intelligence, multimodal understanding, and agentic systems, which is a distinctly SaaS-shaped problem space because model performance must translate into reliable behavior across real customer interactions.

For a SaaS career, the standout value is exposure to the full loop between research and production. Work such as defining reward signals, structuring interaction traces into training datasets, and evaluating systems with real-world feedback mirrors how modern SaaS companies operationalize machine learning. That experience tends to transfer well across AI-enabled SaaS roles because it builds intuition for instrumentation, iteration cycles, and the practical constraints of deploying learning systems.

The role is best suited to early-career candidates who want hands-on ownership of experiments and are comfortable moving between theory and implementation. It also fits someone who prefers concrete problem framing, measurable outcomes, and collaboration with engineering and product partners to land research work in shipped software.

The section above is editorial commentary from The SaaS Jobs, provided to help SaaS professionals understand the role in a broader industry context.

Job Description

🚀 Build the next generation of Agentic AI with us

Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agents across the entire customer experience lifecycle.

A core part of this vision is our investment in custom Small Language Models (SLMs)—purpose-built for CX workflows—paired with reinforcement learning systems that continuously improve decision-making in real-world environments.

We’re looking for a Research Intern (Reinforcement Learning) to join us in shaping this future.

What you’ll do

Design and build reinforcement learning environments that model real-world customer interaction workflows.
Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops
Define reward models and feedback loops using real-world signals (outcomes and human feedback)
Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning
Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making
Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale.

What we’re looking for

Currently pursuing (or recently completed) a degree in Computer Science, AI, Machine Learning, or related field
Strong understanding of reinforcement learning fundamentals
Familiarity with RL environments and training libraries such as Verl and Tinker
Strong foundation in probability, math, and optimization
Passion for building real-world AI systems