Control Reinforcement Learning: Interpretable Token-Level Steering of LLMs via Sparse Autoencoder Features

Published on

March 9, 2026

Share this

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.