Hot: reinforcement-learning