Reinforcement Learning

GitHub - agentsea/r1-computer-use: Applying the ideas of Deepseek R1 to computer use

An experimental project that applies large-scale reinforcement learning to computer use, using neural reward models to validate agent actions. The system implements a three-step cycle that extends ReAct into reinforcement learning, and it trains in multiple stages aimed at developing reasoning skills for computer interaction.
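
The summary does not name the three steps of the cycle, so the sketch below assumes they are "reason, act, validate", with the validation score coming from a reward model. Every name in it (`Step`, `run_episode`, `toy_policy`, `toy_reward_model`, `toy_transition`) is hypothetical and is not taken from the r1-computer-use repository. The reward model here is a plain Python function standing in for a neural one. It is meant only to illustrate how a ReAct-style loop might collect rewarded trajectories for later RL training.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Step:
    observation: str
    thought: str
    action: str
    reward: float


def run_episode(
    policy: Callable[[str], Tuple[str, str]],
    reward_model: Callable[[str, str, str], float],
    transition: Callable[[str, str], str],
    initial_observation: str,
    max_steps: int = 3,
) -> List[Step]:
    """Collect one trajectory of (reason, act, validate) steps."""
    trajectory: List[Step] = []
    observation = initial_observation
    for _ in range(max_steps):
        # Steps 1 and 2 (assumed): the policy reasons, then picks an action.
        thought, action = policy(observation)
        # Step 3 (assumed): a reward model scores the action in context.
        reward = reward_model(observation, thought, action)
        trajectory.append(Step(observation, thought, action, reward))
        observation = transition(observation, action)
    return trajectory


# Toy stand-ins so the sketch runs without a model or a real desktop.
def toy_policy(observation: str) -> Tuple[str, str]:
    if "login" in observation:
        return ("A login button is visible, so click it.", "click:login_button")
    return ("Nothing left to do.", "noop")


def toy_reward_model(observation: str, thought: str, action: str) -> float:
    # Reward actions whose target element appears in the observation.
    target = action.split(":")[-1]
    return 1.0 if target in observation else 0.0


def toy_transition(observation: str, action: str) -> str:
    return "screen: dashboard" if action == "click:login_button" else observation


trajectory = run_episode(
    toy_policy,
    toy_reward_model,
    toy_transition,
    initial_observation="screen: login_button visible",
    max_steps=2,
)
total_reward = sum(step.reward for step in trajectory)
```

In an actual training setup, the per-step rewards collected this way would feed a policy-gradient style update. That part is omitted here because the summary gives no detail on the training stages.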