Benchmarks for Offline Reinforcement Learning
Most of the successes of RL rely closely on repeated on-line interactions of an agent with an setting, which we name on-line RL. Regardless of its success in simulation, the uptake of RL for real-world functions has been restricted. Energy crops, robots, healthcare methods, or self-driving automobiles are costly to run and inappropriate controls can … Read more