policy-gradient-descent

Attribute error in PPO algorithm for Cartpole gym environment

Attribute error in PPO algorithm for Cartpole gym environment Question: I’m trying to run the code from here (Github link on this page): https://keras.io/examples/rl/ppo_cartpole/ I’m getting an attribute error in the training section from observation = observation.reshape(1,-1) which says "’tuple’ object has no attribute ‘reshape’". It seems that observation is currently env.reset() which is a …

Total answers: 1