This project is to design a reinforcement learning algorithm using double Q-learning algorithm with function approximation. Two multi-layer neural networks should be designed as the Q function approximators. The environment should be Acrobot-v1, taken from the OpenAI Gym package.
To be submitted:
- A report in PDF format, including a description of the problem, a description of the neural networks, pseudo-code for the algorithm, performance of the algorithm, training time, lessons learned, and possible improvement.
- Source code in Python format.
The grading of the project will be based on the quality of the report, performance achieved (the final score per episode, and number of episode needed to achieve the performance, and complexity of the neural network, training time), and quality of the code.
Any code taken from the Internet (or peers in the class) will need to be clearly labeled with the source of the code.

![[Solved] EE526 Final Project](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip.jpg)

![[Solved] EE526 Homework 3](https://assignmentchef.com/wp-content/uploads/2022/08/downloadzip-1200x1200.jpg)
Reviews
There are no reviews yet.