[SOLVED] game Java python graph The goal of this Assignment is to expose you to some of the delights of building machine agents for playing video games.

$25

File Name: game_Java_python_graph_The_goal_of_this_Assignment_is_to_expose_you_to_some_of_the_delights_of_building_machine_agents_for_playing_video_games..zip
File Size: 1347.06 KB

5/5 - (1 vote)

The goal of this Assignment is to expose you to some of the delights of building machine agents for playing video games.
For this purpose we will assume the Deadly Corridor task from the VizDoom game engine.
VizDoom is an open source port of the FPS title that provides several different task configurations as well as the death match for which the game is well known. What makes this platform interesting from a learning agent perspective is that the first person perspective renders the task partially observable, whereas most instances of the Atari arcade games provide full observability. The basic challenge of your Assignment is to demonstrate the application of a visual reinforcement learning agent on the specific instance of the Deadly Corridor task.
In order to get you started, we will provide a paper detailing the application of the Tangled Program Graph (TPG) framework for visual reinforcement learning to VizDoom. You will need to familiarize yourself with this reference.
Two code bases for TPG are made available:
Python maintained by Ryan Amaral with some FAQ.
Java maintained by Robert Smith.
o Windows OS compatible Eclipse package: Copy this into your Eclipse workspace folder and import the Assignment. You might also need to change the Java path to match your version of Java.
o Once imported into Java, all the dependencies should be properly set up and you should be able to drop the TPG source into the Assignment. Then update your API execution file and it will be able to see all the correct code/DLL paths without any need for additional setup.
The deadly corridor task requires you to successfully collect the armour in the last of a sequential sequence of 3 rooms connected by short corridors, as per the following figure:

Your agent is spawn in the first room (LHS) and has to successfully pass the opponent agents present in each room in order to finally collect the armour (RHS). Given that you are using a TPG learning agent for this task, you will have to consider how to provide rewards for achieving useful behaviours in this task. One example might be to reward removing opponent agents from the game as well as minimizing distance to the armour. Other factors might include reducing the cost (to character health) of being hit or experimenting with different methods of reproduction.
The baseline behaviour corresponds to an agent that dies in room 2, and is worth a grade of B-.
You need do following task for this Assignment:
Provide your code, and show outcomes from your Assignment.
4 page written report. Such a report needs to summarize the findings of your Assignment detailing what you have learn over the course of the Assignment. View this as an opportunity to pass on some important/pragmatic tricks of the trade and/or caveats you picked up over the course of the Assignment. With this in mind emperical evidence needs to be demonstrated to emphasize the significance of your findings/recommendations.

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Shopping Cart
[SOLVED] game Java python graph The goal of this Assignment is to expose you to some of the delights of building machine agents for playing video games.
$25