CS156 Homework #6: Adversarial Search


The objective of this homework assignment is to understand and visualize how an adversarial search algorithm works. Specifically, we will implement and compare the performance of adversarial search under three different policies:

  1. Random
  2. Minimax
  3. Alpha Beta Pruning

We will explore these algorithms in a Tic Tac Toe environment played on an N x N board. You (the agent) select a square, which is marked with a cross; the opponent then takes her turn, and her move is marked with a circle. Play continues until you win, the opponent wins, or the game ends in a draw.
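
The skeleton below calls into a GameState object supplied with the assignment. Based on the calls that appear in the skeleton, its interface looks roughly like this; a minimal sketch with hypothetical internals, not the course's actual implementation:

class GameState:
    # Hypothetical stand-in for the GameState class shipped with the
    # assignment; only the methods the skeleton below actually calls are shown.

    def __init__(self, size):
        self.size = size                                   # board is size x size
        self.board = [[None] * size for _ in range(size)]

    def available(self, row, col):
        return self.board[row][col] is None                # square still empty?

    def possible_moves(self):
        return [(r, c) for r in range(self.size)
                for c in range(self.size)
                if self.board[r][c] is None]               # all empty squares

    def successor(self, move, player):
        ...  # a copy of this state with player's mark placed on move

    def is_win(self, player):
        ...  # has player completed a row, column or diagonal?

    def is_tie(self):
        ...  # board full with no winner?

    def eval(self):
        ...  # heuristic utility of a non-terminal state (for depth-limited search)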

The assignment consists of the following files:

  1. tictactoe.py: the main program that implements the Tic Tac Toe game with the user interface. It is invoked as:

tictactoe.py depth search_algorithm size

with the defaults (depth=3, search=minimax, size=3), where depth is the depth limit passed to the depth-limited search, search_algorithm is one of rand, minimax or alphabeta, and size is the dimension N of the N x N Tic Tac Toe board.

Example: run tictactoe.py 4 rand 3

  2. py: the module in which the adversarial search policies are implemented. You have to implement:
    1. minimax
    2. alphabeta

Here is the skeleton of the code for your assignment:

"""
Adversarial search algorithms implementation

Your task for this homework is to implement:
  1. minimax
  2. alphabeta
  3. abdl (alpha beta depth limited)
"""

import random
import sys

# Player identifiers; assumed to be the string labels used by GameState.
AI = 'AI'
user = 'user'

def rand(game_state):
    """
    Generate a random move.

    :param game_state: GameState object
    :return: a tuple representing the row, column of the random move
    """
    done = False
    while not done:
        row = random.randint(0, game_state.size - 1)
        col = random.randint(0, game_state.size - 1)
        if game_state.available(row, col):
            done = True
    return row, col

def minimax(game_state):
    """
    Find the best move for our AI agent by applying the minimax algorithm
    (searching the entire tree from the current game state).

    :param game_state: GameState object
    :return: a tuple representing the row, column of the best move
    """
    # One straightforward completion using the value() helper below:
    # pick the move whose successor (with the user to move next) has the
    # highest minimax value.
    return max(game_state.possible_moves(),
               key=lambda move: value(game_state.successor(move, AI), user))

def value(game_state, player):
    """
    Calculate the minimax value for any state under the given agent's control.

    :param game_state: GameState object; state may be terminal or non-terminal
    :param player: (string) user or AI; AI is max
    :return: (integer) value of that state: -1, 0 or 1
    """
    if game_state.is_win(AI):
        return 1
    if game_state.is_win(user):
        return -1
    if game_state.is_tie():
        return 0
    # If the agent is MAX, return max-value
    if player == AI:
        return max_value(game_state)
    # If the agent is MIN, return min-value
    return min_value(game_state)

def max_value(game_state):
    """
    Calculate the minimax value for a non-terminal state under Max's
    control (AI agent).

    :param game_state: non-terminal GameState object
    :return: (integer) value of that state: -1, 0 or 1
    """
    v = -sys.maxsize
    for move in game_state.possible_moves():
        # After Max (AI) moves, it is Min's (the user's) turn.
        v = max(v, value(game_state.successor(move, AI), user))
    return v

def min_value(game_state):
    """
    Calculate the minimax value for a non-terminal state under Min's
    control (user).

    :param game_state: non-terminal GameState object
    :return: (integer) value of that state: -1, 0 or 1
    """
    v = sys.maxsize
    for move in game_state.possible_moves():
        # After Min (the user) moves, it is Max's (the AI's) turn.
        v = min(v, value(game_state.successor(move, user), AI))
    return v
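
As a hedged usage sketch (assuming the hypothetical GameState interface above), the agent's opening move on an empty 3 x 3 board would be obtained with:

state = GameState(3)         # empty 3 x 3 board
row, col = minimax(state)    # exhaustive search from the root
print(row, col)              # one of the optimal opening squares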

def alphabeta(game_state):
    """
    Find the best move for our AI agent by applying the minimax algorithm
    with alpha beta pruning.

    :param game_state: GameState object
    :return: a tuple representing the row, column of the best move
    """
    # One straightforward completion using the ab_value() helper below,
    # starting from the widest possible (alpha, beta) window and tightening
    # alpha as better root moves are found.
    best_move, best_v = None, -sys.maxsize
    alpha, beta = -sys.maxsize, sys.maxsize
    for move in game_state.possible_moves():
        v = ab_value(game_state.successor(move, AI), user, alpha, beta)
        if v > best_v:
            best_move, best_v = move, v
        alpha = max(alpha, best_v)
    return best_move

def ab_value(game_state, player, alpha, beta):
    """
    Calculate the minimax value for any state under the given agent's control
    using alpha beta pruning.

    :param game_state: GameState object; state may be terminal or non-terminal
    :param player: (string) user or AI; AI is max
    :return: (integer) value of that state: -1, 0 or 1
    """
    if game_state.is_win(AI):
        return 1
    if game_state.is_win(user):
        return -1
    if game_state.is_tie():
        return 0
    # If the agent is MAX, return max-value
    if player == AI:
        return abmax_value(game_state, alpha, beta)
    # If the agent is MIN, return min-value
    return abmin_value(game_state, alpha, beta)

def abmax_value(game_state, alpha, beta):
    """
    Calculate the minimax value for a non-terminal state under Max's
    control (AI agent) using alpha beta pruning.

    :param game_state: non-terminal GameState object
    :return: (integer) value of that state: -1, 0 or 1
    """
    a = alpha
    v = -sys.maxsize
    for move in game_state.possible_moves():
        v = max(v, ab_value(game_state.successor(move, AI), user, a, beta))
        if v >= beta:
            return v        # beta cutoff: Min would never allow this branch
        a = max(a, v)
    return v

def abmin_value(game_state, alpha, beta):
    """
    Calculate the minimax value for a non-terminal state under Min's
    control (user) using alpha beta pruning.

    :param game_state: non-terminal GameState object
    :return: (integer) value of that state: -1, 0 or 1
    """
    b = beta
    v = sys.maxsize
    for move in game_state.possible_moves():
        v = min(v, ab_value(game_state.successor(move, user), AI, alpha, b))
        if v <= alpha:
            return v        # alpha cutoff: Max would never allow this branch
        b = min(b, v)
    return v
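
Note the design choice in the two helpers above: each keeps a local copy of the one bound it is allowed to tighten (a for Max, b for Min) and passes the other bound through unchanged, so a cutoff fires as soon as the running value crosses the opponent's guarantee from higher up the tree. This is what makes the node counts in the results below so much smaller than plain minimax.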

def abdl(game_state, depth):
    """
    Find the best move for our AI agent by limiting the alpha beta search to
    the given depth and using the evaluation function game_state.eval().

    :param game_state: GameState object
    :return: a tuple representing the row, column of the best move
    """
    # One straightforward completion using the abdl_value() helper below.
    best_move, best_v = None, -sys.maxsize
    alpha, beta = -sys.maxsize, sys.maxsize
    for move in game_state.possible_moves():
        v = abdl_value(game_state.successor(move, AI), user, alpha, beta, depth - 1)
        if v > best_v:
            best_move, best_v = move, v
        alpha = max(alpha, best_v)
    return best_move

def abdl_value(game_state, player, alpha, beta, depth):
    """
    Calculate the utility for any state under the given agent's control
    using depth limited alpha beta pruning and the evaluation
    function game_state.eval().

    :param game_state: GameState object; state may be terminal or non-terminal
    :param player: (string) user or AI; AI is max
    :return: (integer) utility of that state
    """
    # A win or tie is terminal regardless of whose turn it is.
    if game_state.is_win(AI):
        return 1
    if game_state.is_win(user):
        return -1
    if game_state.is_tie():
        return 0
    # Out of depth budget: fall back on the heuristic evaluation.
    if depth == 0:
        return game_state.eval()
    # If the agent is MAX, return max-value
    if player == AI:
        return abdlmax_value(game_state, alpha, beta, depth)
    # If the agent is MIN, return min-value
    return abdlmin_value(game_state, alpha, beta, depth)

def abdlmax_value(game_state, alpha, beta, depth):
    """
    Calculate the utility for a non-terminal state under Max's control
    using depth limited alpha beta pruning and the evaluation
    function game_state.eval().

    :param game_state: non-terminal GameState object
    :return: (integer) utility (evaluation function) of that state
    """
    a = alpha
    v = -sys.maxsize
    for move in game_state.possible_moves():
        v = max(v, abdl_value(game_state.successor(move, AI), user, a, beta, depth - 1))
        if v >= beta:
            return v        # beta cutoff
        a = max(a, v)
    return v

def abdlmin_value(game_state, alpha, beta, depth):
    """
    Calculate the utility for a non-terminal state under Min's control
    using depth limited alpha beta pruning and the evaluation
    function game_state.eval().

    :param game_state: non-terminal GameState object
    :return: (integer) utility (evaluation function) of that state
    """
    b = beta
    v = sys.maxsize
    for move in game_state.possible_moves():
        v = min(v, abdl_value(game_state.successor(move, user), AI, alpha, b, depth - 1))
        if v <= alpha:
            return v        # alpha cutoff
        b = min(b, v)
    return v
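
abdl is invoked like minimax and alphabeta but with an explicit depth budget; once the budget reaches zero at a non-terminal state, game_state.eval() stands in for the true minimax value. A hedged usage sketch, again assuming the hypothetical GameState above:

row, col = abdl(GameState(5), 3)   # 5 x 5 board, look ahead only 3 plies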

Summarize your observations in terms of the total number of nodes expanded for:

  1. Minimax
  2. Alphabeta
Results:

When run for N = 3 and depth 3:

  1. Minimax: 2,556 nodes
  2. Alphabeta: 558 nodes

We can see a significant improvement in the number of nodes expanded.
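
The node counts come from instrumenting the search. One way to reproduce such counts (a sketch of the idea, not necessarily how the course's driver does it) is a module-level counter bumped once per expanded state:

expanded = 0   # hypothetical module-level counter

def count_expansion():
    global expanded
    expanded += 1

# Call count_expansion() at the top of max_value, min_value, abmax_value
# and abmin_value, then compare the value of `expanded` after running
# minimax(state) versus alphabeta(state) from the same position.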

Bonus points for:

  1. Any other insights from running your code on different values of N and depth.

When run for N = 5 and depth 3:

  1. Minimax: 696,339 nodes
  2. Alphabeta: 43,388 nodes

As we increase the board dimension N and the depth, the program slows down significantly, to the point where it takes several minutes to explore all possible moves. I therefore kept the depth constant at 3 to limit the number of nodes that had to be explored and expanded.
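
This blow-up is expected: an empty N x N board has N^2 opening moves, and the full game tree has up to (N^2)! move sequences, so going from N = 3 (9! = 362,880) to N = 5 (25!, on the order of 10^25) makes exhaustive search hopeless. The depth limit plus the eval() cutoff is what keeps the search tractable, and alpha beta pruning then cuts the remaining work by roughly a factor of 16 (696,339 vs 43,388 nodes), consistent with the N = 3 comparison above.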

  2. Implementing alphabeta with limited depth (the abdl function above).
