人工智能课设——基于A*算法的五子棋博弈系统（Python实现）

一、课设要求

基于A算法的五子棋博弈系统
A)给出用 A算法设计实现五子棋博弈的思想与方法。
B)设计实现五子棋博弈交互系统。
C)对不同的格局，以及不同的初始状态和目标状态，记录 A算法的落棋求解结果。
分析A算法设计实现五子棋博弈的有效性。

二、代码实现

注：该代码在码者编辑此文档时进行过微调，如有bug可评论或私信，感谢。

2.1 原代码

作者：罗WWEEII (luo_wweeii) - Gitee.com
项目链接：gobang_AI: 基于博弈树α-β剪枝搜索的五子棋AI (gitee.com)

因为本人不擅长python，再加上从零开始也很浪费时间，所以直接去gitee上面找的现有代码进行修改。(原作者如有介意请联系删除)

2.2 核心代码

2.2.1 A*算法实现

open表：用于存储待评估的点；closed表：用于记录已经评估过的点；价值：价值越高，越容易获胜；代价：价值取反；

首先，将所有落子的邻接点放入open表中，放入的时候会进行落子评估，获取落子在一个点后的棋局价值；
从open表中取出价值最高的点；
落子判断，判断当前点是否合法、当前点是否在closed表中、当前搜索是否达到搜索深度（DEPTH）；
将当前落子的所有邻接点放入open表，放入前也会进行价值计算；
循环第二步至第四步，直到到达循环出口（open表为空，达到递归深度）；

代码：

defa_star_search():
    open_list =[]# 所有带搜索的点
    closed_set ={}# 记录已经搜索过的点以及其价值评估分数for point in list3:
        neighbors = get_neighbors(point)# 取出所有落子的邻接点for neighbor in neighbors:if neighbor in list_all and neighbor notin list3 and neighbor notin closed_set.keys():# 修改此处检查邻居是否在 closed_set 的键中# 后续两个append()和remove()是为了评估落下评估点后的棋局状态(但实际还未落下)，所以需要先添加进两个list中，之后再删除
                list1.append(neighbor)# 将邻居加入AI列表
                list2.append(neighbor)# 将邻居加入人类落子列表if neighbor notin[node for(_, node)in open_list]:# 检查节点是否已经存在于open列表中
                    heapq.heappush(open_list,(-evaluation(True), neighbor))# 将当前点加入open列表# 从列表中删除刚刚加入的邻居
                list1.remove(neighbor)
                list2.remove(neighbor)ifnot open_list:returnNonewhile open_list:# 在a_star_search函数中修改取出具有最小代价的节点的行为
        min_cost =min(open_list)[0]# 获取当前最小代价
        min_cost_nodes =[node for cost, node in open_list if cost == min_cost]# 找到所有具有最小代价的节点列表
        current_node = random.choice(min_cost_nodes)# 从具有相同最小代价的节点列表中随机选择一个节点
        open_list.remove((min_cost, current_node))# 从open_list中移除选择的节点
        current_cost = min_cost

        if current_node notin closed_set:if current_node notin list3:
                closed_set[current_node]= current_cost  # 记录当前点和评估分数iflen(closed_set)>= DEPTH:# 到达搜索深度
                max_score_node =min(closed_set, key=closed_set.get)# 找到评估分数最大的点(代价最小,即价值最大)return max_score_node

            neighbors = get_neighbors(current_node)for neighbor in neighbors:if neighbor in list_all and neighbor notin list3 and neighbor notin closed_set.keys():# 修改此处检查邻居是否在 closed_set 的键中
                    list1.append(neighbor)# 将邻居加入AI列表
                    list2.append(neighbor)# 将邻居加入列表if neighbor notin[node for(_, node)in open_list]:# 检查节点是否不在open列表中
                        heapq.heappush(open_list,(-evaluation(True), neighbor))# 将节点推入open列表# 从列表中删除刚刚加入的邻居
                    list1.remove(neighbor)
                    list2.remove(neighbor)# 如果搜索完所有可搜索的点时仍未到达搜索深度，则返回评估分数最大的点
    max_score_node =min(closed_set, key=closed_set.get)return max_score_node

2.2.2 评估函数

1> 评估模型

当某一行/列构成评估模型中的状态时，便相应的累加记分。

# 棋型的评估分数,例：落子为* * 1 1 0 0 0 * * 时的棋型得10分 1为计算分数的对象的棋子,0为可落子的空位置
shape_score =[(10,(1,1,0,0,0)),(10,(1,0,1,0,0)),(10,(1,0,0,1,0)),(50,(0,1,1,0,0)),(50,(0,1,0,1,0)),(100,(1,1,0,1,0)),(100,(0,0,1,1,1)),(100,(1,1,1,0,0)),(500,(0,1,0,1,1,0)),(500,(0,1,1,0,1,0)),(2000,(0,1,1,1,0)),(4000,(1,1,1,0,1)),(4000,(1,1,0,1,1)),(4000,(1,0,1,1,1)),(5000,(1,1,1,1,0)),(5000,(0,1,1,1,1)),(100000,(0,1,1,1,1,0)),(99999999,(1,1,1,1,1))]

2> 评估函数

在评估函数中，首先会根据传参，相应的进行待计算棋子列表的初始化，然后计算双方每个落子的每个方向上的得分情况（使用score_all_arr来保证不会对一种得分情况进行重复计算），最后带权相加当前的棋局得分情况。

ratio：进攻的系数；小于1：进攻型、大于1：防守型

# 一个点的价值评估函数defevaluation(is_ai):if is_ai:
        my_list = list1
        enemy_list = list2
    else:
        my_list = list2
        enemy_list = list1

    # 算自己的得分
    score_all_arr =[]# 得分形状的位置,避免重复计算
    my_score =0for pt in my_list:# 计算自己的所有点在四个方向上的得分情况
        m = pt[0]
        n = pt[1]
        my_score += cal_score(m, n,0,1, enemy_list, my_list, score_all_arr)
        my_score += cal_score(m, n,1,0, enemy_list, my_list, score_all_arr)
        my_score += cal_score(m, n,1,1, enemy_list, my_list, score_all_arr)
        my_score += cal_score(m, n,-1,1, enemy_list, my_list, score_all_arr)#  算敌人的得分
    score_all_arr_enemy =[]
    enemy_score =0for pt in enemy_list:
        m = pt[0]
        n = pt[1]
        enemy_score += cal_score(m, n,0,1, my_list, enemy_list, score_all_arr_enemy)
        enemy_score += cal_score(m, n,1,0, my_list, enemy_list, score_all_arr_enemy)
        enemy_score += cal_score(m, n,1,1, my_list, enemy_list, score_all_arr_enemy)
        enemy_score += cal_score(m, n,-1,1, my_list, enemy_list, score_all_arr_enemy)

    total_score = my_score + enemy_score * ratio

    return total_score

3> 得分计算

该函数会根据参数，相应的查找传入点的传入方向上的前后共十一个位置，优先记录该方向上得分最高的相应得分棋型。同时这里还增加了部分特判规则，当待查找的点落入棋盘外时，会调整直接进入下一次循环，避免无效查询。

# 一个方向上的分值计算defcal_score(m, n, x_direct, y_direct, enemy_list, my_list, score_all_arr):
    add_score =0# 加分项# 在一个方向上， 只取最大的得分项
    max_score_shape =(0,None)# 如果此方向上，该点已经有得分形状，不重复计算for item in score_all_arr:for pt in item[1]:if m == pt[0]and n == pt[1]and x_direct == item[2][0]and y_direct == item[2][1]:return0# 在落子点 左右方向上循环查找得分形状for offset inrange(-5,1):# offset = -2
        pos =[]
        found_valid_pos =False# 布尔变量用于记录是否找到有效位置for i inrange(0,6):
            next_pos =(m +(i + offset)* x_direct, n +(i + offset)* y_direct)ifnot(0<= next_pos[0]<= COLUMN and0<= next_pos[1]<= ROW)and i !=5:
                found_valid_pos =True# 如果存在不满足条件的情况，设置布尔变量为 Truebreakif next_pos in my_list:
                pos.append(1)elif next_pos in enemy_list:
                pos.append(2)else:
                pos.append(0)if found_valid_pos:# 检查布尔变量是否为真continue# 如果找到无效位置，跳出内层循环
        tmp_shap5 =(pos[0], pos[1], pos[2], pos[3], pos[4])
        tmp_shap6 =(pos[0], pos[1], pos[2], pos[3], pos[4], pos[5])for(score, shape)in shape_score:if tmp_shap5 == shape or tmp_shap6 == shape:if tmp_shap5 ==(1,1,1,1,1):print('到达赛点咯!')if score > max_score_shape[0]:# 棋型记录
                    max_score_shape =(score,((m +(0+ offset)* x_direct, n +(0+ offset)* y_direct),(m +(1+ offset)* x_direct, n +(1+ offset)* y_direct),(m +(2+ offset)* x_direct, n +(2+ offset)* y_direct),(m +(3+ offset)* x_direct, n +(3+ offset)* y_direct),(m +(4+ offset)* x_direct, n +(4+ offset)* y_direct)),(x_direct, y_direct))# 计算两个形状相交， 如两个3活 相交， 得分增加 一个子的除外（该代码为原作者代码，此处暂做保留，但码者并未详看，所以暂不做注解）if max_score_shape[1]isnotNone:for item in score_all_arr:for pt1 in item[1]:for pt2 in max_score_shape[1]:if pt1 == pt2 and max_score_shape[0]>10and item[0]>10:
                        add_score += item[0]+ max_score_shape[0]

        score_all_arr.append(max_score_shape)return add_score + max_score_shape[0]

2.2.3 难度设置

通过难度设置，可以动态的调整A*算法的搜索深度，以达到更加的搜索结果。

这里通过添加了三个按钮，监听点击事件，当相应的点击到了按钮所在区域后，执行相应逻辑代码，否则保持监听

defset_difficulty(win):global DEPTH  # 搜索深度
    button_width =80
    button_height =20
    button_spacing =100
    button_y = win.getHeight()/2- button_height /2

    buttons =[]
    button_labels =["简单","中等","高级"]
    depths =[1,(ROW +1)*(COLUMN +1)*0.3,(ROW +1)*(COLUMN +1)]
    button_x1 =(win.getWidth()-3* button_spacing)/2for label, depth inzip(button_labels, depths):# 显示按钮
        button_x2 = button_x1 + button_width
        button = Rectangle(Point(button_x1, button_y), Point(button_x2, button_y + button_height))
        button.setFill('light gray')
        button_text = Text(Point((button_x1 + button_x2)/2, button_y + button_height /2), label)
        button.draw(win)
        button_text.draw(win)
        buttons.append((button, button_text, depth))
        button_x1 += button_spacing

    clicked =Falsewhilenot clicked:
        click_point = win.getMouse()
        x = click_point.getX()
        y = click_point.getY()for button, _, depth in buttons:# 遍历所有难度按钮
            button_x1, button_x2 = button.getP1().getX(), button.getP2().getX()
            button_y1, button_y2 = button.getP1().getY(), button.getP2().getY()if button_x1 < x < button_x2 and button_y1 < y < button_y2:# 判断是否点击的当前按钮
                DEPTH = depth
                clicked =Truebreakif210<= x <=290and10<= y <= EXTRA_HEIGHT -10:# 玩家点击了退出按钮
            win.close()returnTruefor button, button_text, _ in buttons:
        button.undraw()# 删除按钮
        button_text.undraw()# 删除按钮文本returnFalse

2.2.4 重置游戏

重置游戏的逻辑就是清空所有数据，重新初始化相关图形化界面。

defreset_game(win):# 清空棋盘列表和图形
    list1.clear()
    list2.clear()
    list3.clear()for item in win.items[:]:
        item.undraw()# 重新绘制额外部分
    extra_part = Rectangle(Point(0,0), Point(COLUMN * GRID_WIDTH, EXTRA_HEIGHT))
    extra_part.setFill('light blue')
    extra_part.draw(win)# 重新绘制按钮
    reset_button = Rectangle(Point(10,10), Point(90, EXTRA_HEIGHT -10))
    reset_button.setFill('light gray')
    reset_button.draw(win)
    reset_text = Text(Point(50, EXTRA_HEIGHT /2),"重置游戏")
    reset_text.draw(win)

    undo_button = Rectangle(Point(110,10), Point(190, EXTRA_HEIGHT -10))
    undo_button.setFill('light gray')
    undo_button.draw(win)
    undo_text = Text(Point(150, EXTRA_HEIGHT /2),"悔棋")
    undo_text.draw(win)

    quit_button = Rectangle(Point(210,10), Point(290, EXTRA_HEIGHT -10))
    quit_button.setFill('light gray')
    quit_button.draw(win)
    quit_text = Text(Point(250, EXTRA_HEIGHT /2),"退出游戏")
    quit_text.draw(win)# 重新绘制棋盘的水平线条for j inrange(ROW +1):
        line_horizontal = Line(Point(0, j * GRID_WIDTH + EXTRA_HEIGHT),
                               Point(COLUMN * GRID_WIDTH, j * GRID_WIDTH + EXTRA_HEIGHT))
        line_horizontal.draw(win)# 重新绘制棋盘的垂直线条for i inrange(COLUMN +1):
        line_vertical = Line(Point(i * GRID_WIDTH, EXTRA_HEIGHT),
                             Point(i * GRID_WIDTH, ROW * GRID_WIDTH + EXTRA_HEIGHT))
        line_vertical.draw(win)return set_difficulty(win)

2.2.5 悔棋

悔棋通过动态的删除AI和人类相关list最顶上的元素，从而实现悔棋的效果。

defundo_move(win):global move_hint
    global player_wins

    # 移除当前的红色圆圈（如果存在）if move_hint:
        move_hint.undraw()if list1 andnot player_wins:
        last_ai_move = list1.pop()
        list3.remove(last_ai_move)
        x, y = last_ai_move
        for item in win.items[:]:ifisinstance(item, Circle):if item.getCenter().getX()== x * GRID_WIDTH and item.getCenter().getY()== y * GRID_WIDTH + EXTRA_HEIGHT:
                    item.undraw()breakif list2:
        last_human_move = list2.pop()
        list3.remove(last_human_move)
        x, y = last_human_move
        for item in win.items[:]:ifisinstance(item, Circle):if item.getCenter().getX()== x * GRID_WIDTH and item.getCenter().getY()== y * GRID_WIDTH + EXTRA_HEIGHT:
                    item.undraw()break# 如果还有AI的落子记录，更新红色圆圈的位置if list1:
        last_ai_move = list1[-1]
        x, y = last_ai_move
        move_hint = Circle(Point(GRID_WIDTH * x, GRID_WIDTH * y + EXTRA_HEIGHT),16)
        move_hint.setOutline('red')
        move_hint.setWidth(3)
        move_hint.draw(win)
    player_wins =False

2.2.6 落子提示

落子提示通过记录一个全局变量，动态记录当前需要进行提示的棋子，主要的提示显示与提示撤销逻辑在主函数中。但是需要注意，在悔棋的函数中也需要动态的对该全局变量进行修改。

# 移除上一个玩家的落子提示圆圈（如果有的话）if move_hint:
    move_hint.undraw()# 创建新的落子并设置为落子提示
piece = Circle(Point(GRID_WIDTH * pos[0], GRID_WIDTH * pos[1]+ EXTRA_HEIGHT),16)
piece.setFill('white')
piece.draw(win)
move_hint = Circle(Point(GRID_WIDTH * pos[0], GRID_WIDTH * pos[1]+ EXTRA_HEIGHT),16)# 创建落子提示圆圈
move_hint.setOutline('red')# 设置提示圆圈颜色为红色
move_hint.setWidth(3)# 设置提示圆圈的线宽
move_hint.draw(win)# 绘制落子提示圆圈

defundo_move(win):global move_hint
    global player_wins
    
    # 移除当前的红色圆圈（如果存在）if move_hint:
        move_hint.undraw()# 如果还有AI的落子记录，更新红色圆圈的位置if list1:
        last_ai_move = list1[-1]
        x, y = last_ai_move
        move_hint = Circle(Point(GRID_WIDTH * x, GRID_WIDTH * y + EXTRA_HEIGHT),16)
        move_hint.setOutline('red')
        move_hint.setWidth(3)
        move_hint.draw(win)

2.3 总体流程

首先创建一个游戏对象，该对象用于后续的相关函数传参，然后进行相应变量的初始化。随后进入main中的主要循环，该循环使用变量change来记录当前应该哪一方进行落子，在相应的落子逻辑中会先进行游戏胜利的判断：若游戏胜利则弹出弹框，反之进行落子显示。其中还包含了不少的逻辑分支，在此不进行一一列举，详见注解。

defmain():
    win = gomokuwin()# 游戏对象for i inrange(COLUMN +1):for j inrange(ROW +1):
            list_all.append((i, j))# list_all表示所有可进行落子的点if set_difficulty(win):# 初始时先进行难度设置，返回值为真时表示点击了退出游戏return

    change =0# 表示当前的落子方global move_hint  # 落子提示while1:if change %2==1:# 如果玩家或AI已经获胜，则跳过当前回合if game_win(list1)or game_win(list2):
                change +=1continue# 调用AI算法找到下一步落子位置
            pos = a_star_search()# 如果落子位置为 None，表示没有可落子的位置，游戏平局（该分支未测试）if pos isNone:
                messagebox.showinfo("游戏结束","平局！")
                change +=1break# 移除上一个玩家的落子提示圆圈（如果存在的话）if move_hint:
                move_hint.undraw()# 将AI的落子位置加入列表中
            list1.append(pos)
            list3.append(pos)# 创建新的落子并设置为落子提示
            piece = Circle(Point(GRID_WIDTH * pos[0], GRID_WIDTH * pos[1]+ EXTRA_HEIGHT),16)
            piece.setFill('white')
            piece.draw(win)
            move_hint = Circle(Point(GRID_WIDTH * pos[0], GRID_WIDTH * pos[1]+ EXTRA_HEIGHT),16)# 创建落子提示圆圈
            move_hint.setOutline('red')# 设置提示圆圈颜色为红色
            move_hint.setWidth(3)# 设置提示圆圈的线宽
            move_hint.draw(win)# 绘制落子提示圆圈# 如果AI获胜，则显示游戏结束信息if game_win(list1):
                messagebox.showinfo("游戏结束","你输了！")
            change +=1else:
            click_point = win.getMouse()# 获得当前鼠标点击的点坐标
            x = click_point.getX()
            y = click_point.getY()# 检查点击位置是否在按钮区域内if10<= x <=90and10<= y <= EXTRA_HEIGHT -10:# 重置游戏if reset_game(win):returncontinueelif110<= x <=190and10<= y <= EXTRA_HEIGHT -10:# 悔棋
                undo_move(win)continueelif210<= x <=290and10<= y <= EXTRA_HEIGHT -10:# 退出游戏
                win.close()return# 如果点击事件在棋盘区域内且位置合法(落子点未存在棋子)elif y > EXTRA_HEIGHT andnot((round(x / GRID_WIDTH),round((y - EXTRA_HEIGHT)/ GRID_WIDTH))in list3)andnot game_win(
                list1)andnot game_win(list2):# 记录棋子
                a2 =round(x / GRID_WIDTH)
                b2 =round((y - EXTRA_HEIGHT)/ GRID_WIDTH)
                list2.append((a2, b2))
                list3.append((a2, b2))# 移除上一个玩家的落子提示圆圈（如果有的话）if move_hint:
                    move_hint.undraw()# # 创建新的落子并设置为落子提示
                piece = Circle(Point(GRID_WIDTH * a2, GRID_WIDTH * b2 + EXTRA_HEIGHT),16)
                piece.setFill('black')
                piece.draw(win)

                move_hint = Circle(Point(GRID_WIDTH * a2, GRID_WIDTH * b2 + EXTRA_HEIGHT),16)# 创建落子提示圆圈
                move_hint.setOutline('red')# 设置提示圆圈颜色为红色
                move_hint.setWidth(3)# 设置提示圆圈的线宽
                move_hint.draw(win)# 绘制落子提示圆圈# 如果玩家获胜，则显示游戏结束信息if game_win(list2):
                    messagebox.showinfo("游戏结束","你赢了！")global player_wins
                    player_wins =True
                change +=1
    win.close()

2.4 效果实现

这里附上部分游戏运行截图：

开始游戏：

落子：

游戏结束：
游戏获胜

游戏结束后仅支持点击上方的三个按钮，直接关闭窗口会报错，因为没有按照正常流程进行操作，后台线程还在运行。游戏结束后还可以直接悔棋重寻神之一手。

三、源码链接

代码地址：人工智能课设——基于A*算法的五子棋博弈系统（Python实现） (gitee.com)

直接拉取下来大概率跑不了（环境问题），建议自己本地创建python环境，然后将两个主要的文件（ gomoku.py、graphics.py）拷贝到自己的项目中运行

不熟悉gitee的可按如下操作：

点击链接进入gitee相关页面
点击“克隆/下载”
点击下载ZIP

标签：人工智能算法 python

本文转载自: https://blog.csdn.net/qq_64928997/article/details/139594729
版权归原作者 汪汪小乌拉 所有，如有侵权，请联系我们删除。