About 26,400 results
Open links in new tab
  1. What is the significance of move 37? (to a non go player)

    Feb 26, 2023 · 1 I have seen (and googled) information for Game 2, Move 37 in the AlphaGo vs. Lee Sedol match However it is difficult to find information concerning this move that doesn't …

  2. Did Alphago zero actually beat Alphago 100 games to 0?

    Oct 21, 2020 · Background: Alphago was the first superhuman go player, but it had human tuning and training. AlphaGo zero learned to be more superhuman than superhuman. Its supremacy …

  3. terminology - AlphaGo Zero MCTS a misnomer? - Artificial …

    Dec 19, 2024 · AlphaGo Zero utilizes an algorithm it calls "Monte Carlo Tree Search" in the papers, but it seems like the algorithm is fully deterministic since it doesn't utilize random …

  4. Difference between AlphaGo's policy network and value network

    Mar 29, 2016 · If anyone else stumbles upon this old question, like me, you'll be pleased to know that AlphaGo's successor, "AlphaGo Zero", as well as its successor "AlphaZero" do indeed get …

  5. deep learning - What is the input to AlphaGo's neural network ...

    Jun 8, 2020 · AlphaGo Zero only uses the black and white stones from the Go board as its input, whereas previous versions of AlphaGo included a small number of hand-engineered features. …

  6. Newest 'alphago' Questions - Artificial Intelligence Stack Exchange

    For questions related to DeepMind's AlphaGo, which is the first computer Go program to beat a human professional Go player without handicaps on a full-sized 19x19 board. AlphaGo was …

  7. Why AlphaGo didn't use Deep Q-Learning?

    Apr 29, 2020 · In the previous research, in 2015, Deep Q-Learning shows its great performance on single player Atari Games. But why do AlphaGo's researchers use CNN + MCTS instead of …

  8. How does Alpha Go Zero MCTS work in parallel?

    Sep 25, 2023 · To understand how AlphaGo Zero performs parallel simulations think of each simulation as a separate agent that interacts with the search tree. Each agent starts from the …

  9. Why is Monte Carlo used as the tree search algorithm for AlphaGo?

    Apr 9, 2019 · The paper that introduced AlphaGo, Mastering the game of Go with deep neural networks and tree search, motivates the use of MCTS Monte Carlo tree search (MCTS) uses …

  10. policies - What kind of reinforcement learning method does …

    Dec 23, 2020 · 2 In reinforcement learning, there are model-based versus model-free methods. Within model-based ones, there are policy-based and value-based methods. AlphaGo …