DeepMind 的 Player of Games

前幾天在 Hacker News Daily 上看到的消息，DeepMind 發了一篇新的論文，講 Player of Games 這個新的演算法：「Player of Games」，Hacker News 上的討論在這：「Player of Games (arxiv.org)」。

照留言上的討論，Player of Games 的名字由來應該是取自科幻小說《The Player of Games》。

這是一個更一般性的演算法，可以同時駕馭 perfect information 與 imperfect information：

We introduce Player of Games, a general-purpose algorithm that unifies previous approaches, combining guided search, self-play learning, and game-theoretic reasoning. Player of Games is the first algorithm to achieve strong empirical performance in large perfect and imperfect information games -- an important step towards truly general algorithms for arbitrary environments.

論文裡面也提到以前的各種演算法 (包含 DeepMind 自家的一些演算法)。在 perfect information 的例子來說，可以看到沒有 AlphaZero 強 (西洋棋與圍棋)，但也已經有一定水準了，算是個起頭的感覺：

主要的成就在於一般性，但論文後面也有提到，目前這個演算法需要的資源還是過大，還有改善的空間...

Git 支援其他 Hash 演算法的進展

Git 用 SHA-1，而 SHA-1 又破的問題使得 Git 開始計畫其他 hash algorithm (「Google 與 CWI Amsterdam 合作，找到 SHA-1 第一個 collision」)。在「"uchar [40]" to "struct object_id" conversion continues.」這邊可以看到一些動作，先把本來的 uchar[40] 換成一般性的 struct object_id。 Hacker News 上的「The beginning of Git supporting other hash algorithms」也有一些討論可以看。

March 19, 2017

In "Computer"

用 Machine Learning 改善 Streaming 品質的服務與論文

在 Hacker News 上看到「Puffer」這個服務，裡面利用了 machine learning algorithm 動態調整 bitrate，以提昇傳輸品質。測試得到的數據後來被整理起來一起放進論文：「Continual learning improves Internet video streaming」。在開頭介紹了 Fugu 這個演算法： We describe Fugu, a continual learning algorithm for bitrate selection in streaming video. 而 Puffer 就是實驗網站： We evaluate Fugu with Puffer, a public website we built that streams live TV using Fugu…

July 26, 2019

In "Computer"

QOI 圖片無損壓縮演算法

在 Hacker News Daily 上看到「Lossless Image Compression in O(n) Time」這篇，作者丟出了一個圖片的無損壓縮演算法，壓縮與解壓縮的速度超快，但壓縮率又不輸 PNG 太多，在 Hacker News 上的討論也可以看一下：「QOI: Lossless Image Compression in O(n) Time (phoboslab.org)」。裡面有提到在遊戲產業常用到的 stb_image.h： Yes, stb_image saved us all from the pains of dealing with libpng and is therefore used in countless games and apps. A while ago I aimed…

November 27, 2021

In "Computer"

Author Gea-Suan LinPosted on December 15, 2021Categories Computer, Murmuring, ProgrammingTags algorithm, deepmind, games, imperfect, information, learning, machine, of, perfect, player, pog

DeepMind 的 Player of Games

DeepMind 的 Player of Games

Related

Git 支援其他 Hash 演算法的進展

用 Machine Learning 改善 Streaming 品質的服務與論文

QOI 圖片無損壓縮演算法

Leave a Reply Cancel reply

Post navigation

Recommend

'Ameca' robot shows off more human-like facial expressions

all: gofmt -w -r 'interface{} -> any' src · golang/go@2580d0e · GitHub

Smart Suggestions vs Personal Recommendations: Why We Should Lose the Human Touc...

eBay’s Mavenization Strategy of Legacy Domain Business Libraries

Careers | DevCycle

好温馨～“糙汉”插画家带着新作回归了，鸡飞狗跳的二胎生活，笑中带泪

Using Free Wordpress Security Scanner - WPSeku | ComputingForGeeks

数据：以太坊近24小时销毁6931枚ETH

这是我见过最 ——— 长的LOGO

Install OpenProject on CentOS 8|Rocky Linux 8|AlmaLinux 8

About Joyk