Chess contingency tables

From Computational Statistics Course Wiki

The file here contains three pieces of information about each of 100,000 real chess games: the first move taken by white, the first move taken by black, and the eventual outcome of the game ((W)hite wins, (B)lack wins, or (D)raw). Each game is on its own line, with the three fields separated by tabs. For example, the first 10 lines of the file look like:

d4 Nf6 B
d4 Nf6 D
c4 Nf6 W
e4 g6 B
e4 Nf6 W
e4 g6 D
e4 c6 D
e4 c6 B
e4 c6 D
e4 e6 B
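
As a starting point, here is a minimal sketch of how these lines could be parsed in Python. It uses only the ten sample lines above, so the counts it prints say nothing about the full data set; the names `SAMPLE` and `read_games` are made up for illustration and are not part of the course materials.

```python
import io
from collections import Counter

# The ten sample lines shown above, with tabs between the fields.
SAMPLE = (
    "d4\tNf6\tB\n" "d4\tNf6\tD\n" "c4\tNf6\tW\n" "e4\tg6\tB\n" "e4\tNf6\tW\n"
    "e4\tg6\tD\n" "e4\tc6\tD\n" "e4\tc6\tB\n" "e4\tc6\tD\n" "e4\te6\tB\n"
)

def read_games(lines):
    """Yield (white_move, black_move, outcome) tuples, one per game."""
    for line in lines:
        fields = line.split()  # splits on tabs (and any other whitespace)
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        yield tuple(fields)

games = list(read_games(io.StringIO(SAMPLE)))

# Tally white's first move and the outcomes as a quick sanity check.
white_moves = Counter(white for white, _, _ in games)
outcomes = Counter(outcome for _, _, outcome in games)
print(len(games), dict(white_moves), dict(outcomes))
```

To read the real file instead, pass an open file object to `read_games` in place of the `io.StringIO` wrapper.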

For an explanation of the syntax for describing moves, see the Wikipedia article on algebraic chess notation.

Using this data, answer as many of the following questions as you can:

1. Is there a significant association between the type of piece white chooses to move first and the outcome of the game?

2. Is there a significant association between the pair of opening moves taken together and the outcome of the game?

3. If white's first move is e4 (pawn to e4), is there a significant association between black's response and the outcome of the game?
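
One possible way to approach all three questions is sketched below: build a contingency table from (category, outcome) pairs, compute Pearson's chi-square statistic, and estimate a p-value by shuffling the outcomes (a permutation test). This is only one option among several, not a prescribed solution. The function names `chi_square`, `permutation_p_value` and `piece_type` are invented for this sketch, and the ten games are the sample lines shown above, so the printed numbers are meaningless until the full file is used.

```python
import random
from collections import Counter

def chi_square(pairs):
    """Pearson chi-square statistic and degrees of freedom for (row, col) pairs."""
    n = len(pairs)
    cell = Counter(pairs)
    rows = Counter(r for r, _ in pairs)
    cols = Counter(c for _, c in pairs)
    stat = 0.0
    for r in rows:
        for c in cols:
            expected = rows[r] * cols[c] / n  # count expected under independence
            stat += (cell[(r, c)] - expected) ** 2 / expected
    return stat, (len(rows) - 1) * (len(cols) - 1)

def permutation_p_value(pairs, n_perm=2000, seed=0):
    """Estimate P(statistic >= observed) by shuffling the column labels."""
    rng = random.Random(seed)
    observed, _ = chi_square(pairs)
    row_labels = [r for r, _ in pairs]
    col_labels = [c for _, c in pairs]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(col_labels)  # breaks any row/column association
        stat, _ = chi_square(list(zip(row_labels, col_labels)))
        if stat >= observed - 1e-12:  # small tolerance for float rounding
            hits += 1
    return (hits + 1) / (n_perm + 1)

def piece_type(move):
    # In algebraic notation a leading "N" marks a knight move; on the very
    # first move only pawns and knights are able to move.
    return "knight" if move.startswith("N") else "pawn"

# Illustration only: the ten sample games listed above.
games = [
    ("d4", "Nf6", "B"), ("d4", "Nf6", "D"), ("c4", "Nf6", "W"), ("e4", "g6", "B"),
    ("e4", "Nf6", "W"), ("e4", "g6", "D"), ("e4", "c6", "D"), ("e4", "c6", "B"),
    ("e4", "c6", "D"), ("e4", "e6", "B"),
]

q1 = [(piece_type(w), result) for w, b, result in games]      # question 1
q2 = [((w, b), result) for w, b, result in games]             # question 2
q3 = [(b, result) for w, b, result in games if w == "e4"]     # question 3

for name, pairs in [("q1", q1), ("q2", q2), ("q3", q3)]:
    stat, df = chi_square(pairs)
    print(name, round(stat, 3), df, permutation_p_value(pairs, n_perm=500))
```

The permutation p-value is used here because many opening pairs are rare, which makes the usual chi-square approximation questionable for sparse tables. On the full 100,000 games this plain-Python loop will be slow, so a smaller `n_perm` or a vectorized implementation may be needed.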

If time permits, come up with a way to visualize these associations.
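
One simple starting point for a visualization, offered here only as a sketch, is a table of Pearson residuals, (observed - expected) / sqrt(expected), for each cell of the contingency table. Large positive or negative values show which cells drive an association, and the same numbers could later be used to color a heatmap or mosaic plot. The names `pearson_residuals` and `print_residual_table` are invented for this example, and the data are just the seven sample games above that open with e4.

```python
from collections import Counter
from math import sqrt

def pearson_residuals(pairs):
    """Map each (row, col) cell to (observed - expected) / sqrt(expected)."""
    n = len(pairs)
    cell = Counter(pairs)
    rows = Counter(r for r, _ in pairs)
    cols = Counter(c for _, c in pairs)
    residuals = {}
    for r in rows:
        for c in cols:
            expected = rows[r] * cols[c] / n  # count expected under independence
            residuals[(r, c)] = (cell[(r, c)] - expected) / sqrt(expected)
    return residuals

def print_residual_table(pairs):
    """Print the residuals as a plain-text table, one row per category."""
    res = pearson_residuals(pairs)
    row_names = sorted({r for r, _ in res})
    col_names = sorted({c for _, c in res})
    print("reply".ljust(6) + "".join(c.rjust(8) for c in col_names))
    for r in row_names:
        print(r.ljust(6) + "".join(f"{res[(r, c)]:+8.2f}" for c in col_names))

# Black's reply and the outcome for the sample games that start with e4.
e4_games = [
    ("g6", "B"), ("Nf6", "W"), ("g6", "D"), ("c6", "D"),
    ("c6", "B"), ("c6", "D"), ("e6", "B"),
]
print_residual_table(e4_games)
```

The squared residuals sum to the chi-square statistic, so this table shows how each cell contributes to the overall test.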