Fri Apr 20

From Computational Statistics (CSE383M and CS395T)
Jump to navigation Jump to search

The file here consists of three pieces of information about each of 100,000 (real) chess games: the first move taken by white, the first move taken by black, and the eventual winner of the game ((W)hite, (B)lack, or (D)raw). Each game is on a different, tab-separated line. For example, the first 10 lines of the file look like:

d4 Nf6 B
d4 Nf6 D
c4 Nf6 W
e4 g6 B
e4 Nf6 W
e4 g6 D
e4 c6 D
e4 c6 B
e4 c6 D
e4 e6 B

For an explanation of the syntax for describing moves, see wikipedia.

Using this data, answer as many of the following questions as you can:

1. Is there a significant association between the type of piece white chooses to move first and the outcome of the game?

2. Is there a significant association between the pair of opening moves taken together and the outcome of the game?

3. If white's first move is (pawn to e4), is there a significant association between black's response and the outcome of the game?

If time permits, come up with a way to visualize these associations.