The funny thing is, I have some suspicions that the 2011 Open dataset might be better than the 2012 dataset for the height/weight analyses. Why? Part of it is the registration process. In 2011, when athletes registered, they were asked immediately for their height and weight. In 2012, these questions have been eliminated, although folks can enter this information freely in their profile. As a result though, I expect that volunteered information for height/weight will fall off dramatically. Who knows though, with how big the Open is getting maybe it won't matter?
Click here for a .csv file of the 2011 Open dataset, courtesy of work done by Greg Perkins. The dataset includes all athletes, including athletes that did not finish all six workouts. The column headers should be:
- athlete ID,nameURL,age,sex&division,height,weight, overall-points,overall-rank, score1,rank1, score2, rank2, score3,rank3, score4,rank4, score5,rank5, score6,rank6
Click here for some helpful matlab scripts, including one that breaks the overall dataset into separate structures for each competition category.
And here for some information about the .csv file and descriptions of the matlab scripts.