So as not to overwhelm anyone, we will be releasing the data in three waves. Today’s launch allows people to register and download the first instalment, which includes enough data for people to start trying out models. It includes claims data from Y1, information on members and the details of hospitalizations recorded in Y2.
The next instalment will be released on May 4 and will involve the release a more comprehensive dataset, including claims for later years as well as the test dataset against which entries will be judged. It is at this point that we will open up the competition to entries, reveal the performance threshold and begin posting the leaderboard.
Finally, the last release happens on June 4 and will include some ancillary data of prescriptions and lab tests.
members don’t sign up again. To register, simply login and accept the rules before downloading the data.
Finally the Twitter hashtag for the competition is #drflix. Help spread the word.