This is a dataset collected during April-May 2009. It contains the following two representative samples of Facebook users with a few annotated properties:
- MHRW: A sample of 957,000 unique users obtained Facebook-wide by Metropolis-Hastings random walks, which is shown to closely approximate the ground truth.
- UNI: A uniform sample of 984,000 unique users that represents the ground truth. User IDs were selected uniformly at random from the platform's 32-bit ID space—essentially a rejection sampling procedure.
For each dataset, we release two files. The first file contains for each sampled user ID, the number of times the user was sampled and the user IDs of his/her friends. The second file contains additional node properties for each sampled user. For each sampled user ID we have the number of times sampled, the total number of friends, privacy settings and network membership. User IDs and network IDs are anonymized.
Submitted by Minas Gjoka
Added 22 September 2010