10 Group Data

This chapter is outdated. We need to add the pcn groupdata explanation.

10.1 Preparing the data

Once the Group Data is received from the regional focal point of the Poverty GP, It has to be copied and pasted into a text file. Two columns of data must be pasted into the text file; one for weight and one for welfare, which may come in three different varieties and are clearly explain the Povcalnet Website. For the case of China 2016, which is the database used as example in this text, data comes in type 5 format,8 which means that the weight variable is the share of the population in a given quantile and the welfare variable is the mean of welfare therein. If the data comes broken down by urban and rural population, it is necessary to create one text file for each distribution.

10.2 Naming convention

Text files must be saved with the following naming convention,

CCCgXX.T0F

where the only literal piece of the naming convention is T0. All the other are as follows:

  • CCC refers to the country code. However, the name of the country has been used in the past.
  • g refers to the geographic area: r for rural, u for urban, and n for national
  • XX stands for the last two digits of the corresponding year of the survey. For example, if the year is 2016, then XX would be 16.
  • F refers to the type formatting of the data. In the case of China 2016 it would be 5.

Make sure the text does not have the extension txt. In software used to save the text file includes the extension txt by default, make sure to remove it manually.

So, the final name for rural China 2016 would be CHNr16.T05