London School of Economics (LSE)
EOPP: Economic Organisation and Public Policy Programme

Dataset Summary

The datasets are in Stata 8 SE format, except for the administrative data, which is in MS Excel format.

The administrative data contains the state, district, block, Gram Panchayat, and population for every village in the sample. The unit of observation is the village, identified uniquely by "keyid". It also contains the block pair information: every block in the sample, except Chittoor, is paired with another block based on historical (belonging to the same former princely state) and cultural (similar languages spoken) data. In the dataset, the blocks with the same "pair_no" form a pair.
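The pairing rule can be illustrated with a minimal Python sketch. The rows below are invented for illustration and are not taken from the dataset. Only "keyid" and "pair_no" are variable names given in this summary. The column name "block", the block names, and the use of a missing "pair_no" for the unpaired Chittoor block are assumptions.

```python
from collections import defaultdict

# Hypothetical rows standing in for the administrative sheet (one row per village).
# "keyid" and "pair_no" are named in the summary; "block" is an assumed column name.
villages = [
    {"keyid": 1, "block": "Block A", "pair_no": 1},
    {"keyid": 2, "block": "Block B", "pair_no": 1},
    {"keyid": 3, "block": "Block A", "pair_no": 1},
    # Assumption: the unpaired block is coded with a missing pair_no.
    {"keyid": 4, "block": "Chittoor", "pair_no": None},
]


def block_pairs(rows):
    """Map each pair_no to the sorted list of distinct blocks that share it."""
    pairs = defaultdict(set)
    for row in rows:
        if row["pair_no"] is not None:  # skip blocks without a pair
            pairs[row["pair_no"]].add(row["block"])
    return {pair_no: sorted(blocks) for pair_no, blocks in pairs.items()}


pairs = block_pairs(villages)
print(pairs)  # {1: ['Block A', 'Block B']}
```

Because the data is at village level, each block appears once per village, so the sketch collects distinct block names before reporting the pairs.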

The household data is a household survey of a random sample of 20 households from each sampled village. The unit of observation is the household, identified by "hh_srl". The village is identified by "keyid". In general, "keyid" allows for merges across datasets at village level, and "gp_keyid" allows for merges at Gram Panchayat level.
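A village-level merge on "keyid" is many-to-one: many households map to one village record. The following is a minimal sketch in plain Python, using invented records. Only "keyid" and "hh_srl" are names from this summary. The column name "population", the values, and the helper merge_many_to_one are hypothetical. Whether "hh_srl" is unique on its own or only within a village is not stated here, so the sketch does not rely on it.

```python
# Hypothetical village-level records (e.g. from the administrative data).
villages = [
    {"keyid": 101, "population": 1200},
    {"keyid": 102, "population": 850},
]

# Hypothetical household-level records; several households share one keyid.
households = [
    {"keyid": 101, "hh_srl": 1},
    {"keyid": 101, "hh_srl": 2},
    {"keyid": 102, "hh_srl": 3},
]


def merge_many_to_one(many, one, key):
    """Attach the single matching `one` record to every `many` record."""
    lookup = {}
    for row in one:
        if row[key] in lookup:
            # The "one" side must identify each unit uniquely.
            raise ValueError(f"duplicate {key}={row[key]!r} on the 'one' side")
        lookup[row[key]] = row
    merged = []
    for row in many:
        if row[key] not in lookup:
            raise KeyError(f"no match for {key}={row[key]!r}")
        # Village fields first, household fields second.
        merged.append({**lookup[row[key]], **row})
    return merged


merged = merge_many_to_one(households, villages, "keyid")
for row in merged:
    print(row)
```

The same pattern would apply to Gram Panchayat level merges by passing "gp_keyid" as the key. The uniqueness check on the village side guards against silently duplicating households when the key is not unique.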

The Gram Pradhan data is a household survey of the Gram Pradhan, Vice-Pradhan, and ward members in every sampled village. For a small number of villages, the Pradhan was not available for interview; for a small number of others, more than one Pradhan was interviewed in the same village. Together, these two situations account for 10 Gram Panchayats for which Gram Pradhan information is not available. Most of the questions overlap with those in the household data, but there are also some additional questions. The unit of observation is the household.

The PRA (Participatory Rapid Appraisal) data is a village-level survey. Section 1 contains caste-level information for every village; the unit of observation is the caste within the village. Section 2 contains the problems the village is facing, as described by different groups of villagers (men, women, SC/ST, etc.); the unit of observation is the group within the village. For sections 3 through 7, the unit of observation is the village.

The facilities data is a village level dataset containing information about the availability and quality of facilities in the village. The unit of observation is the village.

The facilities (new round) data has further village-level information about the availability and quality of facilities in the village. In addition, it has information about the quality of sanitation in the village.