ETH_2020_HFPS_v01_EN_M_v01_A_OCS
COVID-19 High Frequency Phone Survey of Households 2020
Name | Country code |
---|---|
Ethiopia | ETH |
Socio-Economic/Monitoring Survey [hh/sems]
The World Bank is providing support to countries to help mitigate the spread and impact of the new coronavirus disease (COVID-19). One area of support is for data collection to inform evidence-based policies that may help mitigate the effects of this disease. Towards this end, the World Bank is leveraging the Living Standards Measurement Study - Integrated Survey on Agriculture (LSMS-ISA) program to implement high-frequency phone surveys on COVID-19 in 5 African countries - Nigeria, Ethiopia, Uganda, Tanzania, and Malawi. This effort is part of a broader first wave of World Bank-supported national longitudinal high frequency survey that can be used to help assess the economic and social implications of the COVID-19 pandemic on households and individuals.
The potential impacts of the COVID-19 pandemic in Ethiopia are expected to be severe on Ethiopian households' welfare. To monitor these impacts on households, the team selected a subsample of households that had been interviewed for the Living Standards Measurement Study (LSMS) in 2019, covering urban and rural areas in all regions of Ethiopia. The 15-minute questionnaire covers a series of topics, such as knowledge of COVID and mitigation measures, access to routine healthcare as public health systems are increasingly under stress, access to educational activities during school closures, employment dynamics, household income and livelihood, income loss and coping strategies, and external assistance.
The survey is implemented using Computer Assisted Telephone Interviewing, using a modular approach, which allows for modules to be dropped and/or added in different waves of the survey. Survey data collection started at the end of April 2020 and households are called back every three to four weeks for a total of seven survey rounds to track the impact of the pandemic as it unfolds and inform government action. This provides data to the government and development partners in near real-time, supporting an evidence-based response to the crisis.
The sample of households was drawn from the sample of households interviewed in the 2018/2019 round of the Ethiopia Socioeconomic Survey (ESS). The extensive information collected in the ESS, less than one year prior to the pandemic, provides a rich set of background information on the COVID-19 High Frequency Phone Survey of households which can be leveraged to assess the differential impacts of the pandemic in the country.
Sample survey data [ssd]
Households
The Ethiopia - COVID-19 High Frequency Phone Survey of Households covered the following topics:
Topic | Vocabulary |
---|---|
Food (production, crisis) | FAO |
Aid effectiveness | FAO |
Social protection | FAO |
Disaster Risk Management | FAO |
National
The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.
Name |
---|
World Bank |
Name | Role |
---|---|
Central Statistical Agency | Collaborator |
Name | Role |
---|---|
United States Agency for International Development | Funded the study |
The World Bank Group | Funded the study |
The sample of the HFPS-HH is a subsample of the 2018-2019 Ethiopia Socioeconomic Survey (ESS). The ESS is built on a nationally and regionally representative sample of households in Ethiopia. ESS 2018-2019 interviewed 6,770 households in urban and rural areas. In the ESS interview, households were asked to provide phone numbers either their own or that of a reference household (i.e. friends or neighbours) so that they can be contacted in the follow-up ESS surveys should they move from their sampled location. At least one valid phone number was obtained for 5,374 households (4,626 owning a phone and 995 with a reference phone number). These households established the sampling frame for the HFPS-HH.
To obtain representative strata at the national, urban, and rural level, the target sample size for the HFPS-HH is 3,300 households; 1,300 in rural and 2,000 households in urban areas. In rural areas, we attempt to call all phone numbers included in the ESS as only 1,413 households owned phones and another 771 households provided reference phone numbers. In urban areas, 3,213 households owned a phone and 224 households provided reference phone numbers. To account for non-response and attrition all the 5,374 households were called in round 1 of the HFPS-HH.
The total number of completed interviews in round one is 3,249 households (978 in rural areas, 2,271 in urban areas).
The total number of completed interviews in round two is 3,107 households (940 in rural areas, 2,167 in urban areas).
The total number of completed interviews in round three is 3,058 households (934 in rural areas, 2,124 in urban areas).
To obtain unbiased estimates from the sample, the information reported by households needs to be adjusted by a sampling weight (or raising factor) “w_h”. To construct the sampling weights, we follow the steps outlined in Himelein, K. (2014), which outlines eight steps, of which we follow six, to construct the sampling weights for the HFPS-HH:
Start | End | Cycle |
---|---|---|
2020-04-22 | 2020-05-13 | Round 1 |
2020-05-14 | 2020-06-03 | Round 2 |
2020-06-04 | 2020-06-26 | Round 3 |
DATA CLEANING
At the end of data collection, the raw dataset was cleaned by the Research team. This included formatting, and correcting results based on monitoring issues, enumerator feedback and survey changes. The details are as follows.
Variable naming and labelling:
• Variable names were changed to reflect the lowercase question name in the paper survey copy, and a word or two related to the question.
• Variables were labelled with longer descriptions of their contents and the full question text was stored in Notes for each variable.
• “Other, specify” variables were named similarly to their related question, with “_other” appended to the name.
• Value labels were assigned where relevant, with options shown in English for all variables, unless preloaded from the roster in Amharic.
Variable formatting:
• Variables were formatted as their object type (string, integer, decimal, time, date, or datetime).
• Multi-select variables were saved both in space-separated single-variables and as multiple binary variables showing the yes/no value of each possible response.
• Time and date variables were stored as POSIX timestamp values and formatted to show Gregorian dates.
• Location information was left in separate ID and Name variables, following the format of the incoming roster. IDs were formatted to include only the variable level digits, and not the higher-level prefixes (2-3 digits only.)
• Full Household and Enumeration Area ID variables were given leading 0s to match incoming roster format.
Observation and variable arrangement:
• Only consented surveys were kept in the dataset, and all personal information and internal survey variables were dropped from the clean dataset.
• Roster data is separated from the main data set and kept in long-form but can be merged on the key variable (key can also be used to merge with the raw data).
• In the main dataset, ii4_resp_id and cs7_hhh_id are the roster IDs of the respondent and household head respectively and can be merged with individual_id in the roster.
• The variables were arranged in the same order as the paper instrument, with observations arranged according to their submission time.
Backcheck data review: Results of the backcheck survey are compared against the originally captured survey results using the bcstats command in Stata. This function delivers a comparison of variables and identifies any discrepancies. Any discrepancies identified are then examined individually to determine if they are within reason.
Is signing of a confidentiality declaration required? | Confidentiality declaration text |
---|---|
yes | The users shall not take any action with the purpose of identifying any individual entity (i.e. person, household, enterprise, etc.) in the micro dataset(s). If such a disclosure is made inadvertently, no use will be made of the information, and it will be reported immediately to FAO |
Micro datasets disseminated by FAO shall only be allowed for research and statistical purposes. Any user which requests access working for a commercial company will not be granted access to any micro dataset regardless of their specified purpose. Users requesting access to any datasets must agree to the following minimal conditions:
Use of the dataset must be acknowledged using a citation which would include:
World Bank. Ethiopia - COVID-19 High Frequency Phone Survey of Households 2020. Dataset downloaded from www.microdata.worldbank.org on [date].
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses
Name | Affiliation | |
---|---|---|
LSMS | The World Bank Group | [email protected] |
DDI_ETH_2020_HFPS_v01_EN_M_v01_A_OCS_FAO
Name | Affiliation | Role |
---|---|---|
Office of Chief Statistician | Food and Agriculture Organization | Adoption of metadata for FAM |
Development Economics Data Group | The World Bank | Documentation of the DDI |
ETH_2020_HFPS_v01_EN_M_v01_A_OCS_v01