ipumsr provides an R interface for handling IPUMS data, allowing users to:
Easily read files downloaded from the IPUMS extract system
Request data, download files, and get metadata from certain IPUMS collections
Interpret and process data using the contextual information that is included with many IPUMS files
To install the package from CRAN, use
To install the development version of the package, use
IPUMS is the world’s largest publicly available population database, providing census and survey data from around the world integrated across time and space. IPUMS integration and documentation make it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community context. Data and services are available free of charge.
IPUMS consists of multiple projects, or collections, that provide different data products.
- Microdata projects distribute data for individual survey units, like people or households.
- Aggregate data projects distribute summary tables of aggregate statistics for particular geographic units along with corresponding GIS mapping files.
ipumsr supports different levels of functionality for each IPUMS project, as summarized in the table below.
|Read Data Extracts
|Request & Download Data
|U.S. Census and American Community Survey microdata (1850-present)
|Current Population Survey microdata including basic monthly surveys and supplements (1962-present)
|Census microdata covering over 100 countries, contemporary and historical
|Tabular U.S. Census data and GIS mapping files (1790-present)
|Tabular and GIS data from population, housing, and agricultural censuses around the world
|IPUMS Time Use
|Time use microdata from the U.S. (1930-present) and thirteen other countries (1965-present)
|IPUMS Health Surveys
|Microdata from the U.S. National Health Interview Survey (NHIS) (1963-present) and Medical Expenditure Panel Survey (MEPS) (1996-present)
|IPUMS Global Health
|Health survey microdata for low- and middle-income countries, including harmonized data collections for Demographic and Health Surveys (DHS) and Performance Monitoring for Action (PMA) surveys
|IPUMS Higher Ed
|Survey microdata on the science and engineering workforce in the U.S. from 1993 to 2013
ipumsr uses the IPUMS API to submit data requests, download data extracts, and get metadata, so the scope of functionality generally corresponds to that available via the API. As the IPUMS team extends the API to support more functionality for more projects, we aim to extend ipumsr capabilities accordingly.
The package vignettes are the best place to explore what ipumsr has to offer:
To read IPUMS data extracts into R, see
To interact with the IPUMS extract and metadata system via the IPUMS API, see
To work with labelled values in IPUMS data, see
For techniques for working with large data extracts, see
We greatly appreciate feedback and development contributions. Please submit any bug reports, pull requests, or other suggestions on GitHub. Before contributing, please be sure to read the Contributing Guidelines and the Code of Conduct.