
The ipums_ddi class provides a data structure for storing the metadata contained in IPUMS codebook files. These objects are primarily used when loading IPUMS data, but can also be used to explore the metadata for an IPUMS extract.

  • For microdata projects, this information is provided in DDI codebook (.xml) files.

  • For NHGIS, this information is provided in .txt codebook files.

The codebook file contains metadata about the extract files themselves, including file name, file path, and extract date. It also contains information about the variables present in the data, including variable names, descriptions, data types, implied decimals, and positions in the fixed-width files.

This information is used to correctly parse IPUMS fixed-width files and attach additional variable metadata to data upon load.
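To make the role of positions and implied decimals concrete, the following base R sketch parses two made-up fixed-width records using a small, hypothetical metadata table. The variable names, labels, positions, and the helper `parse_fixed_width()` are illustrative assumptions and are not part of ipumsr; the package's own readers handle this internally and may work differently.

```r
# Hypothetical variable metadata, standing in for what a codebook describes.
# Column names here are chosen for illustration only.
var_meta <- data.frame(
  var_name  = c("YEAR", "AGE", "INCWT"),
  var_label = c("Survey year", "Age in years", "Income weight"),
  start     = c(1, 5, 8),    # first character position of each variable
  end       = c(4, 7, 13),   # last character position of each variable
  imp_decim = c(0, 0, 2),    # number of implied decimal places
  stringsAsFactors = FALSE
)

# Two made-up fixed-width records, 13 characters each.
records <- c("2020034012345", "2021051000250")

# Illustrative helper: slice each variable out of the records by position,
# convert to numeric, apply implied decimals, and attach the label.
parse_fixed_width <- function(lines, meta) {
  cols <- lapply(seq_len(nrow(meta)), function(i) {
    raw <- substr(lines, meta$start[i], meta$end[i])
    as.numeric(raw) / 10^meta$imp_decim[i]
  })
  names(cols) <- meta$var_name
  out <- as.data.frame(cols, stringsAsFactors = FALSE)
  for (i in seq_len(nrow(meta))) {
    attr(out[[meta$var_name[i]]], "label") <- meta$var_label[i]
  }
  out
}

parsed <- parse_fixed_width(records, var_meta)
print(parsed)
```

In this sketch the raw text `012345` for INCWT is read as 123.45 because the metadata declares two implied decimals. Without that metadata, the same characters would be read as the integer 12345.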

Note that codebook metadata for NHGIS extracts can also be stored in an ipums_ddi object, even though these codebooks are distributed as .txt files, not .xml files. These files do not adhere to the same standards as the DDI codebook files, so some ipums_ddi fields will be left blank when reading NHGIS codebooks.

Creating an ipums_ddi object

Loading data

View metadata

  • Use ipums_var_info() to explore variable-level metadata for the variables included in a dataset.

  • Use ipums_file_info() to explore file-level metadata for an extract.
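
As a brief usage sketch, the two functions above can be applied to an ipums_ddi object read from one of the example codebooks bundled with ipumsr. This assumes that ipumsr is installed and that its bundled examples include a CPS DDI file whose name matches the pattern below; the example file names and the output column names can differ between package versions, so treat both as assumptions.

```r
# Assumes the ipumsr package is installed; the guard skips the example otherwise.
if (requireNamespace("ipumsr", quietly = TRUE)) {
  # With no arguments, ipums_example() lists the bundled example files.
  example_files <- ipumsr::ipums_example()

  # Pick a CPS DDI codebook (.xml); the exact file name varies by version.
  ddi_name <- grep("^cps.*\\.xml$", example_files, value = TRUE)[1]
  ddi <- ipumsr::read_ipums_ddi(ipumsr::ipums_example(ddi_name))

  # Variable-level metadata: one row per variable in the extract.
  var_info <- ipumsr::ipums_var_info(ddi)
  print(var_info[, c("var_name", "var_label", "start", "end", "imp_decim")])

  # File-level metadata: a list of fields describing the extract.
  file_info <- ipumsr::ipums_file_info(ddi)
  print(names(file_info))
}
```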