Microsoft released a U.S.-wide vector building dataset in 2018. Although the vector building layers provide relatively accurate geometries, their use in large-extent geospatial analysis comes at a high computational cost. We used High-Performance Computing (HPC) to develop an algorithm that calculates six summary values for each cell in a raster representation of each U.S. state, excluding Alaska and Hawaii: (1) total footprint coverage, (2) number of unique buildings intersecting each cell, (3) number of building centroids falling inside each cell, and area of the (4) average, (5) smallest, and (6) largest area of buildings that intersect each cell. These values are represented as raster layers with 30 m cell size covering the 48 conterminous states. We also identify errors in the original building dataset. We evaluate precision and recall in the data for three large U.S. urban areas. Precision is high and comparable to results reported by Microsoft while recall is high for buildings with footprints larger than 200 m2 but lower for progressively smaller buildings.
Citation Information
Publication Year | 2020 |
---|---|
Title | A rasterized building footprint dataset for the United States |
DOI | 10.1038/s41597-020-0542-3 |
Authors | Mehdi Pourpeikari Heris, Nathan Leon Foks, Kenneth J. Bagstad, Austin Troy, Zachary H. Ancona |
Publication Type | Article |
Publication Subtype | Journal Article |
Series Title | Scientific Data |
Index ID | 70210868 |
Record Source | USGS Publications Warehouse |
USGS Organization | Geosciences and Environmental Change Science Center |