Common Data Acquisition Considerations
Authoritative Data Source
An Authoritative Data Source (ADS) is a single, officially designated source authorized to provide one or more types of information that lines of business rely on to be trusted, timely, and secure. Trusted information means that the information provider takes management responsibility for the practices, procedures, and processes needed to keep the information within acceptable thresholds for quality, integrity, and security. The intended outcome is information that is visible, accessible, understandable, and credible to information consumers, which include DOI business users, DOI information exchange partners, and IT applications and services. Authoritative Data Sources are assessed and designated through analysis and recommendations documented in, among other places, Modernization Blueprint projects, business process reengineering projects, and E-Gov related projects.
If there is an ADS, does the information in it meet your business needs? Consider attribute values, domain ranges, spatial accuracy, and similar characteristics.
If the ADS meets your needs, you should be using it; in fact, OMB requires you to do so. If it does not meet your needs, documenting why allows you to move forward to other acquisition options.
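One way to make the "does it meet your needs, and if not, why not" question concrete is to compare what the source offers against what the project requires and record each shortfall. The sketch below is illustrative only: the `SourceProfile` and `BusinessNeed` classes, the attribute names, and the thresholds are all hypothetical and do not come from any DOI or OMB specification.

```python
from dataclasses import dataclass


@dataclass
class SourceProfile:
    """Hypothetical summary of what a candidate source provides."""
    attributes: set[str]
    value_ranges: dict[str, tuple[float, float]]  # attribute -> (min, max) covered
    spatial_accuracy_m: float  # stated positional accuracy, in meters


@dataclass
class BusinessNeed:
    """Hypothetical statement of what the project requires."""
    attributes: set[str]
    value_ranges: dict[str, tuple[float, float]]  # attribute -> (min, max) needed
    max_spatial_error_m: float


def document_shortfalls(source: SourceProfile, need: BusinessNeed) -> list[str]:
    """Return one plain-language reason for each requirement the source misses."""
    reasons = []
    for name in sorted(need.attributes - source.attributes):
        reasons.append(f"missing attribute: {name}")
    for name, (low, high) in sorted(need.value_ranges.items()):
        if name not in source.attributes:
            continue  # already reported above as a missing attribute
        covered = source.value_ranges.get(name)
        if covered is None:
            reasons.append(f"no documented domain range for {name}")
        elif covered[0] > low or covered[1] < high:
            reasons.append(
                f"domain range for {name} is {covered[0]} to {covered[1]}, "
                f"need {low} to {high}"
            )
    if source.spatial_accuracy_m > need.max_spatial_error_m:
        reasons.append(
            f"spatial accuracy is {source.spatial_accuracy_m} m, "
            f"need {need.max_spatial_error_m} m or better"
        )
    return reasons


# Example values are invented for illustration.
ads = SourceProfile(
    attributes={"site_id", "stream_temp_c"},
    value_ranges={"stream_temp_c": (0.0, 30.0)},
    spatial_accuracy_m=10.0,
)
need = BusinessNeed(
    attributes={"site_id", "stream_temp_c", "turbidity_ntu"},
    value_ranges={"stream_temp_c": (0.0, 40.0)},
    max_spatial_error_m=5.0,
)
shortfalls = document_shortfalls(ads, need)
for reason in shortfalls:
    print(reason)
```

An empty list suggests the ADS should be used; a non-empty list is a starting point for the written justification for pursuing another acquisition option.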
Newly Collected Data Considerations
Contractor/Volunteer vs. USGS: The decision about who will perform new data collection must weigh the following:
- Skills: The skills required for a collection may dictate that it be contracted out. For example, if the required data can only be collected by a certified person and the USGS does not have anyone available with that certification, contracting may be the only option.
- Frequency: If the data will only be collected once, acquiring the collection skill in-house may not be justified.
- Timeliness: When will the data be needed? Is it time-critical?
In the USGS, most data are collected by employees and/or their contractors. While USGS can obtain some data from outside sources, we recognize that the bulk of an employee's work is the creation and maintenance of data.
Because data collection is important to the Bureau, it is also important to the data stewards [see Plan > Data Stewardship for more information]. Data collection is an area where cost-saving mechanisms are needed. For instance, Global Positioning System (GPS) receivers and mobile units are now used to capture field data and enter them directly at the source. The challenge remains to collect quality data at the source, where data can be correlated directly with observation and where the strictest controls should be placed. Unfortunately, strict control has not historically been applied at the source.
Therefore, strict controls must be in place before data are first collected. All of the analysis, definitions, and standards need to be settled prior to any field collection. While this may seem obvious, it is not always practiced. Good planning reduces the cost of this heavy budget item.
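Controls at the source can be as simple as checking each record against the agreed definitions at the moment it is entered on the mobile unit. The following is a minimal sketch under stated assumptions: the `STANDARD` dictionary, its field names, and its ranges are hypothetical placeholders for whatever definitions and standards a project settles on before going into the field.

```python
from datetime import date

# Hypothetical collection standard, agreed on before field work begins.
STANDARD = {
    "required": ("site_id", "observed_on", "latitude", "longitude", "ph"),
    "ranges": {
        "latitude": (-90.0, 90.0),
        "longitude": (-180.0, 180.0),
        "ph": (0.0, 14.0),
    },
}


def check_at_source(record: dict) -> list[str]:
    """Return the problems found in one field record; an empty list means it passes."""
    problems = []
    for name in STANDARD["required"]:
        if record.get(name) is None:
            problems.append(f"{name} is required")
    for name, (low, high) in STANDARD["ranges"].items():
        value = record.get(name)
        if value is not None and not (low <= value <= high):
            problems.append(f"{name}={value} is outside {low} to {high}")
    observed = record.get("observed_on")
    if observed is not None and observed > date.today():
        problems.append("observed_on is in the future")
    return problems


# Invented example entry with an impossible pH reading.
entry = {
    "site_id": "A-01",
    "observed_on": date(2011, 2, 1),
    "latitude": 44.5,
    "longitude": -110.2,
    "ph": 17.0,
}
problems = check_at_source(entry)
print(problems)
```

Rejecting or flagging the record while the collector is still on site means the value can be checked against the observation itself, which is much cheaper than reconstructing it later.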
Data must be reviewed and updated on a regular schedule to maintain a high standard of quality, and each time the data change, the metadata must be updated as well. Managers need to be confident that they have the best possible data available when making decisions.
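One way to keep data and metadata from drifting apart is to make the metadata update part of the same operation that changes the data. The sketch below assumes a hypothetical in-memory `Dataset` class and metadata keys (`record_count`, `last_updated`, `change_log`); a real system would map these onto its own metadata standard.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta


@dataclass
class Dataset:
    """Hypothetical dataset that carries its own metadata."""
    records: list = field(default_factory=list)
    metadata: dict = field(default_factory=dict)

    def update(self, new_records: list, changed_on: date, summary: str) -> None:
        """Change the data and the metadata in the same step."""
        self.records.extend(new_records)
        self.metadata["record_count"] = len(self.records)
        self.metadata["last_updated"] = changed_on
        self.metadata.setdefault("change_log", []).append((changed_on, summary))

    def review_due(self, today: date, interval: timedelta) -> bool:
        """True when the dataset was never updated or the last update is older than the interval."""
        last = self.metadata.get("last_updated")
        return last is None or today - last > interval


# Invented example: one update, then a check against a one-year review interval.
ds = Dataset()
ds.update([{"site_id": "A-01"}], date(2011, 2, 1), "initial field collection")
overdue = ds.review_due(date(2012, 6, 1), timedelta(days=365))
print(ds.metadata["record_count"], overdue)
```

Because `update` is the only path that modifies `records` in this sketch, the metadata cannot fall behind the data, and `review_due` gives a simple trigger for the regular review schedule.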
Converted/Transformed Legacy Data Considerations
- Legacy Quality: Are the data of sufficient quality to meet the science needs?
- Technical Issues: Is the storage medium readable? Can the data be converted into a usable format? At what cost?
Shared/Exchanged Data Considerations
- Creating Data Sharing Agreements: Data Sharing Agreements need to include provisions concerning access and dissemination. It is not wise to enter into a data sharing agreement under which privacy information may be disclosed to non-Federal organizations, since they are not subject to the Privacy Act. Similarly, the non-Federal organization needs to be alerted that Federal agencies may be compelled to release information under the FOIA. Learn more about Data Sharing Agreements.
- Data Organization: Are the data organized in a usable form? Will they require conversion/transformation to make them usable? Who will perform this work? At what cost?
- Records Requirements: Data must have corresponding metadata and other pertinent documentation.
- Completeness of Data: Are the data complete? If not, who will address the gaps in the data? At what cost?
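The completeness question in the list above can be answered with a quick per-field count before an agreement is finalized. This is a rough sketch, assuming the shared data arrive as a list of dictionaries; the function name `completeness_report` and the field names are hypothetical.

```python
def completeness_report(rows: list[dict], expected_fields: list[str]) -> dict[str, float]:
    """Return the fraction of rows with a usable value for each expected field."""
    report = {}
    for name in expected_fields:
        # A value counts as usable when it is present and neither None nor empty text.
        filled = sum(1 for row in rows if row.get(name) not in (None, ""))
        report[name] = filled / len(rows) if rows else 0.0
    return report


# Invented sample of shared records with gaps in one field.
rows = [
    {"site_id": "A-01", "depth_m": 1.2},
    {"site_id": "A-02", "depth_m": None},
    {"site_id": "A-03"},
    {"site_id": "A-04", "depth_m": 0.8},
]
report = completeness_report(rows, ["site_id", "depth_m"])
gaps = [name for name, share in report.items() if share < 1.0]
print(report, gaps)
```

The list of fields with gaps gives both parties something specific to discuss when deciding who will fill them and at what cost.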
Purchased Data Considerations
- Purchase Agreements: Data purchases require a Purchasing Agreement. By purchasing data, you are endorsing the data. Such data then become subject to the Information Quality Act, which covers all data, not just geospatial data.
- Data Certification: Metadata are required for purchased data. The specifics of this requirement should be spelled out in the Purchasing Agreement.
- Licensing Issues: What restrictions are placed upon the use of the data? Are there Privacy Act or FOIA considerations?
References
- Chatfield, T., Selbach, R. February, 2011. Data Management for Data Stewards. Data Management Training Workshop. Bureau of Land Management (BLM).