Principal Investigator : Anthony L Everette, Susan K Skagen
Objectives
1. Validate and document the application of the CDI Data Management Lifecycle framework
Through cooperation with the CSAS ‘Species Occurrence Records and Data Transformation Processes’ project we were able to expand the scope of this objective to include additional bat and white-tailed kite data to the project, increasing our sample size for estimating resource requirements from 3 datasets to 9. Details of the progress for each dataset follows.
-
Southeastern Arizona riparian bird and habitat data (Completed)
Data has been quality controlled and documented and is being prepared for submission as a USGS Digital Data Series product. -
Texas, Kansas, Oklahoma, South Dakota, North Dakota wetlands and shorebird data (Processing)
The original WB3 format for these data requires Quattro Pro 7 software licensing which we have acquired and converted files to CSV and XLSX format for further processing. -
Eastern Colorado prairie bird and habitat data (Processing)
The original WB3 format for these data requires Quattro Pro 7 software licensing which we have acquired and converted files to CSV and XLSX format for further processing.
-
Bats of the Rocky Mountain Arsenal Mist Net Data (Completed)
Data and metadata have been reviewed and archived in the Fort Collins Science Center Sciencebase community and is ready for FSP approval. -
Bat Inventory of Ouray National Wildlife Refuge Mist Net Data (Completed)
Data and metadata have been reviewed and archived in the Fort Collins Science Center Sciencebase community and is ready for FSP approval. -
Bats of Mesa Verde National Monument Mist Net Data (Processing)
Data has been reviewed and metadata is in the process of being completed. Both have been archived in the Fort Collins Science Center Sciencebase community. -
White-tailed Kite Historic Data (Processing)
We are working with the original USGS PI and the museums who contributed data to this datasets to verify data agreements and to develop metadata. The data itself has been converted from its original QuatroPro 6 format to XLSX and CSV for final data processing. -
White-tailed Kite Physiological Data (Processing)
Metadata for this dataset is complete and the data has been converted from its original QuatroPro 6 format to XLSX and CSV for processing. -
White-tailed Kite Morphological Data (Processing)
Metadata for this dataset is complete and the original capture data has been converted from its original QuatroPro format to XLSX and CSV for processing. -
Chapter 1 of the project’s completion report is in draft form.
2. Inventory, prioritize and estimate the cost of integrating a USGS data mine
-
Conducted an inventory of FORT datasets based on metadata produced between 1994-2011. We’ve identified 440 potential datasets to date. All inventoried records have been migrated to the Fort Collins Science Center’s Sciencebase community as part of a ‘Data Mine’ space, which is restricted to internal FORT data stewards and principal investigators. Once datasets are completely processed and approved for public distribution, those Sciencebase dataset items will be moved to the FORT community’s public ‘Datasets’ space for distribution.
-
FORT Case Files were also inventoried where dataset metadata was associated with active FORT staff. Fewer than a dozen additional datasets have been identified through this review of Case Files, however, this process is ongoing.
-
Initial estimates for datasets similar to those completing Objective 1 processing have been assigned. These are updated as dataset processing times are analyzed. The current average is 24 hours per dataset to process.
-
Chapter 2 of the project’s completion report is in draft form.
3. Objective 3: Develop a USGS Data Mine Web application
-
Based on our experience inventorying the FORT’s metadata and case files we are currently documenting our suggested inventory workflows and designing wireframes for the Data Mine data management application. These make up Chapter 3 of the project completion report.
- We are evaluating other data and project lifecycle management applications being developed within USGS to partner with, the hope being the Data Mine application as a legacy data portal that unifies and/or expands upon the strengths of each of those related applications.
Note: this description is from the FY13 Annual Report
- Source: USGS Sciencebase (id: 523235b3e4b0f06321418f1d)
USGS Data at Risk: Expanding Legacy Data Inventory and Preservation Strategies
Developing a USGS Legacy Data Inventory to Preserve and Release Historical USGS Data
Principal Investigator : Anthony L Everette, Susan K Skagen
Objectives
1. Validate and document the application of the CDI Data Management Lifecycle framework
Through cooperation with the CSAS ‘Species Occurrence Records and Data Transformation Processes’ project we were able to expand the scope of this objective to include additional bat and white-tailed kite data to the project, increasing our sample size for estimating resource requirements from 3 datasets to 9. Details of the progress for each dataset follows.
-
Southeastern Arizona riparian bird and habitat data (Completed)
Data has been quality controlled and documented and is being prepared for submission as a USGS Digital Data Series product. -
Texas, Kansas, Oklahoma, South Dakota, North Dakota wetlands and shorebird data (Processing)
The original WB3 format for these data requires Quattro Pro 7 software licensing which we have acquired and converted files to CSV and XLSX format for further processing. -
Eastern Colorado prairie bird and habitat data (Processing)
The original WB3 format for these data requires Quattro Pro 7 software licensing which we have acquired and converted files to CSV and XLSX format for further processing.
-
Bats of the Rocky Mountain Arsenal Mist Net Data (Completed)
Data and metadata have been reviewed and archived in the Fort Collins Science Center Sciencebase community and is ready for FSP approval. -
Bat Inventory of Ouray National Wildlife Refuge Mist Net Data (Completed)
Data and metadata have been reviewed and archived in the Fort Collins Science Center Sciencebase community and is ready for FSP approval. -
Bats of Mesa Verde National Monument Mist Net Data (Processing)
Data has been reviewed and metadata is in the process of being completed. Both have been archived in the Fort Collins Science Center Sciencebase community. -
White-tailed Kite Historic Data (Processing)
We are working with the original USGS PI and the museums who contributed data to this datasets to verify data agreements and to develop metadata. The data itself has been converted from its original QuatroPro 6 format to XLSX and CSV for final data processing. -
White-tailed Kite Physiological Data (Processing)
Metadata for this dataset is complete and the data has been converted from its original QuatroPro 6 format to XLSX and CSV for processing. -
White-tailed Kite Morphological Data (Processing)
Metadata for this dataset is complete and the original capture data has been converted from its original QuatroPro format to XLSX and CSV for processing. -
Chapter 1 of the project’s completion report is in draft form.
2. Inventory, prioritize and estimate the cost of integrating a USGS data mine
-
Conducted an inventory of FORT datasets based on metadata produced between 1994-2011. We’ve identified 440 potential datasets to date. All inventoried records have been migrated to the Fort Collins Science Center’s Sciencebase community as part of a ‘Data Mine’ space, which is restricted to internal FORT data stewards and principal investigators. Once datasets are completely processed and approved for public distribution, those Sciencebase dataset items will be moved to the FORT community’s public ‘Datasets’ space for distribution.
-
FORT Case Files were also inventoried where dataset metadata was associated with active FORT staff. Fewer than a dozen additional datasets have been identified through this review of Case Files, however, this process is ongoing.
-
Initial estimates for datasets similar to those completing Objective 1 processing have been assigned. These are updated as dataset processing times are analyzed. The current average is 24 hours per dataset to process.
-
Chapter 2 of the project’s completion report is in draft form.
3. Objective 3: Develop a USGS Data Mine Web application
-
Based on our experience inventorying the FORT’s metadata and case files we are currently documenting our suggested inventory workflows and designing wireframes for the Data Mine data management application. These make up Chapter 3 of the project completion report.
- We are evaluating other data and project lifecycle management applications being developed within USGS to partner with, the hope being the Data Mine application as a legacy data portal that unifies and/or expands upon the strengths of each of those related applications.
Note: this description is from the FY13 Annual Report
- Source: USGS Sciencebase (id: 523235b3e4b0f06321418f1d)