Skip to main content
U.S. flag

An official website of the United States government

Publications

Browse publications that have a connection to the Community for Data Integration.

Filter Total Items: 36

Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor

Biologists and environmental scientists now routinely solve computational problems that were unimaginable a generation ago. Examples include processing geospatial data, analyzing -omics data, and running large-scale simulations. Conventional desktop computing cannot handle these tasks when they are large, and high-performance computing is not always available nor the most appropriate solution for
Authors
Richard A. Erickson, Michael N. Fienen, S. Grace McCalla, Emily L. Weiser, Melvin L. Bower, Jonathan M. Knudson, Greg Thain

Community for Data Integration 2017 annual report

The Community for Data Integration (CDI) is a group that helps members grow their expertise on all aspects of working with scientific data. The CDI’s activities advance data and information integration capabilities in the U.S. Geological Survey and in the wider Earth and biological sciences. This annual report describes the presentations, activities, collaboration areas, workshop, and other CDI-sp
Authors
Leslie Hsu, Madison L. Langseth

U.S. Geological Survey Community for Data Integration 2017 Workshop Proceedings

Executive SummaryThe U.S. Geological Survey (USGS) Community for Data Integration (CDI) Workshop was held May 16–19, 2017 at the Denver Federal Center. There were 183 in-person attendees and 35 virtual attendees over four days. The theme of the workshop was “Enabling Integrated Science,” with the purpose of bringing together the community to discuss current topics, shared challenges, and steps for
Authors
Leslie Hsu, Vivian B. Hutchison, Madison L. Langseth, Benjamin Wheeler

Development and release of phenological data products—A case study in compliance with federal open data policy

In Autumn 2015, USA National Phenology Network (USA-NPN) staff implemented new U.S. Geological Survey (USGS) data-management policies intended to ensure that the results of Federally funded research are made available to the public. The effort aimed both to improve USA-NPN data releases and to provide a model for similar programs within the USGS. This report provides an overview of the steps taken
Authors
Alyssa H. Rosemartin, Madison L. Langseth, Theresa Crimmins, Jake F. Weltzin

An open repository of earthquake-triggered ground-failure inventories

Earthquake-triggered ground failure, such as landsliding and liquefaction, can contribute significantly to losses, but our current ability to accurately include them in earthquake-hazard analyses is limited. The development of robust and widely applicable models requires access to numerous inventories of ground failures triggered by earthquakes that span a broad range of terrains, shaking characte
Authors
Robert G. Schmitt, Hakan Tanyas, M. Anna Nowicki Jessee, Jing Zhu, Katherine M. Biegel, Kate E. Allstadt, Randall W. Jibson, Eric M. Thompson, Cees J. van Westen, Hiroshi P. Sato, David J. Wald, Jonathan W. Godt, Tolga Gorum, Chong Xu, Ellen M. Rathje, Keith L. Knudsen

Community for Data Integration 2016 annual report

The Community for Data Integration (CDI) represents a dynamic community of practice focused on advancing science data and information management and integration capabilities across the U.S. Geological Survey and the CDI community. This annual report describes the various presentations, activities, and outcomes of the CDI monthly forums, working groups, virtual training series, and other CDI-sponso
Authors
Madison L. Langseth, Leslie Hsu, Jon Amberg, Norman Bliss, Andrew R. Bock, Rachel T. Bolus, R. Sky Bristol, Katherine J. Chase, Theresa M. Crimmins, Paul S. Earle, Richard Erickson, A. Lance Everette, Jeff T. Falgout, John Faundeen, Michael N. Fienen, Rusty Griffin, Michelle R. Guy, Kevin D. Henry, Nancy J. Hoebelheinrich, Randall J. Hunt, Vivian B. Hutchison, Drew A. Ignizio, Dana M. Infante, Catherine Jarnevich, Jeanne M. Jones, Tim Kern, Scott Leibowitz, Francis L. Lightsom, R. Lee Marsh, S. Grace McCalla, Marcia McNiff, Jeffrey T. Morisette, John C. Nelson, Tamar Norkin, Todd M. Preston, Alyssa Rosemartin, Roy Sando, Jason T. Sherba, Richard P. Signell, Benjamin M. Sleeter, Eric T. Sundquist, Colin B. Talbert, Roland J. Viger, Jake F. Weltzin, Sharon Waltman, Marc Weber, Daniel J. Wieferich, Brad Williams, Lisamarie Windham-Myers

Community for Data Integration 2015 annual report

The Community for Data Integration (CDI) continued to experience success in fiscal year 2015. The CDI community members have been sharing, learning, and collaborating through monthly forums, workshops, working groups, and funded projects. In fiscal year 2015, CDI coordinated 10 monthly forums with 16 different speakers from the U.S. Geological Survey and external partners; funded 11 collaborative
Authors
Madison L. Langseth, Michelle Y. Chang, Jennifer Carlino, J. Ryan Bellmore, Daniella D. Birch, Joshua Bradley, R. Sky Bristol, Daniel D. Buscombe, Jeffrey J. Duda, Anthony L. Everette, Tabitha A. Graves, Michelle M. Greenwood, David L. Govoni, Heather S. Henkel, Vivian B. Hutchison, Brenda K. Jones, Tim Kern, Jennifer Lacey, Rynn M. Lamb, Frances L. Lightsom, John L. Long, Ra'ad A. Saleh, Stan W. Smith, Christopher E. Soulard, Roland J. Viger, Jonathan A. Warrick, Katherine E. Wesenberg, Daniel J. Wieferich, Luke A. Winslow

Dam Removal Information Portal (DRIP)—A map-based resource linking scientific studies and associated geospatial information about dam removals

The removal of dams has recently increased over historical levels due to aging infrastructure, changing societal needs, and modern safety standards rendering some dams obsolete. Where possibilities for river restoration, or improved safety, exceed the benefits of retaining a dam, removal is more often being considered as a viable option. Yet, as this is a relatively new development in the history
Authors
Jeffrey J. Duda, Daniel J. Wieferich, R. Sky Bristol, J. Ryan Bellmore, Vivian B. Hutchison, Katherine M. Vittum, Laura Craig, Jonathan A. Warrick

sbtools: A package connecting R to cloud-based data for collaborative online research

The adoption of high-quality tools for collaboration and reproducible research such as R and Github is becoming more common in many research fields. While Github and other version management systems are excellent resources, they were originally designed to handle code and scale poorly to large text-based or binary datasets. A number of scientific data repositories are coming online and are often f
Authors
Luke Winslow, Scott Chamberlain, Alison P. Appling, Jordan S. Read

Sharing our data—An overview of current (2016) USGS policies and practices for publishing data on ScienceBase and an example interactive mapping application

This report provides an overview of current (2016) U.S. Geological Survey policies and practices related to publishing data on ScienceBase, and an example interactive mapping application to display those data. ScienceBase is an integrated data sharing platform managed by the U.S. Geological Survey. This report describes resources that U.S. Geological Survey Scientists can use for writing data mana
Authors
Katherine J. Chase, Andrew R. Bock, Roy Sando

myScience—Engaging the public in U.S. Geological Survey science

myScience (http://txpub.usgs.gov/myscience/) is a Web application developed by the U.S. Geological Survey (USGS) Texas Water Science Center through a partnership with the USGS Community for Data Integration to address the need for increasing public awareness and participation in existing USGS citizen science projects. The myScience application contains data for 20 projects available for public par
Authors
Sally Holl

Community for Data Integration 2014 annual report

The U.S. Geological Survey (USGS) researches Earth science to help address complex issues affecting society and the environment. In 2006, the USGS held the first Scientific Information Management Workshop to bring together staff from across the organization to discuss the data and information management issues affecting the integration and delivery of Earth science research and investigate the use
Authors
Madison L. Langseth, Michelle Y. Chang, Jennifer Carlino, Daniella D. Birch, Joshua Bradley, R. Sky Bristol, Craig Conzelmann, Robert H. Diehl, Paul S. Earle, Laura E. Ellison, Anthony L. Everette, Pamela L. Fuller, Janice M. Gordon, David L. Govoni, Michelle R. Guy, Heather S. Henkel, Vivian B. Hutchison, Tim Kern, Frances L. Lightsom, Joseph W. Long, Ryan Longhenry, Todd M. Preston, Stan W. Smith, Roland J. Viger, Katherine Wesenberg, Eric C. Wood