The 2021 Community for Data Integration Workshop was held May 25-29, 2021 in a virtual format. The theme of the workshop was "Designing Data-Intensive Science."
The 2021 Workshop had 483 registrants, 24 breakout sessions, 3 plenaries, 32 posters or demos, and 14 lightning talks. The completely virtual format allowed a larger number of participants than previous in-person CDI workshops.
The online agenda for the workshop is available on Sched.
Breakout Sessions
- A fun, fast hands-on introduction to the user-centered design process, Joe Bard, Sophie Hou, Rachel Volentine
- Advanced Scientific Computing in the USGS, Janice Gordon, Courtney Neu
- Assessing the Value and Usage of USGS Data Management Plans, Grace Donovan, Amanda Liford, Madison Langseth, Elizabeth Sellers
- Cleaning data with OpenRefine, Ricardo McClees-Funinan
- Cloud Optimized File Formats: What's new in ScienceBase and Strategies for Data Managers, Drew Ignizio, Rich Signell
- Data-Intensive Science in Action at the USGS, Rich Signell, Jeanne Jones, Peter Ng, Kevin Henry, Jamie Jones, Chris Skinner, Jason Kreitler, Ben Sleeter
- Effective Communication, Juliana Casavan, Claire Stirm
- Globus 101: Introduction and Basics to Moving and Sharing Data with Globus, Jeff Falgout, Janice Gordon, Vas Vasiliadis
- How to talk to your data manager/scientist – Breaking the ice, Madison Langseth, Jason Ferrante, Tara Bell, Matt Cannister, Kris Jaeger, Sue Kemp
- Integrated modeling at the USGS - what do we need?, Leslie Hsu, Christie Hegermiller, Brandon Serna, Catherine Jarnevich, Anne Wein
- Making Connections through Data Integration, Shayne Urbanowski, Amber Kremer, Madison Fung, Rich Signell, Becca Scully, Genevieve Barron, Tatyana DiMascio
- Market & Audience Assessment, Juliana Casavan and Claire Stirm
- Online Imagery Data Storage and Release: Current State of the Science and Future Directions, Seth Ackerman, Joe Adams, Sandy Brosnahan, Evan Dailey, Cian Dawson, Frank Engel, Dennis Walworth, Ben Letcher, Chris Gazoorian, Jon Warrick, Anthony Fischbach
- R Workshop: accessing the USGS National Map and making 3D maps with terrainR, Mike Mahoney, Colleen Nell, Lindsay Platt
- Reclamation Use of Data Management and Science for Water Resource Applications, Allison Odell, James Nagode, Kenneth Richard, Ken Nowak, Katie Holman, Lindsay Bearup, Drew Loney.
- Records Management: Winds of Change, Matt Arsenault, Chris Bartlett, Ed Olexa, Larry Reedy
- Semantic Web 101, Fran Lightsom, Brandon Whitehead, Scott Peckham, Ken Bagstad
- The Cloud in Action – How Centers are using Cloud Hosting Solutions for Data-intensive Workflows & Running Scientific Models, Kirsty Haynie, Mike Hearne, Eric Larson, Heather Schovanec, Dionne Zoanni, Cory Overton, Jeremy Fee, Tony Butzer and Stefanie Kagone, Rich Signell, Tarandeep Kalra, Sam Congdon, Courtney Neu
- The fundamentals of design for scientific data visualization, Ellen Bechtel, Sophie Hou, Ben Letcher, Colleen Nell, Amy Puls, Katherine Trickey, Dionne Zoanni
- Updates to the Data Storage and Transport Ecosystem for Research Computing, Jeff Falgout, Janice Gordon
- USGS Cloud Hosting Solutions - Advancing 21st Century Science, Jennifer Erxleben, Eric Larson, Dionne Zoanni, Courtney Neu, Robert Shepherd
- USGS Roadmap to enable FAIR Principles, Wade Bishop, Viv Hutchison, Fran Lightsom, Dave Govoni, Linda Debrewer
- USGS Shared Software Resources, Carl Schroedl
- Using AI/ML to advance USGS Science, Pete Doucette, Eric Larson, JC Nelson, Matt Kuckuk, Jeff Tracey, Freddie Kalaitzis
Posters and Demos
- Modernizing sensor data workflows to leverage Internet of Things (IoT) and cloud-based technologies, Caitlin Andrews
- Retrieving data. Wait a few seconds and try to cut or copy again, Itiya Aneece
- Semantics and machine reasoning enable FAIR, web-based data and model integration, Kenneth Bagstad
- From reactive- to condition-based maintenance: artificial intelligence for anomaly prediction in time-series data and operational decision-making, Matthew Cashman
- USGS Markup Application: Supporting User-Driven Improvements to Hydrography Data, Marcelle Caturia
- Colorado River Basin EarthMAP Implementation, Katharine Dahm
- Transforming Data Representations to Improve Computational Performance, David Donato
- Making USGS/NOAA Total Water Level and Coastal Change Forecast data accessible through user-friendly interfaces, Kara Doran
- What's New in Cloud Hosting Solutions?, Jennifer Erxleben
- Delivering the North American tree-ring fire history network through a web application and an R package, Chris Guiterman
- Central Energy Resources Science Center Data Management Services Project Overview, Gregory Gunther
- Landsat-derived fire history metrics to provide critical information for prioritizing prescribed fire across the Southeast, Todd Hawbaker
- Let’s Chat about Usability!, Sophie Hou
- USGS Model Catalog, Leslie Hsu
- Efficiently Accessing Large Earth Imagery Datasets Using the Meta Raster Format (MRF) and AWS Serverless Architecture, Liz Huselid
- Meet the SAS Science Data Management Team!, Viv Hutchison
- USGS State of the Data, Viv Hutchison
- The Definition of Analysis-ready Data in USGS, Viv Hutchison
- USMIN – delivering critical mineral data for the U.S., Nick Karl
- Diversity and Inclusion Resources at USGS, Kim Kloecker
- A Fire-Aware Stream Application to Integrate USGS Fire and Water Databases, Katharine Kolb
- The Standalone Data Dictionary: A More Robust Approach in Documenting the Entity and Attributes for Data, Raymond Obuch
- Diverse data to improve Southwest fire forecasts: Joining novel remote sensing, post-fire dynamics, and intra-annual precipitation patterns, Michala Phillips
- Advancing Post-Fire Debris Flow Hazard Science with a Field Deployable Mapping Tool, Francis Rengers
- Development of a web-based tool for coastal water resources management, Tara Root
- The Wildfire Trends Tool: A data visualization and analysis tool to meet land management needs and facilitate scientific inquiry, Douglas Shinneman
- Utilization of Google Earth Engine to Examine Surface Water Inundation Patterns in California Croplands, Britt Smith
- Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, Jessica Walker
- Remote sensing strategies for invasive species management, Cynthia Wallace
- GIS Clipping and Summarization Tool for Points, Lines, Polygons, and Rasters, Justin Welty
- Coast Train - Massive Library of Labeled Coastal Images for Machine Learning Applications, Phillipe Wernette
- Visualizing Science using Python Dash, Daniel Wieferich
Lightning Talks
-
Semantics and machine reasoning enable FAIR, web-based data and model integration, Kenneth Bagstad
-
Coast Train - Massive Library of Labeled Coastal Images for Machine Learning Applications, Phillipe Wernette
-
From reactive- to condition-based maintenance: artificial intelligence for anomaly prediction in time-series data and operational decision-making, Matthew Cashman
-
USGS Markup Application: Supporting User-Driven Improvements to Hydrography Data, Marcelle Caturia
-
Making USGS/NOAA Total Water Level and Coastal Change Forecast data accessible through user-friendly interfaces, Kara Doran
-
USGS Model Catalog, Leslie Hsu
-
Efficiently Accessing Large Earth Imagery Datasets Using the Meta Raster Format (MRF) and AWS Serverless Architecture, Liz Huselid
-
A Fire-Aware Stream Application to Integrate USGS Fire and Water Databases, Katharine Kolb
-
The Standalone Data Dictionary: A More Robust Approach in Documenting the Entity and Attributes for Data, Raymond Obuch
-
Diverse Data to Improve Southwest Fire Forecasts: Joining Novel Remote Sensing, Post-fire Dynamics, and Intra-annual Precipitation Patterns, Sasha Reed
-
Development of a web-based tool for coastal water resources management, Tara Root
-
Utilization of Google Earth Engine to Examine Surface Water Inundation Patterns in California Croplands, Britt Smith
-
Remote sensing strategies for invasive species management, Cynthia Wallace
-
Visualizing Science Using Python Dash, Daniel Wieferich
Go back to CDI 2021 Activities.