Skip to main content
Advanced Search

Filters: partyWithName: Community for Data Integration - CDI (X)

126 results (37ms)   

Filters
View Results as: JSON ATOM CSV
thumbnail
Natural resources managers are regularly required to make decisions regarding upcoming restoration treatments, often based on little more than business as usual practices. To assist in the decision-making process, we created a tool that predicts site-specific soil moisture and climate for the upcoming year, and provides guidance on whether common restoration activities (i.e. seeding, planting) will be successful based on these conditions. This tool is hosted within the Land Treatment Exploration Tool (LTET), an application already used by land managers that delivers a report of site condition and treatment history. Incorporated within the short-term drought forecaster (STDF) is a rigorous statistical process,...
thumbnail
Digital Elevation Models (DEM) provide details of the earth’s surface and are used for visualization, physical modeling, and elevation change analysis. Creating DEMs in coastal environments is complicated by the highly ephemeral nature of the coast and the need to span the land-water interface. This requires merging multiple bathymetric and topographic datasets that have been collected at different times, using different instrument platforms with varying levels of accuracy, and with variable spatial resolution and coverage. Because coastal change can occur over relatively short time scales (days to weeks in the case of storms), rapid updates to coastal DEMs are also needed. These challenges and the lack of available...
thumbnail
Increasingly, USGS scientists seek to share and collaborate while working on data and code. Furthermore, these scientists often require advanced computing resources. Jupyter Notebooks are one such tool for creating these workflows. The files are interactive, code “notebooks” which allow users to combine code and text in one document, enabling scientists to share the stories held within their data. Recently, USGS launched an instance of Pangeo—a community platform for Big Data geoscience—as a tool for internally hosting and executing these notebooks. Few examples exist on how to use Pangeo and no formal documentation exists for USGS scientists to use Pangeo. We will create and curate examples of using Jupyter Notebooks...
thumbnail
Metadata Wizard THIS VERSION OF THE TOOL HAS BEEN REPLACED BY AN UPDATED VERSION Users should obtain the new version of the Metadata Wizard at the links below: The user manual is available here: https://usgs.github.io/fort-pymdwizard/index.html Software installer can be downloaded here: https://github.com/usgs/fort-pymdwizard/releases This tool will eventually replace the Metadata Wizard hosted from this page to eliminate dependencies on ESRI ArcDesktop and to enable Mac users to utilize the Metadata Wizard. Documentation and Previous Release Notes for the Legacy Publication and Product are Below ------------------------------ Metadata Wizard version: 1.8.5 (Last updated: 1/21/20) To download this toolbox...
Geographic Information System (GIS) analyses are an essential part of natural resource management and research. Calculating and summarizing data within intersecting GIS layers is common practice for analysts and researchers. However, the various tools and steps required to complete this process are slow and tedious, requiring many tools iterating over hundreds, or even thousands of datasets. We propose to combine a series of ArcGIS geoprocessing capabilities with custom scripts to create tools that will calculate, summarize, and organize large amounts of data that can span many temporal and spatial scales with minimal user input. The tools work with polygons, lines, points, and rasters to calculate relevant summary...
The USGS provides many national, regional and local datasets for download, streaming interaction such as WFS/WCS, and analysis. Ultimately, most datasets are presented for visualization in "viewers" with basic navigation and interaction for inspection and even lightweight WebGIS like web service functions, annotations, etc. Many viewers–different APIs, clients, purposes, and niche functions–are invested in at USGS and DOI and the whole Federal Government. The solution is not "1 viewer" or "1 viewer API" - see the "Viewer Explosion Conundrum" below. We are stuck in a multiple viewer environment, we could recommend a few APIs, and restrict others at best. The problem with this is that when someone goes to a new viewer,...
This project leveraged existing efforts toward the use of social media systems for delivery of information into a web based visualization framework. Rather than support the development of an expensive system developed in-house, this project supports the use of cloud-based social media system Twitter to provide a robust observation platform. Development efforts were directed at utilizing the substantial Twitter API feature set to query the media stream for species observation submissions. Citizen science participants were encouraged to use the Twitter direct message system to submit species observations using a pre-defined schema. Observations were extracted from the Twitter stream and processed using geospatial,...
An additional product not originally on the funded activities list was organized for the newly-formed Science Data Coordinator Network. The Network invited 2 representatives from each of the USGS geographic areas, and several mission areas, to participate in a Data Management Workshop. The Workshop was organized by Viv Hutchison and taught by Tom Chatfield of the Bureau of Land Management. The course was very well received and requests to repeat the workshop have been numerous. This response emphasizes the need for data management training in the USGS, thus the creation of USGS specific training materials. Data management education in USGS is critical to building a culture that understands its importance and incorporates...
thumbnail
Over the past 40 years the National Wildlife Health center has collected wildlife health information from around the U.S. and beyond, amassing the world’s largest repository of wildlife-disease surveillance data. This project identified, characterized, and documented NWHC’s locally stored wildlife health datasets, a critical first step to migrating them to new laboratory- and public-facing data systems, such as the Wildlife Health Information Sharing Partnership-event reporting system. To accomplish this, we developed a systematic, standardized approach for collaborating with laboratory scientists to locate, define, and classify their long-term datasets so that they can be cleansed, archived, and mapped to new...
thumbnail
As one of the largest and oldest science organizations in the world, USGS has produced more than a century of earth science data, much of which is currently unavailable to the greater scientific community due to inaccessible or obsolescent media, formats, and technology. Tapping this vast wealth of “dark data” requires 1) a complete inventory of legacy data and 2) methods and tools to effectively evaluate, prioritize, and preserve the data with the greatest potential impact to society. Recognizing these truths and the potential value of legacy data, USGS has been investigating legacy data management and preservation since 2006, including the 2016 “DaR” project, which developed legacy data inventory and evaluation...
thumbnail
In the mid-1800s, tile-drains were installed in poorly-drained soils of topographic lows as water management to protect cropland during wet conditions; consequently, estimations of tile-drain location have been based on soil series. Most tile drains are in the Midwest, however each state has farms with tile and tile-drain density has increased in the last decade. Where tile drains quickly remove water from fields, groundwater and stream water interaction can change, affecting water availability and flooding. Nutrients and sediment can quickly travel to streams thru tile, contributing to harmful algal blooms and hypoxia in large water bodies. Tile drains are below the soil surface, about 1 m deep, but their location...
thumbnail
Prior to this project, data acquired from the USGS Unmanned Aircraft Systems (UAS) have been provided to requesting scientists but have not been made available to the broader USGS community, the U.S. Department of the Interior (DOI) bureaus, or the public at large. This project performed a pilot study and developed a strategy that is scalable to evolve into a permanent UAS data management capability. The goal is to make UAS datasets available over the Internet to the USGS, DOI, and public science communities by establishing robust data management strategies and integrating these data with other geospatial datasets in the existing infrastructure located at the USGS EROS Data Center. Principal Investigator : Jennifer...
The purpose of this project is to improve the USGS Publications Warehouse (Pubs Warehouse) so that a person can search for USGS publications by geographic region in addition to existing search criteria; for example, one could search using map zooms or congressional districts. The addition of geographic searches allows users to narrow their search results to specific areas of interest, which reduces the time required to sift through all results outside the area of interest. In FY 2014, the project team determined that the ScienceBase Footprint Studio would be an appropriate tool for creating the footprints for USGS publications and decided on the technical implementation for information exchange between ScienceBase...
The scientific legacy of the USGS is the data and the scientific knowledge derived from it gathered over 130 years of research. However, it is widely assumed, and in some cases known, that high quality data, particularly legacy data critical for large time-scale analyses such as climate change and habitat change, is hidden away in case files, file cabinets, and hard drives housed in USGS science centers and field stations (both hereafter “science centers”). Many USGS science centers, such as the Fort Collins Science Center, have long, established research histories, are known repositories of data sets, and conduct periodic “file room cleanout” days that establish and enforce some minimal data lifecycle management...
The USMIN Project is multi-year project of the USGS Mineral Resource Program (MRP) whose objective is to develop a comprehensive geospatial database of the mines, mineral deposits and mineral districts of the United States. This database builds upon MRP projects which date back to the late 1960’s and will provide data that will be of value to other parts of the USGS, other federal and state agencies and the general public. Mine Features, which are defined as “a single human-made object or disturbance associated with mining, such as a shaft or adit (vertical or horizontal opening), tailings, machinery and facilities, etc. A mine can be comprised of one or more features.” are a major component of the database....
This project sought to incorporate land-use land-cover time series data into the Geo Data Portal web service infrastructure. Till now, the Geo Data Portal has been used for relatively low spatial resolution downscaled climate data. Data from the EPA’s Integrated Climate and Land Use Scenarios was incorporated into a web service compatible with the Geo Data Portal and is now hosted in the Geo Data Portal catalog. Great progress was made in establishing connections between the Land-Processes Distributed Active Archive Center’s (LP-DAAC) holdings of MODIS satellite derived land cover data. At this time, the data is only available on development servers, but many issues were fixed and close partners are using the integration...
thumbnail
Global climate models are a key source of climate information and produce large amounts of spatially explicit data for various physical parameters. However, these projections have substantial uncertainties associated with them, and the datasets themselves can be difficult to work with. The project team created the first version (cst 0.1.0) of the Climate Futures Toolbox, an open source workflow in R that allows users to access downscaled climate projections data, clip data by spatial boundaries (shapefile), save the output, and generate summary tables and plots. A detailed R vignette guides users to easily generate derived variables in order to answer specific questions about their region of interest (e.g. how will...
The CDI Upload, Registry and Access Project was added as a core capability to ScienceBase, a data integration system sponsored by Core Science Systems. The following description is the abstract from the 2012-3874 USGS Fact Sheet: "As a leading science and information agency and in fulfillment of its mission to provide reliable scientific information to describe and understand the Earth, the U.S. Geological Survey (USGS) ensures that all scientific data are effectively hosted, adequately described, and appropriately accessible to scientists, collaborators, and the general public. To succeed in this task, the USGS established the Community for Data Integration (CDI) to address data and information management issues...
As research and management of natural resources shift from local to regional and national scales, the need for information about aquatic systems to be summarized to multiple scales is becoming more apparent. Recently, four federally funded national stream assessment efforts (USGS Aquatic GAP, USGS National Water-Quality Assessment Program, U.S. Environmental Protection Agency [EPA] StreamCat, and National Fish Habitat Partnership) identified and summarized landscape information into two hydrologically and ecologically significant scales of local and network catchments for the National Hydrography Dataset Plus (NHDPlus). These efforts have revealed a significant percentage of assessment funds being directed to the...
To make informed decisions, land managers require knowledge about the state of the ecosystems present. Vegetation structure is a key indicator of the state of forested systems; it influences habitat suitability, water quality and runoff, microclimate, and informs wildfire-related characteristics such as fuel loads, burn severity, and post-fire regeneration. Field data used to derive vegetation structure are limited in spatial and temporal extent. Alternatively, forest growth simulation models estimate vegetation structure, but do not capture all factors influencing vegetation growth. Assessment of vegetation structure can be improved by using observations to derive maps which can be used to calibrate modeled forest...


map background search result map search result map USGS Data at Risk: Expanding Legacy Data Inventory and Preservation Strategies Open-Source and Open-Workflow Climate Futures Toolbox for Adaptation Planning Open-Source and Open-Workflow Climate Futures Toolbox for Adaptation Planning