Skip to main content
Advanced Search

Filters: Tags: regression analysis (X) > partyWithName: U.S. Geological Survey (X)

44 results (16ms)   

View Results as: JSON ATOM CSV
thumbnail
This dataset summarizes various sedimentary data from ten U.S. Geological Survey (USGS) surface water sites across the contiguous United States. These sites include: 01648010 Rock Creek at Joyce Road, Washington, DC 05586300 Illinois River at Florence, Illinois 06731000 Cherry Creek below Cherry Creek Lake, Colorado 06807000 Missouri River at Nebraska City, Nebraska 06935965 Missouri River at St. Charles, Missouri 08374550 Rio Grande near Castolon, Texas 08375300 Rio Grande at Rio Grande Village, Big Bend National Park, Texas 09404200 Colorado River above Diamond Creek near Peach Springs, Arizona 11447650 Sacramento River at Freeport, California 12046260 Elwha River at diversion near Port Angeles, Washington The...
A generalized least squares (GLS) technique was used with at-site n-day low-flow time series and frequencies, from SWToolbox (Kiang and others, 2018), and 14 basin characteristics (basin perimeter, drainage area, maximum basin elevation, mean total annual reference evapotranspiration, groundwater head, length of longest flow path, minimum basin elevation, runoff-curve number, relief, ruggedness, slope ratio, percentage of hydrologic soil type B, available water capacity, and total length of streams in a basin) for 28 selected streamflow gaging stations in Puerto Rico to calculate regional n-day low-flow frequency regression equations for ungaged stream locations. The GLS approach assigns weights based on uncertainty...
thumbnail
Regional regression equations were calculated in Puerto Rico with generalized least squares techniques to estimate flood frequency statistics at ungaged locations using drainage area as the only explanatory variable. The island was divided into 2 regions to minimize residuals. The region division that resulted in lower and more balanced residuals runs primarily north-south near the center of the island, mostly along an 8-digit hydrologic unit code (HUC8) boundary. The division line runs through a HUC8 polygon on the southern end of the island, but care was taken to include entire watersheds and consideration was given where hydrologic and physiographic properties differed. This data release includes geographic information...
thumbnail
Site-specific multiple linear regression models were developed for one beach in Ohio (three discrete sampling sites) and one beach in Pennsylvania to estimate concentrations of Escherichia coli (E. coli) or the probability of exceeding the bathing-water standard for E. coli in recreational waters used by the public. Traditional culture-based methods are commonly used to estimate concentrations of fecal indicator bacteria, such as E. coli; however, results are obtained 18 to 24 hours post sampling and do not accurately reflect current water-quality conditions. Beach-specific mathematical models use environmental and water-quality variables that are easily and quickly measured as surrogates to estimate concentrations...
thumbnail
Three regression models were developed for describing variability of host-specific flux data in eight Great Lakes tributaries. Models include one for Lachnospiraceae flux, one for human bacteroides flux, and one for ruminant bacteroides flux. Models were developed using the R project for statistical computing with core functionality and the survival, smwrBase, and smwrQW packages. Predictor variables for these models are included in the data file and input files provided. These include sampling dates and times, rainfall depth, percent of watershed underlain by tile drainage, cattle population (for the ruminant marker model) and human population (for the human marker models). Sample collection and laboratory analyses...
thumbnail
This data release contains trend results computed on the basis of modeled and observed daily streamflows at 502 reference gages across the conterminous U.S. from October 1, 1983 through September 30, 2016. Modeled daily streamflows were computed using the deterministic Precipitation Runoff Modeling System (PRMS), and five statistical techniques: Nearest-Neighbor Drainage Area Ratio (NNDAR), Map-Correlation Drainage Area Ratio (MCDAR), Ordinary Kriging of the logarithms of discharge per unit area (OKDAR), Nearest-Neighbor nonlinear spatial interpolation using flow duration curves (NNQPPQ), and Map-Correlation nonlinear spatial interpolation using flow duration curves (MCQPPQ). Observed daily streamflow data for the...
thumbnail
Measures used to assess trends in the 10th, 50th, and 90th quantiles of annual peak streamflow from 1916-2015 at 2,683 U.S. Geological Survey stations and within 191 4-digit HUCs in the conterminous United States. Linear quantile regression was applied to the selected quantiles of log-transformed annual peak streamflow to represent trends for a range of flood frequencies from small, common floods to large, infrequent floods. Comparative trends in pairs of quantiles were characterized as coherent, convergent, or divergent by comparing the slopes of linear quantile regression equations.
thumbnail
These model archive summaries document the surrogate regression models developed to estimate 15-minute suspended-sediment concentrations at three streamgage sites in Colorado: Anthracite Creek above Mouth near Somerset, U.S. Geological Survey (USGS) site number 09132095; Muddy Creek above Paonia Reservoir, USGS site number 385903107210800; and North Fork Gunnison below Raven Gulch near Somerset, USGS site number 385553107243301. The methods used follow USGS guidance as referenced in relevant Office of Surface Water Technical Memorandum (TM) 2016.07 and Office of Water Quality TM 2016.10, USGS Techniques and Methods, book 3, chap. C5 (Landers and others, 2016), and (or) USGS Techniques and Methods, book 3, chap....
thumbnail
These processed data and provisional codes were created to investigate seismic velocity changes associated with the collapse of Kīlauea caldera during its 2018 eruption. Primary data (i.e., seismic waveforms) are hosted at the Incorporated Research Institutions for Seismology (IRIS; https://www.iris.edu/) and are ingested by the codes included here to reproduce the data analyzed in Hotovec-Ellis et al., 'Earthquake-derived seismic velocity changes during the 2018 caldera collapse of Kīlauea volcano.' The included code ('cwire' short for Coda Wave Interferometry with Repeating Earthquakes) takes a catalog of earthquakes clustered by waveform similarity (e.g., REDPy, https://github.com/ahotovec/REDPy/) and processes...
thumbnail
This data release presents catch and effort data for Cisco Coregonus artedi and Lake Whitefish Coregonus clupeaformis commercial gill net fisheries in State of Michigan waters of Lakes Superior, Michigan, and Huron during 1929-1970. The data were used to determine if Cisco and Lake Whitefish relative abundance (commercial gill net catch per effort) were correlated (positive and negative) during the historical period. The file is in .csv format and contains columns for: (1) lake (LAKE); (2) commercial fishery management unit (MU); (3) year (YEAR); (4) gill net material used to target Lake Whitefish (WF_MAT); (5) gill net material used to target Cisco (CS_MAT); (6) gill net conversion factors (multipliers) used...
thumbnail
This data release contains input data and programs (scripts) used to estimate monthly water demand for retail customers of Providence Water, located in Providence, Rhode Island. Explanatory data and model outputs are from July 2014 through June 2021. Models of per capita (for single-family residential customers) or per connection (for multi-family residential, commercial, and industrial customers) water use were developed using multiple linear regression. The dependent variables, provided by Providence Water, are the monthly number of connections and gallons of water delivered to single- and multi-family residential, commercial, and industrial connections. Potential independent variables (from online sources) are...
thumbnail
The U.S. Geological Survey (USGS), in cooperation with the Bureau of Land Management, used streamflow measurements at 11 partial-record sites and related them to nearby USGS or Idaho Power Company real-time streamgages (index sites) to provide daily mean streamflow values at ungaged (partial-record) sites within the Wild and Scenic River of the Owyhee Canyonlands Wilderness, Idaho. Daily mean streamflow was estimated by developing a regression relationship between streamflow at each partial-record site and the index site for the period of record of the index site. The regressions are then used to estimate annual and semimonthly 20-, 50-, and 80-percent exceedance probability streamflow statistics at each partial-record...
thumbnail
Peak-flow frequency analysis is crucial in various water-resources management applications, including floodplain management and critical structure design. Federal guidelines for peak-flow frequency analyses, provided in Bulletin 17C, assume that the statistical properties of the hydrologic processes driving variability in peak flows do not change over time and so the frequency distribution of annual peak flows is stationary. Better understanding of long-term climatic persistence and further consideration of potential climate and land-use changes have caused the assumption of stationarity to be reexamined. This data release contains input data and results of a study investigating hydroclimatic trends in peak streamflow...
thumbnail
This model archive summary documents the suspended-sediment concentration (SSC) model developed to estimate 15-minute SSC at Anthracite Creek above Mouth near Somerset, U.S. Geological Survey (USGS) site number 09132095. The methods used follow USGS guidance as referenced in relevant Office of Surface Water Technical Memorandum (TM) 2016.07 and Office of Water Quality TM 2016.10, USGS Techniques and Methods, book 3, chap. C5 (Landers and others, 2016), and USGS Techniques and Methods, book 3, chap. C4 (Rasmussen and others, 2009). A total of 399 suspended-sediment samples were collected during the calibration period (43 cross-section and 356 single-station samples). Forty-one of these samples with associated streamflow...
An ordinary least squares (OLS) technique was used with at-site flow-duration exceedance probabilities, from SWToolbox (Kiang and others, 2018), and 14 basin characteristics (basin perimeter, drainage area, maximum basin elevation, mean total annual reference evapotranspiration, groundwater head, length of longest flow path, minimum basin elevation, runoff-curve number, relief, ruggedness, slope ratio, percentage of hydrologic soil type B, available water capacity, and total length of streams in a basin) for 28 selected streamflow gaging stations in Puerto Rico to calculate regional flow-duration regression equations for ungaged stream locations. The final flow-duration regression equations were developed in R (R...
thumbnail
This dataset is one of many used in the development of the manuscript 'Advancing Cave Detection using Terrain Analysis Techniques and Thermal Imagery' by Wynne et al. 2021. Manuscript Abstract: Since the initial experiments nearly 50 years ago, techniques for detecting caves using airborne and spacecraft acquired thermal imagery have improved markedly. These advances are largely due to a combination of higher instrument sensitivity, modern computing systems, and processor-intensive analytical techniques. Through applying these advancements, our goals were to: (1) determine the utility of methods designed for terrain analysis and applied to thermal imagery; (2) analyze the usefulness of predawn and midday imagery...
thumbnail
This data release contains estimates of annual nitrate, suspended sediment, phosphorus, and chloride loads and uncertainty from sites in the Illinois Environmental Protection Agency (Illinois EPA) Ambient Water Quality Monitoring Network. The loads were estimated using Weighted Regressions on Time, Discharge, and Season with Kalman filtering (WRTDS-K) and existing discrete water-quality data and discharge data collected by Illinois EPA, the U.S. Geological Survey, and Illinois State Water Survey through water year (WY) 2020. All water-quality and discharge data used in this work are available from the Water Quality Portal and the National Water Information System. The data release comprises a single comma-separated...
thumbnail
For more than 100 years, the Permian Basin has been an important source of oil and gas produced from conventional reservoirs; directional drilling combined with hydraulic fracturing has greatly increased production in the past 10 years to the extent that the Permian Basin is becoming one of the world’s largest continuous oil and gas (COG) producing fields (U.S. Energy Information Administration, 2020). These recent techniques extract oil and gas by directionally drilling and hydraulically fracturing the surrounding reservoir rock. The extraction of COG by using these techniques requires large volumes of water and estimates of the total water volume used in COG require a comprehensive assessment to determine the...
thumbnail
Regression equations, which may be used to estimate flood flows at select annual exceedance probabilities, were developed for ungaged streams in Pennsylvania. The equations were developed using annual peak flow data through water year 2015 and basin characteristics for 285 streamflow gaging stations across Pennsylvania and surrounding states. The streamgages included active and discontinued continuous-record stations, as well as crest-stage partial-record stations, and required a minimum of 10 years of annual peak streamflow data for inclusion in the study. Explanatory variables significant at the 95-percent confidence level for one or more regression equations included the following basin characteristics: drainage...
thumbnail
This data release contains the inputs and outputs from the development of Escherichia coli (E. coli) concentration models for the four Missouri River sites near Omaha, Nebraska. Model development, models, and results are published in the associated U.S. Geological Survey (USGS) report Densmore and others, 2020. Load Estimator models (LOADEST) (Runkel and others, 2004) were developed from the first 5 years of data (July 2012–September 2016). Sites include Missouri River at NP Dodge Park at Omaha, Nebr. (USGS Site ID 412126095565201), Missouri River at Freedom Park Omaha, Nebr. (USGS Site ID 411636095535104), Missouri River near Council Bluffs, Iowa (USGS Site ID 06610505), and Missouri River near La Platte, Nebr....


map background search result map search result map Regression models and associated data for describing variability of host specific bacteria fluxes in eight Great Lakes tributaries, 2011-2013 Modeling Escherichia coli in the Missouri River near Omaha, Nebraska, 2012-16: Model Inputs and Outputs Data in Support of Development of Regression Equations for the Estimation of Flood Flows at Ungaged Streams in Pennsylvania (ver. 2.0, August 2020) Trends in annual peak streamflow quantiles for 2,683 U.S. Geological Survey streamgages in the conterminous United States Modeled and observed trends at reference basins in the conterminous U.S. from October 1, 1983 through September 30, 2016 Surrogate regression models for computation of time series suspended-sediment concentrations at Muddy Creek above Paonia Reservoir, Anthracite Creek above mouth near Somerset, and North Fork Gunnison below Raven Gulch near Somerset, Colorado Model Archive Summary for Suspended-Sediment Concentration at U.S. Geological Survey Site 09132095; Anthracite Creek above Mouth near Somerset, Colorado Data for Field Evaluation of the Sequoia Scientific LISST-ABS Acoustic Backscatter Sediment Sensor R scripts and results of estimated water use associated with continuous oil and gas development, Permian Basin, United States, 2010–19 (ver. 2.0, April, 2022) Regions for regional regression equations Data for multiple linear regression models for estimating Escherichia coli (E. coli) concentrations or the probability of exceeding the bathing-water standard at recreational sites in Ohio and Pennsylvania as part of the Great Lakes NowCast, 2019 Puerto Rico N-day Low-Flow Regression Files Puerto Rico Flow-Duration Regression Files WRTDS-K nutrient and sediment loads at Illinois EPA Ambient Water Quality Monitoring Network sites through water year 2020 Catch and Effort Data for Cisco and Lake Whitefish Commercial Gill Net Fisheries in State of Michigan Waters of Lakes Superior, Michigan, and Huron During 1929–1970 Streamflow regressions and annual and semimonthly exceedance probability statistics for wild and scenic rivers, Owyhee Canyonlands Wilderness, Idaho Time series of seismic velocity changes during the 2018 collapse of Kīlauea volcano derived from coda wave interferometry of repeating earthquakes Aircraft-Borne Thermal Imagery and Derived Terrain Analysis Layers, Pisgah Lava Field, California (COPY) Peak Streamflow Data, Climate Data, and Results from Investigating Hydroclimatic Trends and Climate Change Effects on Peak Streamflow in the Central United States, 1921–2020 Data for Regression Models to Estimate Water Use in Providence, Rhode Island, 2014-2021 Aircraft-Borne Thermal Imagery and Derived Terrain Analysis Layers, Pisgah Lava Field, California (COPY) Time series of seismic velocity changes during the 2018 collapse of Kīlauea volcano derived from coda wave interferometry of repeating earthquakes Data for Regression Models to Estimate Water Use in Providence, Rhode Island, 2014-2021 Modeling Escherichia coli in the Missouri River near Omaha, Nebraska, 2012-16: Model Inputs and Outputs Regions for regional regression equations Puerto Rico N-day Low-Flow Regression Files Puerto Rico Flow-Duration Regression Files Streamflow regressions and annual and semimonthly exceedance probability statistics for wild and scenic rivers, Owyhee Canyonlands Wilderness, Idaho Data for multiple linear regression models for estimating Escherichia coli (E. coli) concentrations or the probability of exceeding the bathing-water standard at recreational sites in Ohio and Pennsylvania as part of the Great Lakes NowCast, 2019 Data in Support of Development of Regression Equations for the Estimation of Flood Flows at Ungaged Streams in Pennsylvania (ver. 2.0, August 2020) WRTDS-K nutrient and sediment loads at Illinois EPA Ambient Water Quality Monitoring Network sites through water year 2020 Surrogate regression models for computation of time series suspended-sediment concentrations at Muddy Creek above Paonia Reservoir, Anthracite Creek above mouth near Somerset, and North Fork Gunnison below Raven Gulch near Somerset, Colorado Regression models and associated data for describing variability of host specific bacteria fluxes in eight Great Lakes tributaries, 2011-2013 R scripts and results of estimated water use associated with continuous oil and gas development, Permian Basin, United States, 2010–19 (ver. 2.0, April, 2022) Catch and Effort Data for Cisco and Lake Whitefish Commercial Gill Net Fisheries in State of Michigan Waters of Lakes Superior, Michigan, and Huron During 1929–1970 Peak Streamflow Data, Climate Data, and Results from Investigating Hydroclimatic Trends and Climate Change Effects on Peak Streamflow in the Central United States, 1921–2020 Data for Field Evaluation of the Sequoia Scientific LISST-ABS Acoustic Backscatter Sediment Sensor Trends in annual peak streamflow quantiles for 2,683 U.S. Geological Survey streamgages in the conterminous United States Modeled and observed trends at reference basins in the conterminous U.S. from October 1, 1983 through September 30, 2016