Skip to main content
Advanced Search

Filters: Tags: regression model (X)

3 results (8ms)   

View Results as: JSON ATOM CSV
thumbnail
Ensemble-tree machine learning (ML) regression models can be prone to systematic bias: small values are overestimated and large values are underestimated. Additional bias can be introduced if the dependent variable is a transform of the original data. Six methods were evaluated for their ability to correct systematic and introduced bias: (1) empirical distribution matching (EDM); (2) regression of observed on estimated values (ROE); (3) linear transfer function (LTF); (4) linear equation based on Z-score transform (ZZ); (5) second machine learning model used to estimate residuals (ML2-RES); and (6) Duan smearing estimate applied after ROE is implemented (ROE-Duan). The performance of the methods was evaluated using...
thumbnail
These data were released prior to the October 1, 2016 effective date for the USGS’s policy dictating the review, approval, and release of scientific data as referenced in USGS Survey Manual Chapter 502.8 Fundamental Science Practices: Review and Approval of Scientific Data for Release. This data set consists of 204 drainage basin boundaries for U.S. Geological Survey's (USGS) stream sites sampled in the National Water Quality Assessment (NAWQA) Program and the National Stream Quality Accounting Network (NASQAN). These drainage basin boundaries are used to generate watershed characteristics for the development of water-quality models. The basin boundaries were collected from USGS hydrologists and geographers from...
thumbnail
These data were released prior to the October 1, 2016 effective date for the USGS’s policy dictating the review, approval, and release of scientific data as referenced in USGS Survey Manual Chapter 502.8 Fundamental Science Practices: Review and Approval of Scientific Data for Release. This data set contains 204 drainage basin boundaries for U.S. Geological Survey (USGS) stream sites analyzed in the Watershed Regressions for Pesticides (WARP) model. This dataset supercedes version 1.0 released in April 2010, and consists of revisions to 26 basin boundaries.


    map background search result map search result map Data Release for Evaluation of Six Methods for Correcting Bias in Estimates from Ensemble Tree Machine Learning Regression Models Drainage Basins used for Development of the Watershed Regressions for Pesticides (WARP) Model Drainage Basins used for Development of the Watershed Regressions for Pesticides (WARP) Model, 2012 Data Release for Evaluation of Six Methods for Correcting Bias in Estimates from Ensemble Tree Machine Learning Regression Models Drainage Basins used for Development of the Watershed Regressions for Pesticides (WARP) Model Drainage Basins used for Development of the Watershed Regressions for Pesticides (WARP) Model, 2012