EGU2020-12315, updated on 12 Jun 2020
https://doi.org/10.5194/egusphere-egu2020-12315
EGU General Assembly 2020
© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.

Regression equations of probability plot correlation coefficient test statistics using machine learning

Hyunjun Ahn, Sunghun Kim, Joohyung Lee, and Jun-Haeng Heo
Hyunjun Ahn et al.
  • Yonsei University, School of Civil and Environment Engneering, Seoul, Korea, Republic of (kamjakang@yonsei.ac.kr)

In the extremes hydrology field, it is essential to find the probability distribution model that is most appropriate for the sample data to estimate the reasonable probability quantile. Depending on the assumed probability distribution model, the probability quantile could be estimated with quite different values. The probability plot correlation coefficient (PPCC) test is one of the goodness-of-fit tests for finding suitable probability distributions for a given sample. The PPCC test determines whether assumed probability distributions are acceptable for the sample data using correlation coefficients between sample data and theoretical quantiles of assumed probability distributions. The critical values for identification are presented as a two-dimensional table, depending on the sample size and the shape parameters of models, for a three-parameter probability distribution. In this study, the applicability and utility of machine learning in the hydrology field were examined. For the usability of the PPCC test, a regression equation was derived using a machine learning algorithm with two variables: sample size and shape parameter.

How to cite: Ahn, H., Kim, S., Lee, J., and Heo, J.-H.: Regression equations of probability plot correlation coefficient test statistics using machine learning, EGU General Assembly 2020, Online, 4–8 May 2020, EGU2020-12315, https://doi.org/10.5194/egusphere-egu2020-12315, 2020