Improved estimation of extreme floods with data pooling and mixed probability distribution

Abinesh Ganapathy, David M. Hannah, Ankit Agarwal*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The accurate estimation of flood probability is crucial for designing water storage and flood retention structures. However, the assumption of identical distribution in flood samples is unrealistic, given the influence of various flood mechanisms. To address this challenge, we proposed a novel framework based on flood clustering and data pooling that encompasses the key steps such as 1) flood event separation based on a peak-detection flood separation algorithm, 2) grouping flood events using the k-prototypes algorithm, 3) application of the UNprecedented Simulated Extreme ENsemble (UNSEEN) approach to pool reforecast ensemble datasets, and 4) statistical mixing approach to derive common quantiles from all the flood groups. We applied the framework to the Dresden gauge in the Elbe River for a detailed case study. Various tests have been performed to assess the applicability of the UNSEEN approach and the reforecast dataset consistently shows the potential for data pooling. The proposed methodology outperformed the classical approach in terms of goodness-of-fit. The relative difference between the classical and the proposed approach ((classical-proposed)/proposed) for the 100-year return level is 0.16, with a reduction in root mean square error (RMSE) value from 163 to 98 m3/s. Further, replication of the approach to the gauges in North Germany exhibited a relative difference ranging from −0.3 to +0.15 and produced better estimates in terms of RMSE compared with the traditional model. In summary, the proposed framework offers a better estimation of flood probability by addressing the inherent sample inhomogeneity along with the inclusion of unprecedented flood samples.

Original languageEnglish
Article number130633
Number of pages18
JournalJournal of Hydrology
Volume629
Early online date13 Jan 2024
DOIs
Publication statusPublished - Feb 2024

Bibliographical note

Funding Information:
AG acknowledges the financial support from the Commonwealth Scholarship Commission, UK (INCN-2021-57). Additionally, AG and AA acknowledge the funding support from the COPREPARE project (https://ir.iitr.ac.in/COPREPARE/) funded by the University Grant Commission (UGC) and DAAD under the Indo-German Partnership in Higher Education (IGP2020-24/COPREPARE). We appreciate the editor and two anonymous reviewers for their feedback, which has improved the quality of the manuscript.

Publisher Copyright:
© 2024 Elsevier B.V.

Keywords

  • Data pooling
  • Flood clustering
  • Flood frequency analysis
  • Mixed distribution model
  • Sample heterogeneity
  • UNSEEN approach

ASJC Scopus subject areas

  • Water Science and Technology

Fingerprint

Dive into the research topics of 'Improved estimation of extreme floods with data pooling and mixed probability distribution'. Together they form a unique fingerprint.

Cite this