Author Identifier (ORCID)

Hoda Khoshvaght: https://orcid.org/0000-0001-8766-419X

Amir Razmjou: https://orcid.org/0000-0002-3554-5129

Mehdi Khiadani: https://orcid.org/0000-0003-1703-9342

Abstract

While numerous machine learning (ML) models have been applied to wastewater treatment plant (WWTP) quality prediction tasks, significantly less attention has been paid to the selection and interpretation of performance evaluation metrics. Most studies rely on general-purpose regression metrics such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the Coefficient of Determination (R2). Although widely used, these metrics differ considerably in terms of interpretability, sensitivity to data anomalies, and suitability for dynamic, noisy environments like WWTPs. This review, based on a systematic literature analysis of 27 performance evaluation metrics, critically examines their theoretical foundations, strengths, limitations, and applicability within the context of supervised ML-based WWTP modeling. In addition to statistical metrics, it also explores complementary graphical techniques, such as residual or failure prediction plots, that offer deeper insights into model behavior, insights that purely numerical indicators may overlook. A significant contribution of this paper is the development of a practical, decision-guiding flowchart to assist researchers in selecting appropriate evaluation metrics based on dataset characteristics, modeling objectives, and project constraints. Additionally, it summarizes a reference toolkit of graphical methods that have been used in the literature to assess model performance beyond numerical indicators. Together, these resources not only promote more informed and transparent metric selection in research but also provide wastewater practitioners with actionable tools for interpreting model outputs, comparing predictive approaches, and identifying the models most suitable for reliable process monitoring and operational decision-making.

Document Type

Journal Article

Date of Publication

12-1-2025

Volume

13

Issue

6

Publication Title

Journal of Environmental Chemical Engineering

Publisher

Elsevier

School

Mineral Recovery Research Centre / School of Engineering

RAS ID

88082

Funders

Australian Government Research Training Program Scholarship / Water Corporation of Western Australia

Creative Commons License

Creative Commons Attribution 4.0 License
This work is licensed under a Creative Commons Attribution 4.0 License.

Comments

Khoshvaght, H., Permala, R. R., Razmjou, A., & Khiadani, M. (2025). A critical review on selecting performance evaluation metrics for supervised machine learning models in wastewater quality prediction. Journal of Environmental Chemical Engineering, 13(6), 119675. https://doi.org/10.1016/j.jece.2025.119675

Share

 
COinS
 

Link to publisher version (DOI)

10.1016/j.jece.2025.119675