Enhancing Flood Prediction in Urban Areas: A Machine Learning Approach for Makassar City
Received: 25 October 2024 | Revised: 04 December 2024 and 5 January 2025 | Accepted: 24 January 2025 | Online: 3 April 2025
Corresponding author: Mochamad Hariadi
Abstract
Accurate and rapid predictions regarding urban flooding, are essential in supporting risk mitigation efforts. Flood phenomena have the potential to cause extensive damage and disrupt the functions of economic and governmental sectors. However, these impacts can be minimized through comprehensive planning and preparation to reduce potential losses. Machine learning techniques have emerged as a promising method for predicting complex hydrological processes. This research develops a flood prediction model by comparing seven machine learning algorithms, namely Logistic Regression, Linear Discriminant Analysis, k-Nearest Neighbors, Gaussian Naive Bayes, Support Vector Machine, AdaBoost, and Random Forest. The results show that Random Forest has the highest performance, demonstrating the reliability of Random Forest in processing complex urban flood datasets. This model is expected to enhance disaster preparation and contribute significantly to flood risk management in urban areas.
Keywords:
flood prediction, early warning system, machine learning, random forest, Makassar CityDownloads
References
T. Panfilova et al., "Flood Susceptibility Assessment in Urban Areas via Deep Neural Network Approach," Sustainability, vol. 16, no. 17, Jan. 2024, Art. no. 7489.
R. Kondo, B. Du, Y. Narusue, and H. Morikawa, "Machine Learning Framework Supervised by Hydraulic Mechanical Models for Real-time Pluvial Flood Prediction," Journal of Information Processing, vol. 31, pp. 256–264, 2023.
L. Schmidt, F. Heße, S. Attinger, and R. Kumar, "Challenges in Applying Machine Learning Models for Hydrological Inference: A Case Study for Flooding Events Across Germany," Water Resources Research, vol. 56, no. 5, 2020, Art. no. e2019WR025924.
S. S. Mallela and S. K. Jonnalagadda, "Rainfall Prediction Based on Spatial Attention Layer: A Case Study Analysis," International Journal of Intelligent Engineering and Systems, vol. 15, no. 3, pp. 49–58, 2022.
W. M. Ridwan, M. Sapitang, A. Aziz, K. F. Kushiar, A. N. Ahmed, and A. El-Shafie, "Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia," Ain Shams Engineering Journal, vol. 12, no. 2, pp. 1651–1663, Jun. 2021.
C. Albertini, A. Gioia, V. Iacobellis, G. P. Petropoulos, and S. Manfreda, "Assessing multi-source random forest classification and robustness of predictor variables in flooded areas mapping," Remote Sensing Applications: Society and Environment, vol. 35, Aug. 2024, Art. no. 101239.
H. Yu, Z. Luo, L. Wang, X. Ding, and S. Wang, "Improving the Accuracy of Flood Susceptibility Prediction by Combining Machine Learning Models and the Expanded Flood Inventory Data," Remote Sensing, vol. 15, no. 14, Jan. 2023, Art. no. 3601.
E. H. Ighile, H. Shirakawa, and H. Tanikawa, "Application of GIS and Machine Learning to Predict Flood Areas in Nigeria," Sustainability, vol. 14, no. 9, Jan. 2022, Art. no. 5039.
Hanshika. V. G, Preethi. S, Sreemathy. S. P, and E. Freeda. S, "Flood Prediction Using Logistic Regression," in 2023 International Conference on Circuit Power and Computing Technologies (ICCPCT), Dec. 2023, pp. 1174–1179.
C. Vlachas et al., "Random forest classification algorithm for medical industry data," SHS Web of Conferences, vol. 139, 2022, Art. no. 03008.
Z. Zhu and Y. Zhang, "Flood disaster risk assessment based on random forest algorithm," Neural Computing and Applications, vol. 34, no. 5, pp. 3443–3455, Mar. 2022.
H. Sato, Y. Kimura, M. Ohba, Y. Ara, S. Wakabayashi, and H. Watanabe, "Prediction of Prednisolone Dose Correction Using Machine Learning," Journal of Healthcare Informatics Research, vol. 7, no. 1, pp. 84–103, Mar. 2023.
Y. Qian et al., "Classification of Imbalanced Travel Mode Choice to Work Data Using Adjustable SVM Model," Applied Sciences, vol. 11, no. 24, Jan. 2021, Art. no. 11916.
P. Deivendran, S. Selvakanmani, G. Malathi, and V. Vinoth Kumar, "Machine Learning based IoT Flood Rediction Using Data Modeling and Decision Support System," International Journal on Recent and Innovation Trends in Computing and Communication, vol. 11, no. 9, pp. 2489–2499, Nov. 2023.
J. Liang, Q. Liu, N. Nie, B. Zeng, and Z. Zhang, "An Improved Algorithm based on KNN and Random Forest," in 3rd International Conference on Computer Science and Application Engineering, Sanya, China, Oct. 2019, pp. 1–6.
Y. Xu, J. Zhao, B. Wan, J. Cai, and J. Wan, "Flood Forecasting Method and Application Based on Informer Model," Water, vol. 16, no. 5, Jan. 2024, Art. no. 765.
W. Dai, Y. Tang, Z. Zhang, and Z. Cai, "Ensemble Learning Technology for Coastal Flood Forecasting in Internet-of-Things-Enabled Smart City," International Journal of Computational Intelligence Systems, vol. 14, no. 1, Sep. 2021, Art. no. 166.
S. Law, P. Jeszenszky, and K. Yano, "Examining geographical generalisation of machine learning models in urban analytics through street frontage classification and house price regression," in GIScience 2021 Short Paper Proceedings, Sep. 2021.
E. Wolff and D. Ramirez-Lovering, "Living with Floods in Informal Settlements: Compounding and Cascading Risks in Makassar, Indonesia," in Complex Disasters: Compounding, Cascading, and Protracted, A. Lukasiewicz and T. O’Donnell, Eds. New York, NY, USA: Springer, 2022, pp. 161–181.
A. B. Amjoud and M. Amrouch, "Transfer Learning for Automatic Image Orientation Detection Using Deep Learning and Logistic Regression," IEEE Access, vol. 10, pp. 128543–128553, Jan. 2022.
H. P. Singh and H. N. Alhulail, "Predicting Student-Teachers Dropout Risk and Early Identification: A Four-Step Logistic Regression Approach," IEEE Access, vol. 10, pp. 6470–6482, Jan. 2022.
R. Madhuri, S. Sistla, and K. Srinivasa Raju, "Application of machine learning algorithms for flood susceptibility assessment and risk management," Journal of Water and Climate Change, vol. 12, no. 6, pp. 2608–2623, Apr. 2021.
C.-N. Li, Y.-H. Shao, Z. Wang, N.-Y. Deng, and Z.-M. Yang, "Robust Bhattacharyya bound linear discriminant analysis through an adaptive algorithm," Knowledge-Based Systems, vol. 183, Nov. 2019, Art. no. 104858.
C.-N. Li, Y. Li, Y.-H. Meng, P.-W. Ren, and Y.-H. Shao, "L₂,₁-Norm Regularized Robust and Sparse Linear Discriminant Analysis via an Alternating Direction Method of Multipliers," IEEE Access, vol. 11, pp. 34250–34259, Jan. 2023.
A. Dhurandhar and A. Dobra, "Probabilistic characterization of nearest neighbor classifier," International Journal of Machine Learning and Cybernetics, vol. 4, no. 4, pp. 259–272, Aug. 2013.
T. Liao, Z. Lei, T. Zhu, S. Zeng, Y. Li, and C. Yuan, "Deep Metric Learning for K Nearest Neighbor Classification," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 1, pp. 264–275, Jan. 2023.
K. Taunk, S. De, S. Verma, and A. Swetapadma, "A Brief Review of Nearest Neighbor Algorithm for Learning and Classification," in International Conference on Intelligent Computing and Control Systems, Madurai, India, Dec. 2019, pp. 1255–1260.
A. H. Jahromi and M. Taheri, "A non-parametric mixture of Gaussian naive Bayes classifiers based on local independent features," in Artificial Intelligence and Signal Processing Conference, Shiraz, Iran, Oct. 2017, pp. 209–212.
Y. Dani and M. A. Ginting, "Comparison of Iris dataset classification with Gaussian naïve Bayes and decision tree algorithms," International Journal of Electrical and Computer Engineering, vol. 14, no. 2, pp. 1959–1968, Apr. 2024.
N. A. F. Sulaiman, S. M. Shaharudin, S. Ismail, N. H. Zainuddin, M. L. Tan, and Y. Abd Jalil, "Predictive Modelling of Statistical Downscaling Based on Hybrid Machine Learning Model for Daily Rainfall in East-Coast Peninsular Malaysia," Symmetry, vol. 14, no. 5, May 2022, Art. no. 927.
P. Katta and N. Hegde, "A Hybrid Adaptive Neuro-Fuzzy Interface and Support Vector Machine Based Sentiment Analysis on Political Twitter Data," International Journal of Intelligent Engineering and Systems, vol. 12, no. 1, pp. 165–173, Feb. 2019.
Y.-S. Jeon, D.-H. Yang, and D.-J. Lim, "FlexBoost: A Flexible Boosting Algorithm With Adaptive Loss Functions," IEEE Access, vol. 7, pp. 125054–125061, Jan. 2019.
M. Liao, H. Wen, L. Yang, G. Wang, X. Xiang, and X. Liang, "Improving the model robustness of flood hazard mapping based on hyperparameter optimization of random forest," Expert Systems with Applications, vol. 241, May 2024, Art. no. 122682.
X. Guan, F. Yu, H. Xu, C. Li, and Y. Guan, "Flood risk assessment of urban metro system using random forest algorithm and triangular fuzzy number based analytical hierarchy process approach," Sustainable Cities and Society, vol. 109, Aug. 2024, Art. no. 105546.
H. Hairani, T. Widiyaningtyas, and D. D. Prasetya, "Addressing Class Imbalance of Health Data: A Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies," JOIV : International Journal on Informatics Visualization, vol. 8, no. 3, pp. 1310–1318, Sep. 2024.
M. Gava, P. M. Biron, and T. Buffin-Belanger, "A random forest machine learning model to detect fluvial hazards," River Research and Applications, vol. 40, no. 10, pp. 1837–1854, 2024.
H. Zhang, A. Zhou, and H. Zhang, "An Evolutionary Forest for Regression," IEEE Transactions on Evolutionary Computation, vol. 26, no. 4, pp. 735–749, Dec. 2022.
X. Fang, M. Liao, L. Yang, X. Jiang, J. Jin, and Q. Zhu, "Assessment of Flash Flood Risk Using Random Forest Regression Model Integrated with Binary Statistics," in IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium, Athens, Greece, Jul. 2024, pp. 3952–3955.
"BMKG Dataonline," BMKG Dataonline. https://dataonline.bmkg.go.id.
Downloads
How to Cite
License
Copyright (c) 2025 H. Muh Rizal, Mochamad Hariadi, Yunifa Miftachul Arif, Elly Warni

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.