Audio Enhancement for Gamelan Instrument Recognition using Spectral Subtraction
Received: 10 January 2025 | Revised: 13 February 2025 and 21 February 2025 | Accepted: 27 February 2025 | Online: 3 April 2025
Corresponding author: Wahyono
Abstract
Artificial intelligence has made significant progress in processing audio, text, and images, but noise remains a major challenge, especially in real-world audio data. This research presents a novel approach to improve audio classification by integrating noise reduction techniques with machine learning models. Focusing on the bonang barung, a traditional Javanese gamelan instrument, the study uses Mel Frequency Cepstral Coefficients (MFCC) and Mel spectrograms to identify the most effective features for classification, and the Multi-Layer Perceptron (MLP) model for the classification task. In addition, the spectral subtraction method is used to reduce noise, which resulted in significant improvements in audio quality, although some artifacts remain. The main contribution of this study is the integration of noise reduction with the MLP model to improve the classification performance. The MLP model successfully classified various bonang barung playing techniques, achieving a classification accuracy of 90% after noise reduction compared to 87.22% with noise, highlighting the importance of preprocessing steps, such as noise reduction. It is also demonstrated that MLP models can be a viable alternative to more complex deep learning models, such as CNN and RNN, for audio classification tasks. Overall, this research provides new insights into the role of noise reduction in audio analysis and offers potential advances in the field of audio classification.
Keywords:
pattern recognition, audio enhancement, MFCC, multi-layer perceptronDownloads
References
Soeroso, Bagaimana bermain gamelan. Jakarta, Indonesia: Balai Pustaka, 1982.
J. Umamaheswari and A. Akila, "Improving Speech Recognition Performance using Spectral Subtraction with Artificial Neural Network," International Journal of Advanced Studies of Scientific Research, vol. 3, no. 11, pp. 214–219, 2018.
J. S. Ashwin and N. 92, Jan Manoharan, "Audio Denoising Based on Short Time Fourier Transform," Indonesian Journal of Electrical Engineering and Computer Science, vol. 9, no. 1, pp. 89–. 2018.
S. K. Shridhar, L. Doddimani, A. Hirekoppa, K. Kodliwad, and A. Viraktamath, "Speech Enhancement using Spectral Subtraction," International Journal of Engineering Research, vol. 10, no. 7, pp. 744–748, Jul. 2021.
Y. Yang, P. Liu, H. Zhou, and Y. Tian, "A Speech Enhancement Algorithm combining Spectral Subtraction and Wavelet Transform," in 2021 IEEE 4th International Conference on Automation, Electronics and Electrical Engineering, Shenyang, China, 2021, pp. 268–273.
A. A. Alasadi, T. H. Aldhayni, R. R. Deshmukh, A. H. Alahmadi, and A. S. Alshebami, "Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System," Engineering, Technology & Applied Science Research, vol. 10, no. 2, pp. 5547–5553, Apr. 2020.
K. S. Harshavardhan and Mahesh, "Urban sound classification using ANN," in 2022 International Interdisciplinary Humanitarian Conference for Sustainability, Bengaluru, India, 2022, pp. 1475–1480.
M. Shah, N. Pujara, K. Mangaroliya, L. Gohil, T. Vyas, and S. Degadwala, "Music Genre Classification using Deep Learning," in 2022 6th International Conference on Computing Methodologies and Communication, Erode, India, 2022, pp. 974–978.
R. Shah, P. Shah, C. Joshi, R. Jain, and R. Nikam, "Heartbeat Prediction using Mel Spectrogram and MFCC Value," in 2023 IEEE IAS Global Conference on Emerging Technologies, London, United Kingdom, 2023, pp. 1–5.
X. Zhou, K. Hu, and Z. Guan, "Environmental sound classification of western black-crowned gibbon habitat based on spectral subtraction and VGG16," in 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference, Chongqing, China, 2022, pp. 578–582.
H. A. Owida, A. Al-Ghraibah, and M. Altayeb, "Classification of Chest X-Ray Images using Wavelet and MFCC Features and Support Vector Machine Classifier," Engineering, Technology & Applied Science Research, vol. 11, no. 4, pp. 7296–7301, Aug. 2021.
V. L. Hardjanto, "Bonang Barung Instrument." Zenodo, Feb. 17, 2025.
M. S. Rao, O. Pavan Kalyan, N. N. Kumar, Md. Tasleem Tabassum, and B. Srihari, "Automatic Music Genre Classification Based on Linguistic Frequencies Using Machine Learning," in 2021 International Conference on Recent Advances in Mathematics and Informatics, Tebessa, Algeria, 2021, pp. 1–5.
Y.-H. Cheng, P.-C. Chang, and C.-N. Kuo, "Convolutional Neural Networks Approach for Music Genre Classification," in 2020 International Symposium on Computer, Consumer and Control, Taichung City, Taiwan, 2020, pp. 399–403.
J. K. Bhatia, R. D. Singh, and S. Kumar, "Music Genre Classification," in 2021 5th International Conference on Information Systems and Computer Networks, Mathura, India, 2021, pp. 1–4.
M. Rahmandani, H. A. Nugroho, and N. A. Setiawan, "Cardiac Sound Classification Using Mel-Frequency Cepstral Coefficients (MFCC) and Artificial Neural Network (ANN)," in 2018 3rd International Conference on Information Technology, Information System and Electrical Engineering, Yogyakarta, Indonesia, 2018, pp. 22–26.
X. Mu and C.-H. Min, "MFCC as Features for Speaker Classification using Machine Learning," in 2023 IEEE World AI IoT Congress, Seattle, WA, USA, 2023, pp. 0566–0570.
Downloads
How to Cite
License
Copyright (c) 2025 Viga Laksa Hardjanto, Wahyono

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.