Audio Enhancement for Gamelan Instrument Recognition using Spectral Subtraction

Viga Laksa Hardjanto; . Wahyono

doi:10.48084/etasr.10181

Authors

Viga Laksa Hardjanto Department of Computer Science and Electronics, Faculty of Mathematics and Natural Sciences, Universitas Gadjah Mada, Yogyakarta, Indonesia
Wahyono Department of Computer Science and Electronics, Faculty of Mathematics and Natural Sciences, Universitas Gadjah Mada, Yogyakarta, Indonesia

Volume: 15 | Issue: 2 | Pages: 22042-22048 | April 2025 | https://doi.org/10.48084/etasr.10181

Received: 10 January 2025 | Revised: 13 February 2025 and 21 February 2025 | Accepted: 27 February 2025 | Online: 3 April 2025

Corresponding author: Wahyono

Abstract

Artificial intelligence has made significant progress in processing audio, text, and images, but noise remains a major challenge, especially in real-world audio data. This research presents a novel approach to improve audio classification by integrating noise reduction techniques with machine learning models. Focusing on the bonang barung, a traditional Javanese gamelan instrument, the study uses Mel Frequency Cepstral Coefficients (MFCC) and Mel spectrograms to identify the most effective features for classification, and the Multi-Layer Perceptron (MLP) model for the classification task. In addition, the spectral subtraction method is used to reduce noise, which resulted in significant improvements in audio quality, although some artifacts remain. The main contribution of this study is the integration of noise reduction with the MLP model to improve the classification performance. The MLP model successfully classified various bonang barung playing techniques, achieving a classification accuracy of 90% after noise reduction compared to 87.22% with noise, highlighting the importance of preprocessing steps, such as noise reduction. It is also demonstrated that MLP models can be a viable alternative to more complex deep learning models, such as CNN and RNN, for audio classification tasks. Overall, this research provides new insights into the role of noise reduction in audio analysis and offers potential advances in the field of audio classification.

Keywords:

pattern recognition, audio enhancement, MFCC, multi-layer perceptron

Downloads

Download data is not yet available.

References

Soeroso, Bagaimana bermain gamelan. Jakarta, Indonesia: Balai Pustaka, 1982.

J. Umamaheswari and A. Akila, "Improving Speech Recognition Performance using Spectral Subtraction with Artificial Neural Network," International Journal of Advanced Studies of Scientific Research, vol. 3, no. 11, pp. 214–219, 2018.

J. S. Ashwin and N. 92, Jan Manoharan, "Audio Denoising Based on Short Time Fourier Transform," Indonesian Journal of Electrical Engineering and Computer Science, vol. 9, no. 1, pp. 89–. 2018.

S. K. Shridhar, L. Doddimani, A. Hirekoppa, K. Kodliwad, and A. Viraktamath, "Speech Enhancement using Spectral Subtraction," International Journal of Engineering Research, vol. 10, no. 7, pp. 744–748, Jul. 2021.

Y. Yang, P. Liu, H. Zhou, and Y. Tian, "A Speech Enhancement Algorithm combining Spectral Subtraction and Wavelet Transform," in 2021 IEEE 4th International Conference on Automation, Electronics and Electrical Engineering, Shenyang, China, 2021, pp. 268–273.

A. A. Alasadi, T. H. Aldhayni, R. R. Deshmukh, A. H. Alahmadi, and A. S. Alshebami, "Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System," Engineering, Technology & Applied Science Research, vol. 10, no. 2, pp. 5547–5553, Apr. 2020.

K. S. Harshavardhan and Mahesh, "Urban sound classification using ANN," in 2022 International Interdisciplinary Humanitarian Conference for Sustainability, Bengaluru, India, 2022, pp. 1475–1480.

M. Shah, N. Pujara, K. Mangaroliya, L. Gohil, T. Vyas, and S. Degadwala, "Music Genre Classification using Deep Learning," in 2022 6th International Conference on Computing Methodologies and Communication, Erode, India, 2022, pp. 974–978.

R. Shah, P. Shah, C. Joshi, R. Jain, and R. Nikam, "Heartbeat Prediction using Mel Spectrogram and MFCC Value," in 2023 IEEE IAS Global Conference on Emerging Technologies, London, United Kingdom, 2023, pp. 1–5.

X. Zhou, K. Hu, and Z. Guan, "Environmental sound classification of western black-crowned gibbon habitat based on spectral subtraction and VGG16," in 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference, Chongqing, China, 2022, pp. 578–582.

H. A. Owida, A. Al-Ghraibah, and M. Altayeb, "Classification of Chest X-Ray Images using Wavelet and MFCC Features and Support Vector Machine Classifier," Engineering, Technology & Applied Science Research, vol. 11, no. 4, pp. 7296–7301, Aug. 2021.

V. L. Hardjanto, "Bonang Barung Instrument." Zenodo, Feb. 17, 2025.

M. S. Rao, O. Pavan Kalyan, N. N. Kumar, Md. Tasleem Tabassum, and B. Srihari, "Automatic Music Genre Classification Based on Linguistic Frequencies Using Machine Learning," in 2021 International Conference on Recent Advances in Mathematics and Informatics, Tebessa, Algeria, 2021, pp. 1–5.

Y.-H. Cheng, P.-C. Chang, and C.-N. Kuo, "Convolutional Neural Networks Approach for Music Genre Classification," in 2020 International Symposium on Computer, Consumer and Control, Taichung City, Taiwan, 2020, pp. 399–403.

J. K. Bhatia, R. D. Singh, and S. Kumar, "Music Genre Classification," in 2021 5th International Conference on Information Systems and Computer Networks, Mathura, India, 2021, pp. 1–4.

M. Rahmandani, H. A. Nugroho, and N. A. Setiawan, "Cardiac Sound Classification Using Mel-Frequency Cepstral Coefficients (MFCC) and Artificial Neural Network (ANN)," in 2018 3rd International Conference on Information Technology, Information System and Electrical Engineering, Yogyakarta, Indonesia, 2018, pp. 22–26.

X. Mu and C.-H. Min, "MFCC as Features for Speaker Classification using Machine Learning," in 2023 IEEE World AI IoT Congress, Seattle, WA, USA, 2023, pp. 0566–0570.

Vol. 15 (2025)	Vol. 7 (2017)
Vol. 14 (2024)	Vol. 6 (2016)
Vol. 13 (2023)	Vol. 5 (2015)
Vol. 12 (2022)	Vol. 4 (2014)
Vol. 11 (2021)	Vol. 3 (2013)
Vol. 10 (2020)	Vol. 2 (2012)
Vol. 9 (2019)	Vol. 1 (2011)
Vol. 8 (2018)

Audio Enhancement for Gamelan Instrument Recognition using Spectral Subtraction

Authors

Abstract

Keywords:

Downloads

References

Downloads

How to Cite

Metrics

License