Mask R-CNN (ResNet-50 vs ResNet-101): A Deep Learning Framework for Instance-Level Enamel Segmentation
Received: 21 November 2025 | Revised: 5 January 2026 and 20 January 2026 | Accepted: 23 January 2026 | Online: 8 February 2026
Corresponding author: Naveen Bettahalli
Abstract
Dental cavities constitute a major global health issue and must be diagnosed reliably to enable timely and effective treatment. The identification of dental caries at an early stage is essential, as lesions typically begin at the enamel surface and, over time, progress into the deeper tooth structures, including dentin and pulp. Advancements in dental imaging, combined with artificial intelligence-based methodologies, offer promising solutions for improving diagnostic accuracy and efficiency. Therefore, the present study evaluates the performance of Faster Region-based Convolutional Neural Network (Faster R-CNN) and Mask Region-based Convolutional Neural Network (Mask R-CNN) with ResNet-50 and ResNet-101 backbones for automatic enamel detection and segmentation. All models exhibited excellent detection performance, obtaining perfect Average Precision (AP) scores at IoU thresholds of 0.50 (AP50) and 0.75 (AP75). Faster R-CNN has achieved an AP of 95.92%, while both Mask R-CNN variants, ResNet-50 and ResNet-100, achieved near-perfect bounding box detection with an AP of approximately 99%. For segmentation, Mask R-CNN with a ResNet-50 backbone achieved an AP of 86.30%, whereas the deeper ResNet-101 backbone significantly improved segmentation performance, achieving an AP of 98.44%. These results demonstrate that the Mask R-CNN architecture surpasses Faster R-CNN in detection accuracy and provides superior segmentation performance. Overall, Mask R-CNN with a ResNet-101 backbone can be considered the most effective model for enamel detection and segmentation. Nevertheless, the proposed model should be improved and externally validated. This work can be further carried out to detect carious lesions in the enamel portion for early detection and treatment.
Keywords:
enamel, segmentation, deep learning, Mask R-CNNDownloads
References
Y. Liu et al., "Richer Convolutional Features for Edge Detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 8, pp. 1939–1946, Aug. 2019. DOI: https://doi.org/10.1109/TPAMI.2018.2878849
S. Kanagamalliga, R. Jayashree, and M. Chouksey, "Fast R-CNN Approaches for Transforming Dental Caries Detection: An In-Depth Investigation," in 2024 International Conference on Wireless Communications Signal Processing and Networking (WiSPNET), Chennai, India, Mar. 2024, pp. 1–5. DOI: https://doi.org/10.1109/WiSPNET61464.2024.10532984
R. Hemalatha, G. Amulya, and Ch. S. N. S. Lalitha, "Computer Vision Techniques for Object Detection," in 2024 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems, Chennai, India, Dec. 2024, pp. 1–6. DOI: https://doi.org/10.1109/ICSES63760.2024.10910496
J. Sun and K. Jia, "Research on Semantic Segmentation of Ground- Based Cloud Image Based on Fully Convolutional Network," in China Automation Congress, Xiamen, China, Nov. 2022, pp. 5850–5854. DOI: https://doi.org/10.1109/CAC57257.2022.10055648
Y. E. Kang, W. Kang, T. Lee, and H. S. Chwa, "Paste-and-Cut: Collective Image Localization and Classification for Real-Time Multi-Camera Object Detection," in 2023 14th International Conference on Information and Communication Technology Convergence, Jeju Island, Republic of Korea, Oct. 2023, pp. 740–742. DOI: https://doi.org/10.1109/ICTC58733.2023.10393851
S. Gobinath, M. Almusawi, G. Shalini, B. Pruthviraj Goud, and K. Sudheer Kumar, "Object Detection in High-Resolution Aerial Images Using Single Shot Multi-Box Detector," in 2024 International Conference on Data Science and Network Security, Tiptur, India, Jul. 2024, pp. 1–4. DOI: https://doi.org/10.1109/ICDSNS62112.2024.10691194
K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask R-CNN," in 2017 IEEE International Conference on Computer Vision, Venice, Italy, Oct. 2017, pp. 2980–2988. DOI: https://doi.org/10.1109/ICCV.2017.322
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 834–848, Apr. 2018. DOI: https://doi.org/10.1109/TPAMI.2017.2699184
Y. Gao, X. Wang, and Q. Li, "Improved Cervical Cell Segmentation Algorithm Based on Cascade Mask RCNN," in 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference, Chongqing, China, Sept. 2023, pp. 1775–1780. DOI: https://doi.org/10.1109/ITOEC57671.2023.10291487
D. A. Agneya, M. S. Shekar, A. Bharadwaj, N. Vineeth, and M. L. Neelima, "Deep Learning in Medical Image Analysis: A Survey," in 2024 International Conference on Innovation and Novelty in Engineering and Technology, Vijayapura, India, Dec. 2024, pp. 1–5. DOI: https://doi.org/10.1109/INNOVA63080.2024.10847040
R. Nambiar and R. Nanjundegowda, "Detection of Missing Tooth Regions Using Deep Learning in Panoramic Radiographs for Dental Implant Planning," Engineering, Technology & Applied Science Research, vol. 15, no. 5, pp. 28071–28076, Oct. 2025. DOI: https://doi.org/10.48084/etasr.13101
D. Bolya, C. Zhou, F. Xiao, and Y. J. Lee, "YOLACT: Real-Time Instance Segmentation," in 2019 IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, Oct. 2019, pp. 9156–9165. DOI: https://doi.org/10.1109/ICCV.2019.00925
L. V. Comia and E. D. Festijo, "Performance Analysis of Original Implementation of ResNet50-Mask-RCNN using Transfer Learning: A Benchmark Data for Backbone-Improved Based Future Comparative Studies," in 2024 28th International Conference on Information Technology, Zabljak, Montenegro, Feb. 2024, pp. 1–6. DOI: https://doi.org/10.1109/IT61232.2024.10475763
D. De Geus and G. Dubbelman, "Task-Aligned Part-Aware Panoptic Segmentation Through Joint Object-Part Representations," in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, Jun. 2024, pp. 3174–3183. DOI: https://doi.org/10.1109/CVPR52733.2024.00306
V. Singh, K. Singh, R. Sehrawat, A. P. Singh, J. Augustine, and M. Verma, "Deep Learning Techniques for Dental Caries Detection and Prosthodontics: A Systematic Review," in 2025 2nd International Conference on Computational Intelligence, Communication Technology and Networking, Ghaziabad, India, Feb. 2025, pp. 651–655. DOI: https://doi.org/10.1109/CICTN64563.2025.10932642
Y. Zhu et al., "Faster-RCNN Based Intelligent Detection and Localization of Dental Caries," Displays, vol. 74, Sept. 2022, Art. no. 102201. DOI: https://doi.org/10.1016/j.displa.2022.102201
H. Mohammad-Rahimi et al., "Deep Learning for Caries Detection: A Systematic Review," Journal of Dentistry, vol. 122, July 2022, Art. no. 104115. DOI: https://doi.org/10.1016/j.jdent.2022.104115
T.-J. Lin et al., "Evaluation of the Alveolar Crest and Cemento-Enamel Junction in Periodontitis Using Object Detection on Periapical Radiographs," Diagnostics, vol. 14, no. 15, Aug. 2024, Art. no. 1687. DOI: https://doi.org/10.3390/diagnostics14151687
J.-H. Lee, D.-H. Kim, S.-N. Jeong, and S.-H. Choi, "Detection and Diagnosis of Dental Caries Using a Deep Learning-based Convolutional Neural Network Algorithm," Journal of Dentistry, vol. 77, pp. 106–111, Oct. 2018. DOI: https://doi.org/10.1016/j.jdent.2018.07.015
A. AlSayyed et al., "Employing CNN Ensemble Models in Classifying Dental Caries Using Oral Photographs," International Journal of Data and Network Science, vol. 7, no. 4, pp. 1535–1550, 2023. DOI: https://doi.org/10.5267/j.ijdns.2023.8.009
O. Khattak et al., "Deep Learning Applications in Dental Image-Based Diagnostics: A Systematic Review," Healthcare, vol. 13, no. 12, June 2025, Art. no. 1466. DOI: https://doi.org/10.3390/healthcare13121466
F. Schwendicke et al., "Cost-effectiveness of Artificial Intelligence for Proximal Caries Detection," Journal of Dental Research, vol. 100, no. 4, pp. 369–376, Apr. 2021. DOI: https://doi.org/10.1177/0022034520972335
D. S. Kermany et al., "Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning," Cell, vol. 172, no. 5, pp. 1122-1131, Feb. 2018.
M. He et al., "Lightweight Mask R-CNN for Instance Segmentation and Particle Physical Property Analysis in Multiphase Flow," Powder Technology, vol. 449, Jan. 2025, Art. no. 120366. DOI: https://doi.org/10.1016/j.powtec.2024.120366
Downloads
How to Cite
License
Copyright (c) 2026 Nandeesh Mahadevu, Naveen Bettahalli, Srividya Chandagirikoppal Nagendra

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
