Multi-View Deep Learning for Mandibular Landmark Localization.
Authors
Affiliations (5)
Affiliations (5)
- Center of Digital Dentistry/Department of Prosthodontics, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology, Beijing 100081, China.
- Department of Oral and Maxillofacial Surgery, Peking University School of Stomatology, Beijing 100081, China.
- School of Computer Science & National Pilot Software Engineering School, BeiJing University of Post and Telecommunications, Beijing 100876, China.
- Center of Digital Dentistry/Department of Prosthodontics, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology, Beijing 100081, China. Electronic address: [email protected].
- Center of Digital Dentistry/Department of Prosthodontics, Peking University School and Hospital of Stomatology & National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Devices & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology, Beijing 100081, China. Electronic address: [email protected].
Abstract
Accurate localization of anatomical landmarks on the mandible is crucial for maxillofacial surgery and orthodontic treatment planning. This study aims to develop and validate a novel multi-view deep learning framework to enhance the accuracy and efficiency of landmark localization on CBCT-derived 3D mandibular surface models. We propose a multi-view stacked hourglass convolutional neural network (MVSH-CNN) that localizes 19 anatomical landmarks on 3D mandibular surface models reconstructed from cone beam computed tomography (CBCT) scans. A total of 140 mandibular scans from adult Han Chinese individuals were used, with 100 cases for training/validation and 40 cases (20 normal, 20 asymmetry) for independent testing. Manual annotations served as the reference standard. Localization performance was compared with the MeshMonk non-rigid registration method using Euclidean distance error and computational time. MVSH-CNN achieved a mean localization error of 1.13 ± 0.85 mm in the normal group and 1.10 ± 0.79 mm in the asymmetry group, significantly outperforming MeshMonk (1.42 ± 1.28 mm and 1.43 ± 1.14 mm, respectively; P < 0.05). Processing time per mandible was reduced from 356 seconds to 19.65 seconds. Over 94% of landmarks localized by MVSH-CNN had an error < 2 mm, meeting the predefined clinical threshold. The MVSH-CNN framework provides accurate, robust, and time-efficient semi-automated 3D landmark localization directly on STL-based mandibular models, outperforming conventional registration-based approaches. MVSH-CNN offers a semi-automated and clinically viable solution for digital orthodontic assessment, virtual surgical planning, and intelligent craniofacial analysis, significantly reducing manual workload while enhancing reproducibility and standardization.