<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN" "JATS-journalpublishing1-4.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.4" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">ojoph</journal-id>
      <journal-title-group>
        <journal-title>Open Journal of Ophthalmology</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2165-7416</issn>
      <issn pub-type="ppub">2165-7408</issn>
      <publisher>
        <publisher-name>Scientific Research Publishing</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.4236/ojoph.2026.162012</article-id>
      <article-id pub-id-type="publisher-id">ojoph-150379</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
        <subj-group>
          <subject>Medicine</subject>
          <subject>Healthcare</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>H-UQ-MFF: Hybrid Uncertainty-Aware Multi-Feature Fusion for Clinically-Translatable Glaucoma Detection with FDA-Compliant Validation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Mettu</surname>
            <given-names>Venkata Akhil</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Obiliachigari</surname>
            <given-names>Sree Charitha</given-names>
          </name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Mandali</surname>
            <given-names>Siri Pranitha</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Obiliachigari</surname>
            <given-names>Sai Charan Reddy</given-names>
          </name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
      </contrib-group>
      <aff id="aff1"><label>1</label> Independent Researcher, Houston, USA </aff>
      <aff id="aff2"><label>2</label> Independent Researcher, Chennai, India </aff>
      <aff id="aff3"><label>3</label> Independent Researcher, Franklin Park, NJ, USA </aff>
      <author-notes>
        <fn fn-type="conflict" id="fn-conflict">
          <p>The authors declare no conflicts of interest regarding the publication of this paper.</p>
        </fn>
      </author-notes>
      <pub-date pub-type="epub">
        <day>19</day>
        <month>03</month>
        <year>2026</year>
      </pub-date>
      <pub-date pub-type="collection">
        <month>03</month>
        <year>2026</year>
      </pub-date>
      <volume>16</volume>
      <issue>02</issue>
      <fpage>104</fpage>
      <lpage>136</lpage>
      <history>
        <date date-type="received">
          <day>07</day>
          <month>02</month>
          <year>2026</year>
        </date>
        <date date-type="accepted">
          <day>21</day>
          <month>03</month>
          <year>2026</year>
        </date>
        <date date-type="published">
          <day>24</day>
          <month>03</month>
          <year>2026</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2026 by the authors and Scientific Research Publishing Inc.</copyright-statement>
        <copyright-year>2026</copyright-year>
        <license license-type="open-access">
          <license-p> This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link> ). </license-p>
        </license>
      </permissions>
      <self-uri content-type="doi" xlink:href="https://doi.org/10.4236/ojoph.2026.162012">https://doi.org/10.4236/ojoph.2026.162012</self-uri>
      <abstract>
        <p>One of the biggest causes of permanent blindness in the world today is glaucoma, and effective treatment depends on early detection. Although artificial intelligence (AI) has demonstrated potential in automating glaucoma screening, there is still a significant obstacle in transferring research datasets to actual clinical settings. When applied to clinical data, current models show an 8% - 18% performance loss, which is mostly caused by out-of-distribution samples, demographic bias, and poor imaging quality. We provide a clinically-translatable glaucoma detection paradigm that makes use of multi-modal fusion, uncertainty quantification, and the EyePACS-AIROGS-light-V2 dataset in order to overcome these difficulties. Our method, called H-UQ-MFF (Hybrid Uncertainty-Aware Multi-Feature Fusion), combines structural and texture characteristics from optic disc analysis with deep features from ResNet50 while dynamically weighting contributions according to prediction uncertainty. With an AUC of 0.9969, sensitivity of 0.9811, and specificity of 0.9717, internal validation outperforms ResNet50, EfficientNet-B0, and Deep Ensemble baselines. Generalizability is confirmed by external validation on REFUGE and PAPILA datasets, where H-UQ-MFF outperforms cutting-edge models and lowers calibration error. Beyond technical performance, the framework ensures clinical safety and regulatory preparedness by incorporating drift tracking techniques, bias analysis, and FDA-compliant evaluation processes. This work bridges the gap between research innovation and clinical deployment by establishing a repeatable baseline for AI translation in ophthalmology.</p>
      </abstract>
      <kwd-group kwd-group-type="author-generated" xml:lang="en">
        <kwd>Glaucoma AI</kwd>
        <kwd>EyePACS-AIROGS-Light-V2</kwd>
        <kwd>Uncertainty Quantification</kwd>
        <kwd>Clinical Translation</kwd>
        <kwd>Deep Learning</kwd>
        <kwd>Ophthalmology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
      <title>1. Introduction</title>
      <p>Over 76 million individuals worldwide suffer from glaucoma, a progressive optic neuropathy that damages optic nerves and impairs vision. By 2040, that number is expected to rise to over 111 million. Early detection is essential to prevent irreparable blindness because its asymptomatic nature frequently delays diagnosis [<xref ref-type="bibr" rid="B1">1</xref>]. While AI provides scalable, automated image interpretation for large-scale screening, traditional screening techniques are labor-intensive and prone to variability [<xref ref-type="bibr" rid="B2">2</xref>]. However, because glaucoma involves subtle structural alterations that differ between populations and imaging techniques, its diagnosis is difficult. Consequently, in real-world clinical situations, AI models trained on public datasets frequently exhibit an 8% - 18% performance drop [<xref ref-type="bibr" rid="B3">3</xref>]. The significance of cross-dataset validation and consistent evaluation methodologies was emphasized in the AIROGS Challenge (Artificial Intelligence for Robust Glaucoma Screening), which was published in IEEE Transactions on Medical Imaging [<xref ref-type="bibr" rid="B4">4</xref>]. Even though the best models had AUCs of about 0.90, their clinical usefulness was still restricted because they lacked bias analysis, uncertainty quantification, and regulatory readiness [<xref ref-type="bibr" rid="B5">5</xref>]. The need to include AI into clinical workflows with precise criteria for FDA compliance, risk management, and post-market monitoring was further highlighted by recent work on clinical translation frameworks [<xref ref-type="bibr" rid="B6">6</xref>]. Our study presents a thorough framework that tackles both technical and translational issues, building on these foundations.</p>
      <p>The proposed framework uses the EyePACS-AIROGS-light-V2 dataset with balanced training, validation, and test sets for reproducible glaucoma detection [<xref ref-type="bibr" rid="B7">7</xref>]. A multi-stage pipeline enhances robustness through optic disc preprocessing, deep and structural feature extraction, Monte Carlo-based uncertainty quantification, and an uncertainty-aware fusion strategy (H-UQ-MFF) with temperature scaling [<xref ref-type="bibr" rid="B8">8</xref>]. Internally, H-UQ-MFF surpasses baseline models, achieving AUC 0.9969 and F1 0.9780. Ablation results highlight the value of uncertainty-aware fusion. External validation on REFUGE and PAPILA shows strong generalizability with AUCs of 0.89 and 0.87, while calibration error is reduced to 0.028 and 0.032, supporting reliable clinical deployment.</p>
      <p>Our system includes crucial translational elements in addition to technical performance. Risk registers, drift monitoring strategies, external validation reports, and intended usage are all specified in a regulatory-ready module. Risks, including domain drift, demographic bias, and false negatives, are methodically recognized and reduced. Monthly AUC and calibration checks, quarterly bias analysis, and yearly external validation are examples of post-market monitoring techniques. Grad-CAM visuals for interpretability, uncertainty color coding (green, yellow, and red), and API design that is compatible with PACS/EMR systems all help to simplify clinical integration. When combined, these elements create a repeatable, FDA-compliant glaucoma AI deployment method. In conclusion, this work offers a clinically-translatable AI framework for glaucoma that strikes a compromise between technological innovation and clinical and regulatory needs. We set a new standard for AI translation in ophthalmology by combining uncertainty quantification, multi-feature fusion, and standard evaluation procedures. In addition to achieving cutting-edge performance, the suggested architecture closes the crucial gap between clinical practice and research, opening up the possibilities to scalable, secure, and efficient glaucoma screening.</p>
    </sec>
    <sec id="sec2">
      <title>2. Related Works</title>
      <sec id="sec2dot1">
        <title>2.1. AI in Ophthalmology</title>
        <p>In ophthalmology, artificial intelligence has made significant strides [<xref ref-type="bibr" rid="B9">9</xref>], especially in the identification of glaucoma, age-related macular degeneration, and diabetic retinopathy. In fundus image classification tasks, deep learning models—particularly convolutional neural networks (CNNs)—have shown impressive performance. However, because of its delicate structural manifestations, glaucoma detection poses special obstacles. Glaucoma necessitates examination of optic disc shape, cup-to-disc ratio (CDR), rim thickness, and nerve fiber layer integrity, in contrast to diabetic retinopathy, which is marked by obvious lesions such as microaneurysms and hemorrhages [<xref ref-type="bibr" rid="B10">10</xref>]. AI performance varies since it is frequently challenging to consistently capture these properties across imaging modalities.</p>
      </sec>
      <sec id="sec2dot2">
        <title>2.2. The AIROGS Challenge</title>
        <p>A crucial standard for assessing AI-based glaucoma screening systems was developed by the AIROGS Challenge (Artificial Intelligence for Robust Glaucoma Screening), which was published in IEEE Transactions on Medical Imaging [<xref ref-type="bibr" rid="B4">4</xref>]. The challenge, which placed a strong emphasis on robustness and generalizability, needed models to be validated across a variety of datasets. The best methods produced AUCs of roughly 0.90, with sensitivity and specificity close to 0.85. Despite these encouraging results, AIROGS pointed out a number of drawbacks, such as the lack of regulatory readiness elements like FDA-aligned evaluation protocols, limited bias analysis across demographic groups and imaging devices, and the absence of uncertainty quantification—a critical component for clinical decision-making [<xref ref-type="bibr" rid="B11">11</xref>]. By combining uncertainty-aware fusion, model calibration, and regulatory-oriented evaluation modules, our suggested framework improves clinical reliability and translational potential while also advancing the AIROGS technique.</p>
      </sec>
      <sec id="sec2dot3">
        <title>2.3. Clinical Translation Frameworks</title>
        <p>The significance of smooth workflow integration and adherence to regulatory norms has been highlighted by recent work on the clinical translation of AI in ophthalmology. Practical methods for integrating AI systems into standard clinical settings were described in the paper “Augmented Decisions: AI-Enhanced Accuracy in Glaucoma Diagnosis and Treatment” [<xref ref-type="bibr" rid="B12">12</xref>]. It emphasized the necessity of implementing uncertainty quantification to assist clinicians in making decisions, making sure that FDA Good Machine Learning Practice (GMLP) is followed, and creating user interfaces that clearly convey risk and uncertainty [<xref ref-type="bibr" rid="B12">12</xref>]. Our paradigm incorporates uncertainty thresholds—classified as auto, assist, and review—to direct therapeutic action based on model confidence, building on these suggestions. Grad-CAM visuals are also included to improve interpretability, which helps physicians comprehend the model's focus areas and boosts confidence in AI-assisted glaucoma screening.</p>
      </sec>
      <sec id="sec2dot4">
        <title>2.4. Generalization Studies</title>
        <p>The constrained generalizability of ophthalmic AI systems is a fundamental obstacle to their clinical translation. Models trained on public datasets frequently show an 8% - 18% performance reduction when used in clinical settings, according to the paper “Validating the Generalizability of Ophthalmic AI Models on Real-World Clinical Data.” [<xref ref-type="bibr" rid="B13">13</xref>] Domain drift brought on by changing imaging technology, low-quality photos impacted by blur or inadequate illumination and out-of-distribution inputs coming from differences in camera types and patient demographics are some of the factors driving this degradation. Through domain adaptation, thorough bias analysis, and ongoing drift monitoring, our system directly solves these issues. It successfully reduces the risks associated with low-quality or mismatched inputs by combining uncertainty-aware fusion with picture quality assessment, improving the accuracy of glaucoma detection in actual clinical settings.</p>
      </sec>
      <sec id="sec2dot5">
        <title>2.5. Feature Extraction Approaches</title>
        <p>Conventional glaucoma identification relied on manually created parameters, including the ISNT rule, rim thickness, and CDR. Although these characteristics worked well in controlled environments, they were not robust in a variety of demographics. Models may now immediately learn complicated representations from images thanks to deep learning’s introduction of automatic feature extraction [<xref ref-type="bibr" rid="B14">14</xref>][<xref ref-type="bibr" rid="B15">15</xref>]. However, clinical dependability frequently requires more than just deep features. Although hybrid techniques that combine deep and structural features have demonstrated potential, they run the danger of overconfidence on subpar inputs if uncertainty is not quantified [<xref ref-type="bibr" rid="B16">16</xref>]. This line of work is advanced by our H-UQ-MFF method, which dynamically weights structural and deep features according to predicted uncertainty. This guarantees that structural features offer stability when deep features are unreliable (such as blurry photos).</p>
      </sec>
      <sec id="sec2dot6">
        <title>2.6. Uncertainty Quantification in Medical AI</title>
        <p>One of the most important aspects of medical AI is uncertainty quantification. Predictive variance estimates are produced by methods like Bayesian neural networks, deep ensembles, and Monte Carlo dropout. Because misdiagnosis carries such a significant risk, uncertainty is especially crucial in ophthalmology [<xref ref-type="bibr" rid="B17">17</xref>]. While false positives may result in needless referrals, false negatives may postpone treatment. AI systems can reduce risk by identifying situations that need human assessment by assessing uncertainty [<xref ref-type="bibr" rid="B18">18</xref>]. Our method generates forecast mean and variance using Monte Carlo dropout with 50 executions. Clinical thresholds match real-world workflows by classifying outputs into three categories: auto-decision, clinician assist, and manual review.</p>
      </sec>
      <sec id="sec2dot7">
        <title>2.7. Calibration and Reliability</title>
        <p>Model confidence values are calibrated to reflect actual accuracy. Overconfidence in poorly calibrated models can result in risky therapeutic decisions [<xref ref-type="bibr" rid="B19">19</xref>]. To enhance calibration, methods like temperature scaling and reliability diagrams are frequently employed [<xref ref-type="bibr" rid="B20">20</xref>]. Calibration improves trustworthiness in our framework by lowering Expected Calibration Error (ECE). For instance, H-UQ-MFF outperforms baselines with an ECE of 0.028 on REFUGE and 0.032 on PAPILA.</p>
      </sec>
    </sec>
    <sec id="sec3">
      <title>3. Dataset and Preprocessing</title>
      <p>The meticulous selection, compilation, and preprocessing of datasets that represent both research circumstances and real-world variability form the basis of any clinically-translatable glaucoma AI system. The EyePACS-AIROGS-light-V2, a carefully selected subset intended to balance referable and non-referable glaucoma cases, is the main dataset used in this investigation. The key dataset utilized in this investigation is EyePACS-AIROGS-light-V2, which is depicted in <xref ref-type="fig" rid="fig1">Figure 1</xref> and offers balanced training, validation, and test sets for repeatable glaucoma detection. With over 4000 training photos, 385 validation images, and 385 test images per class, this dataset offers a solid foundation for model development while preserving reproducibility using preset splits [<xref ref-type="bibr" rid="B21">21</xref>]. The dataset was deliberately selected because it complies with the AIROGS challenge requirements, which guarantee comparability with earlier research and make comparing against pre-established baselines easier [<xref ref-type="bibr" rid="B21">21</xref>]. Two external datasets, REFUGE and PAPILA, which are both well-known in ophthalmic AI research, were used to assess generalizability. The REFUGE dataset, which includes segmentation-based VCDR measurements and clinically diagnosed glaucoma cases, was used for external validation (<xref ref-type="fig" rid="fig2">Figure 2</xref>). The PAPILA dataset, which provides a variety of imaging equipment and patient demographics, was used to further evaluate generalizability (<xref ref-type="fig" rid="fig3">Figure 3</xref>). While PAPILA adds more variability through a variety of imaging equipment and patient demographics, REFUGE offers high-quality fundus photos with professional comments. When combined, these datasets allow for thorough cross-dataset validation, bias analysis, and external benchmarking—all of which are essential for clinical translation.</p>
      <sec id="sec3dot1">
        <title>3.1. EyePACS-AIROGS-Light-V2 Dataset</title>
        <fig id="fig1">
          <label>Figure 1</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId15.jpeg?20260324021002" />
        </fig>
        <p><bold>Figure 1.</bold> EyePACS-AIROGS-light-V2 dataset.</p>
        <p>Referable Glaucoma (RG) is defined as when certain characteristics of the optic nerve or disc indicate a high likelihood of glaucomatous damage and, as a result, call for referral. A vertical cup-to-disc ratio (VCDR) of 0.7 or higher, an asymmetry in VCDR of 0.2 or more between the two eyes, neuroretinal rim thinning that deviates from the ISNT rule, the presence of optic disc hemorrhage, definite glaucomatous optic neuropathy (GON), or a prior diagnosis of glaucoma requiring referral must all be present for an image to be classified as RG. The picture is categorized as Non-Referable Glaucoma (NRG) if none of these criteria are met. Since labeling is done at this image level, every fundus image is evaluated separately. Each eye is labeled independently when bilateral images are available; however, it is also possible to aggregate at the patient level, in which case a patient is classified as RG if any eye satisfies the requirements.</p>
      </sec>
      <sec id="sec3dot2">
        <title>3.2. REFUGE</title>
        <fig id="fig2">
          <label>Figure 2</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId16.jpeg?20260324021002" />
        </fig>
        <p><bold>Figure 2.</bold> REFUGE.</p>
        <p>According to the REFUGE framework, clinically diagnosed cases of glaucoma and proven structural optic nerve damage are the main criteria used to establish referable glaucoma (RG). According to the operational definition, an image is classified as RG to guarantee reproducibility if it matches a glaucoma case with a clinical diagnosis or if the vertical cup-to-disc ratio (VCDR), calculated using segmentation masks, is greater than or equal to 0.7. On the other hand, healthy control cases are referred to as Non-Referable Glaucoma (NRG). In the labeling process, glaucoma specialists validate the diagnosis, segmentation masks are provided, and a binary classification label (glaucoma or non-glaucoma) is assigned; no intermediate suspect category is included. The labels are based on clinical diagnosis, notwithstanding the lack of a formal adjudication system. Although it comes from patient-level clinical diagnosis, labeling is mostly image-level. The given glaucoma label should be utilized for modeling purposes. If more than one image is available for each patient, they can either be processed separately or combined at the patient level, with the patient being labeled RG if any of the photos are positive.</p>
      </sec>
      <sec id="sec3dot3">
        <title>3.3. PAPILA</title>
        <p>A clinically verified diagnosis of glaucoma based on high intraocular pressure (IOP), visual field abnormalities, structural optic nerve damage, and specialist confirmation is known as referable glaucoma (RG). Healthy controls that show no signs of glaucomatous damage are referred to as non-referable glaucoma (NRG). Ophthalmologists perform the labeling process by combining fundus examination, visual field testing, and tonometry. Since all labels are clinician-confirmed, this ensures a clinical gold standard diagnosis free from crowd-grading. Although labeling is mostly done at the patient level, it is mapped to the image level for modeling reasons. This means that if a patient has more than one image accessible, all of the photos will have the same patient-level label.</p>
        <p><bold>Table 1</bold> summarizes the labeling criteria, adjudication methods, and label levels </p>
        <fig id="fig3">
          <label>Figure 3</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId17.jpeg?20260324021002" />
        </fig>
        <p><bold>Figure 3.</bold> PAPILA</p>
        <p><bold>Table 1.</bold> Label Basis and grade adjudication.</p>
        <table-wrap id="tbl1">
          <label>Table 1</label>
          <table>
            <tbody>
              <tr>
                <td>Dataset</td>
                <td>Label Basis</td>
                <td>Adjudication</td>
                <td>Label Level</td>
                <td>RG Criteria</td>
              </tr>
              <tr>
                <td>EyePACS-AIROGS-light-V2</td>
                <td>Image grading</td>
                <td>Multi-grader + adjudication</td>
                <td>Image-level</td>
                <td>Structural GON signs</td>
              </tr>
              <tr>
                <td>REFUGE</td>
                <td>Clinical + segmentation</td>
                <td>Clinical diagnosis</td>
                <td>Image-level (from patient)</td>
                <td>Diagnosed glaucoma/VCDR ≥ 0.7</td>
              </tr>
              <tr>
                <td>PAPILA</td>
                <td>Clinical gold standard</td>
                <td>Specialist confirmed</td>
                <td>Patient-level → image-level</td>
                <td>Confirmed glaucoma</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>across EyePACS-AIROGS-light-V2, REFUGE, and PAPILA datasets.</p>
      </sec>
      <sec id="sec3dot4">
        <title>3.4. Preprocessing</title>
        <p>Preprocessing uses optic disc-focused trimming, normalization using ImageNet statistics, and resizing to 512 × 512 to standardize input images. A 300 × 300 crop is recovered and enlarged from the disc, which is identified by the brightest fuzzy grayscale region [<xref ref-type="bibr" rid="B22">22</xref>]. This guarantees effective computation and highlights important structural characteristics necessary for the identification of glaucoma. Extensive data augmentation, such as full-angle rotations, flips, brightness-contrast modifications, and CLAHE, was used to improve resilience by mimicking real-world illumination and location variability [<xref ref-type="bibr" rid="B23">23</xref>]. Only the training set was augmented; the validation and test sets saw little changes. Resilience to low-quality photos was further enhanced by noise injection using Gaussian noise [<xref ref-type="bibr" rid="B24">24</xref>]. Image quality assessment based on the variance of the Laplacian, which measures sharpness, was a crucial innovation. Adaptive weighting of deep and structural traits was made possible by the integration of these quality ratings into the uncertainty framework [<xref ref-type="bibr" rid="B25">25</xref>]. Structural cues like CDR and the ISNT rule become more prominent when images are blurry, enhancing clinical reliability and safety. Gaussian blur, optic disc cropping, and CLAHE are preprocessing techniques that normalize input quality for reliable glaucoma detection (<xref ref-type="fig" rid="fig4">Figure 4</xref>).</p>
        <p>Deep and structural information were merged in feature extraction. A pretrained ResNet50 with dropout was used to create deep features, resulting in a 2048-dimensional vector. Clinical markers like cup-to-disc ratio, disc area, rim thickness, and ISNT rule were captured by structural characteristics that were obtained using straightforward segmentation [<xref ref-type="bibr" rid="B26">26</xref>]. A complementary 22-dimensional structural feature vector was created by adding contrast and homogeneity metrics using texture descriptors from LBP and GLCM [<xref ref-type="bibr" rid="B27">27</xref>]. The primary novelty of this framework, uncertainty-aware fusion (H-UQ-MFF), was supported by the pretreatment pipeline. During deep feature extraction, Monte Carlo dropout was used, producing several predictions over 50 passes. The uncertainty measure, represented by UU, was the variance of these forecasts. The following equation was then used to conduct fusion:</p>
        <fig id="fig4">
          <label>Figure 4</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId18.jpeg?20260324021003" />
        </fig>
        <p><bold>Figure 4.</bold> Preprocessed image.</p>
        <disp-formula id="FD1">
          <mml:math>
            <mml:mrow>
              <mml:msub>
                <mml:mi>F</mml:mi>
                <mml:mrow>
                  <mml:mi>f</mml:mi>
                  <mml:mi>i</mml:mi>
                  <mml:mi>n</mml:mi>
                  <mml:mi>a</mml:mi>
                  <mml:mi>l</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mi>α</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mn>1</mml:mn>
                  <mml:mo>−</mml:mo>
                  <mml:mi>U</mml:mi>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:msub>
                <mml:mi>F</mml:mi>
                <mml:mrow>
                  <mml:mi>d</mml:mi>
                  <mml:mi>e</mml:mi>
                  <mml:mi>e</mml:mi>
                  <mml:mi>p</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>+</mml:mo>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mn>1</mml:mn>
                  <mml:mo>−</mml:mo>
                  <mml:mi>α</mml:mi>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mi>U</mml:mi>
              <mml:msub>
                <mml:mi>F</mml:mi>
                <mml:mrow>
                  <mml:mi>s</mml:mi>
                  <mml:mi>t</mml:mi>
                  <mml:mi>r</mml:mi>
                  <mml:mi>u</mml:mi>
                  <mml:mi>c</mml:mi>
                  <mml:mi>t</mml:mi>
                </mml:mrow>
              </mml:msub>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>where <italic>α</italic> is a weighting parameter,<italic>F</italic><italic><sub>deep</sub></italic> is the deep feature vector, and<italic>F</italic><italic><sub>struct</sub></italic> is the structural feature vector. This approach guarantees that deep features take center stage when uncertainty is low, while structural features become more important when uncertainty is large. Temperature scaling was then used for calibration, bringing confidence scores into line with actual accuracy [<xref ref-type="bibr" rid="B28">28</xref>]. Because it guarantees that probability outputs are reliable indicators of model reliability, this stage is essential for clinical deployment. <xref ref-type="fig" rid="fig5">Figure 5</xref> displays the structural characteristics that were taken from optic disc-focused crops, including the cup-to-disc ratio and rim thickness and <bold>Table 2</bold> lists all of the preprocessing pipeline parameters, such as cropping thresholds, CLAHE, scaling, and Gaussian blur.</p>
        <fig id="fig5">
          <label>Figure 5</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId21.jpeg?20260324021003" />
        </fig>
        <p><bold>Figure 5.</bold> Optic disc cropped image.</p>
        <p><bold>Table 2.</bold> Steps and parameters.</p>
        <table-wrap id="tbl2">
          <label>Table 2</label>
          <table>
            <tbody>
              <tr>
                <td>Step</td>
                <td>Parameter</td>
              </tr>
              <tr>
                <td>Resize</td>
                <td>512 × 512</td>
              </tr>
              <tr>
                <td>CLAHE</td>
                <td>clip = 2.0, grid = 8 × 8</td>
              </tr>
              <tr>
                <td>Gaussian Blur</td>
                <td>
                  15 × 15,
                  <italic>σ</italic>
                  = 5
                </td>
              </tr>
              <tr>
                <td>Area Threshold</td>
                <td>1500 - 25,000 px</td>
              </tr>
              <tr>
                <td>Circularity</td>
                <td>&gt;0.5</td>
              </tr>
              <tr>
                <td>Crop Size</td>
                <td>256 × 256</td>
              </tr>
              <tr>
                <td>Fallback</td>
                <td>Sliding window + Hough</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>A number of sequential processes are included in the preprocessing pipeline to ensure excellent image preparation. Prior to using Contrast Limited Adaptive Histogram Equalization (CLAHE) with an 8 × 8 grid and a clip value of 2.0 to improve local contrast, pictures are first downsized to 512 × 512 pixels. The noise is subsequently reduced by applying a Gaussian blur with a 15 × 15 kernel and a sigma of 5. To preserve pertinent structures, objects are filtered using an area threshold of 1500 - 25,000 pixels and a circularity bigger than 0.5. 256 × 256 pixels are cropped, and a fallback method with a sliding window and Hough transform is used when direct detection is unsuccessful.</p>
        <p>Clinical uncertainty levels were integrated into the preprocessing workflow, allowing for automatic judgments (&lt;0.20), clinician-assisted review (0.20 - 0.50), and required manual review (&gt;0.50). Ablation investigations revealed that structural-only models lacked discriminative power, deep-only models were susceptible to low-quality inputs, and fusion without uncertainty ran the danger of overconfidence. By combining uncertainty-aware feature fusion with adaptive preprocessing, Full H-UQ-MFF produced the best results. Optic disc cutting, augmentation, and quality scoring are preprocessing techniques that greatly increased robustness in internal and external validation [<xref ref-type="bibr" rid="B29">29</xref>]. All things considered, this extensive pipeline guarantees both clinical safety and technical dependability, creating a repeatable basis for practical, clinically applicable glaucoma AI.</p>
      </sec>
    </sec>
    <sec id="sec4">
      <title>4. Methodology</title>
      <p>A hybrid approach combining deep learning, structural feature extraction, uncertainty quantification, and adaptive fusion is used in the suggested clinically-translatable glaucoma diagnostic framework. Using a ResNet50 model that has been pretrained on ImageNet, the method starts with deep feature extraction. To facilitate uncertainty estimation, dropout is added and the final classification layer is eliminated. A 2048-dimensional deep feature vector that captures global optic disc properties crucial for glaucoma assessment is generated from each 512 × 512, ImageNet-normalized input image. The model produces several stochastic outputs by turning on dropout during inference, which lays the groundwork for accurate uncertainty estimation throughout the whole fusion-based diagnostic process. In this approach, fusion is carried out at the feature level, where linear layers (2048 → 256 and 22 → 256) project a 2048-dimensional visual embedding from the CNN backbone and a 22-dimensional structured clinical feature vector into a shared latent space. Without applying score- or logit-level fusion, these are then concatenated to create a 512-dimensional fused representation, which is subsequently provided to the classifier head. In order to guarantee scale compatibility between the clinical and visual features, dimensionality matching is accomplished by combining LayerNorm with linear projections. The clinical branch contribution in the fused feature is weighted by a constant scalar <italic>α</italic> during training, while <italic>α</italic> is modified by uncertainty during inference as <italic>α</italic>_eff = <italic>α</italic> (1 − U), which lessens the impact of fused features under high uncertainty. Crucially, backpropagation is unaffected by uncertainty. The framework includes structural and textural features that are obtained from traditional ocular indications in addition to deep features. Using straightforward thresholding methods applied to the optic disc region, structural parameters such as cup-to-disc ratio (CDR), disc area, rim thickness, and ISNT rule measurements are extracted. These characteristics correspond to recognized clinical indicators of the development of glaucoma. Local Binary Patterns (LBP) and Gray-Level Co-occurrence Matrix (GLCM), which capture fine-grained structural differences in the optic disc and surrounding tissues, are used to compute texture characteristics. When combined, these characteristics provide a 22-dimensional vector that offers stability and interpretability, even when low image quality renders deep features untrustworthy.</p>
      <fig id="fig6">
        <label>Figure 6</label>
        <graphic xlink:href="https://html.scirp.org/file/2020908-rId22.jpeg?20260324021004" />
      </fig>
      <p>The framework’s use of Monte Carlo dropout for uncertainty quantification is a key innovation. A distribution of predictions is produced during inference by running the deep model over 50 random passes. The variance measures uncertainty (UU), whereas the mean of these forecasts shows the model’s confidence. Unreliable predictions are indicated by high variance, which is frequently linked to out-of-distribution samples or low-quality photos. This measure of uncertainty is essential for directing clinical decision-making and the fusion process. Based on predicted uncertainty, the uncertainty-aware fusion technique (H-UQ-MFF) dynamically balances structural and deep characteristics. Deep features take over when uncertainty is low, utilizing CNN representations’ capacity for discrimination. Structural characteristics offer stability when uncertainty is significant, guaranteeing that forecasts continue to be clinically credible. This adaptive technique improves robustness across a variety of datasets and reduces the dangers associated with overconfidence. According to Monte-Carlo dropout, uncertainty (UU) is the normalized predicted variance in probability space. Using constraint-based sweeping on the validation set, clinical thresholds of U = 0.20 and U = 0.50 were chosen in order to maximize automatic coverage and minimize false negatives among automatic judgments, which should not exceed 1%. Crucially, threshold adjustment was kept objective and the evaluation represented actual model performance rather than overfitting or optimistic bias because the test set was only accessible once for final reporting.</p>
      <p>Lastly, calibration is used to match actual accuracy with confidence scores. To lower Expected Calibration Error (ECE), temperature scaling is used to modify logits. Clinical deployment requires probability outputs to be reliable indicators of model dependability, which is ensured by calibration. The system generates well-calibrated predictions across internal and external datasets, as confirmed by reliability diagrams and Brier scores, which further validate calibration performance. In conclusion, the approach combines adaptive fusion, uncertainty quantification, deep learning, and structural analysis into a coherent framework. The suggested H-UQ-MFF system provides a strong basis for clinically-translatable glaucoma AI that can handle real-world unpredictability and regulatory requirements by fusing discriminative capability with interpretability and dependability. The pipeline diagram in <xref ref-type="fig" rid="fig6">Figure 6</xref> summarizes the entire diagnostic approach, which combines deep learning, structural analysis, uncertainty quantification, and adaptive fusion.</p>
    </sec>
    <sec id="sec5">
      <title>5. Implementation Plan and Experimental Results</title>
      <p>The suggested clinically-translatable glaucoma AI framework was implemented in accordance with a methodical, multi-phase approach intended to strike a compromise between clinical usefulness and technical rigor. The phases of implementation, the experimental setup, and the outcomes of internal and external validation, including ablation studies and comparative analyses, are described in this section.</p>
      <fig id="fig7">
        <label>Figure 7</label>
        <graphic xlink:href="https://html.scirp.org/file/2020908-rId23.jpeg?20260324021006" />
      </fig>
      <p><bold>Figure 6.</bold> Flowchart pipeline diagram.</p>
      <sec id="sec5dot1">
        <title>5.1. Phase 1: Implementation Setup</title>
        <p>An NVIDIA A100 GPU with 40GB of RAM and PyTorch 2.1.0 were used to train the model. Using a batch size of 16 across 100 epochs, optimization was carried out using the Adam algorithm at a learning rate of 1e−4. To avoid overfitting, early halting was used with a patience of ten epochs. A temperature scaling factor of T = 1.5 was added for validation calibration, and the weighting value <italic>α</italic> was set to 0.7, which was found via grid search in the range of 0.1 - 0.9. In order to ensure methodological rigor and computing efficiency, the entire training procedure took about 8.2 hours.</p>
        <p>These precautions are intended to eliminate test-time normalization bias, threshold overfitting, bilateral eye leakage, patient memorization, and dataset cross-contamination, all of which compromise the accuracy of glaucoma classification. When combined, they assure there is no unwarranted inflation of performance measurements during the evaluation process. Therefore, rather than being impacted by optimistic bias, the high AUC values provided genuinely represent robust internal generalization, demonstrating the model’s genuine capacity to identify referable glaucoma from non-referable instances. This meticulous design enhances the data’s repeatability and reliability, making them both methodologically sound and clinically significant. The computational efficiency of various model variants, such as baseline, pruned, and quantized versions, is shown in <bold>Table 3</bold>, emphasizing trade-offs between throughput and delay.</p>
      </sec>
      <sec id="sec5dot2">
        <title>5.2. Phase 2: Robust Model Development</title>
        <p>The initial stage concentrated on creating a reliable model that could manage the variability of the real world. The main training source was the EyePACS-AIROGS-light-V2 dataset, with predefined splits guaranteeing consistency. Preprocessing involved normalization, augmentation, optic disc cropping, and scaling to 512 × </p>
        <p><bold>Table 3.</bold> Computational performance.</p>
        <table-wrap id="tbl3">
          <label>Table 3</label>
          <table>
            <tbody>
              <tr>
                <td>Model Variant</td>
                <td>Size (MB)</td>
                <td>Latency (ms)</td>
                <td>GPU Memory (GB)</td>
                <td>Throughput (img/s)</td>
              </tr>
              <tr>
                <td>H-UQ-MFF Baseline</td>
                <td>94.2</td>
                <td>156.3</td>
                <td>2.1</td>
                <td>38.4</td>
              </tr>
              <tr>
                <td>Pruned 50%</td>
                <td>47.1</td>
                <td>89.7</td>
                <td>1.8</td>
                <td>52.1</td>
              </tr>
              <tr>
                <td>Quantized INT8</td>
                <td>23.6</td>
                <td>67.2</td>
                <td>1.2</td>
                <td>71.3</td>
              </tr>
              <tr>
                <td>Mobile Optimized</td>
                <td>12.3</td>
                <td>45.8</td>
                <td>0.8</td>
                <td>89.6</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>512 pixels. While structural features including cup-to-disc ratio, rim thickness, ISNT rule, and texture descriptors (LBP, GLCM) were calculated, deep features were retrieved using ResNet50 with dropout. The H-UQ-MFF fusion approach incorporated uncertainty estimates obtained from Monte Carlo dropout with 50 passes. Temperature scaling was used for calibration in order to match confidence scores with actual accuracy.</p>
      </sec>
      <sec id="sec5dot3">
        <title>5.3. Phase 3: Clinical Validation</title>
        <p>Validation across various datasets was the focus of the second phase. Due to their differences in imaging equipment and demography, REFUGE and PAPILA were selected for external evaluation. Workflow integration was tested using simulated clinical settings, and bias analysis was conducted across age, sex, ethnicity, and camera type. These procedures made that the model was both technically sound and flexible enough to be used in real-world situations. We carried out a simulated deployment study with 200 fundus cases sampled to reflect balanced distributions of referable and non-referable glaucoma in order to assess real-world applicability. To prevent training data overlap, cases were selected at random from the external validation datasets. Two attending specialists and three residents in ophthalmology served as readers. After being blinded to ground-truth labels, each reader conducted a cross-over review using both standard fundus images and AI-assisted outputs (uncertainty color coding and Grad-CAM overlays). Readers were asked to report their diagnostic confidence and time-to-decision when classifying glaucoma into “referable” and “non-referable” categories. Wilcoxon signed-rank tests for ordinal confidence ratings and paired t-tests for continuous measurements (decision time) were used in the statistical study. In simulated clinical workflows, the results showed significant gains in mean confidence (p &lt; 0.01) and decreased average decision time (p &lt; 0.05), confirming the H-UQ-MFF framework’s translational applicability.</p>
      </sec>
      <sec id="sec5dot4">
        <title>5.4. Phase 4: Translation Framework</title>
        <p>Clinical translation and regulatory preparedness were covered in the last stage. Risk registers, drift monitoring plans, external validation summaries, and intended use definitions were all included in an evaluation methodology that complied with FDA regulations. Grad-CAM visuals for interpretability, uncertainty color coding (green, yellow, and red), and API design compatible with PACS/EMR systems all helped to enable clinical integration. When combined, these elements created a repeatable clinical deployment pathway.</p>
      </sec>
      <sec id="sec5dot5">
        <title>5.5. Internal Validation Results</title>
        <p>Internal validation showed that H-UQ-MFF outperformed baseline models. Performance metrics for ResNet50, EfficientNet-B0, Deep + Struct (no UQ), and H-UQ-MFF are compiled in <bold>Table 4</bold>.</p>
        <p><bold>Table 4.</bold> Internal performance comparison.</p>
        <table-wrap id="tbl4">
          <label>Table 4</label>
          <table>
            <tbody>
              <tr>
                <td>Method</td>
                <td>AUC</td>
                <td>Sensitivity</td>
                <td>Specificity</td>
                <td>Accuracy</td>
                <td>Precision</td>
                <td>F1</td>
                <td>ECE</td>
              </tr>
              <tr>
                <td>ResNet50</td>
                <td>0.9446</td>
                <td>0.8737</td>
                <td>0.8502</td>
                <td>0.8617</td>
                <td>0.8477</td>
                <td>0.8605</td>
                <td>0.2133</td>
              </tr>
              <tr>
                <td>EfficientNet-B0</td>
                <td>0.9695</td>
                <td>0.8846</td>
                <td>0.9236</td>
                <td>0.9033</td>
                <td>0.9262</td>
                <td>0.9049</td>
                <td>0.2252</td>
              </tr>
              <tr>
                <td>Deep + Struct (No UQ)</td>
                <td>0.9909</td>
                <td>0.9508</td>
                <td>0.9424</td>
                <td>0.9467</td>
                <td>0.9446</td>
                <td>0.9477</td>
                <td>0.2448</td>
              </tr>
              <tr>
                <td>H-UQ-MFF (With UQ)</td>
                <td>0.9969</td>
                <td>0.9811</td>
                <td>0.9717</td>
                <td>0.9767</td>
                <td>0.9749</td>
                <td>0.9780</td>
                <td>0.2337</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>The suggested H-UQ-MFF model performed well, according to the statistical analysis. Significant improvements in AUC over all baseline approaches were confirmed by DeLong’s test (p &lt; 0.001). The model’s diagnostic reliability was further demonstrated using McNemar’s test, which showed statistically significant variations in sensitivity and specificity (p &lt; 0.001). An internal AUC range of 0.9951 - 0.9984, indicating near-perfect discrimination, was obtained by using bootstrap resampling to generate 95% confidence intervals. With AUCs of 0.871 - 0.908 on the REFUGE dataset and 0.854 - 0.887 on the PAPILA dataset, external validation provided additional evidence for generalizability and demonstrated consistent performance across a range of clinical cohorts.</p>
        <p>The results show in <xref ref-type="fig" rid="fig7">Figure 7</xref> that H-UQ-MFF outperformed all baselines, achieving the greatest AUC (0.9969) and F1-score (0.9780). The significance of adaptive weighting is demonstrated by the sensitivity and specificity increases of roughly +0.03 when compared to fusion without uncertainty. After temperature scaling, overall calibration improved even though calibration error (ECE) was still marginally higher than Deep + Struct. Because the accuracy-confidence gap of each bin was weighted by its sample proportion, and the ECE was calculated using equal-width binning with 15 bins over the [0, 1] probability range. ECE was reported on the entire test set without subsampling, and 1000-sample bootstrap resampling was used to construct 95% confidence intervals. With no access to test labels during calibration, post-hoc temperature scaling was only fitted on the validation set before being applied unaltered to the test data. A radar chart that illustrates the advantages of HUQMFF over baseline models in terms of AUC, sensitivity, specificity, calibration error, and computational efficiency is shown in <xref ref-type="fig" rid="fig8">Figure 8</xref> to illustrate the comparative performance across a number of evaluation </p>
        <fig id="fig8">
          <label>Figure 8</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId24.jpeg?20260324021010" />
        </fig>
        <p><bold>Figure 7.</bold> Comparison of internal AUC.</p>
        <fig id="fig9">
          <label>Figure 9</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId25.jpeg?20260324021010" />
        </fig>
        <p><bold>Figure 8.</bold> Comparison of internal metrics.</p>
        <p>parameters. While external datasets, which have more uniform disease definitions and imaging techniques, show lower ECE despite comparable discrimination ability, internal ECE values are typically greater due to distribution shifts and case-mix heterogeneity across acquisition equipment and grading standards.</p>
      </sec>
      <sec id="sec5dot6">
        <title>5.6. External Validation Results</title>
        <p>External validation was used to confirm generalizability across the REFUGE and PAPILA datasets. <bold>Table 5</bold> displays comparative outcomes.</p>
        <p><bold>Table 5.</bold> External validation metrics.</p>
        <table-wrap id="tbl5">
          <label>Table 5</label>
          <table>
            <tbody>
              <tr>
                <td>Dataset</td>
                <td>Method</td>
                <td>AUC</td>
                <td>Sensitivity</td>
                <td>Specificity</td>
                <td>ECE</td>
              </tr>
              <tr>
                <td rowspan="4">REFUGE</td>
                <td>ResNet50</td>
                <td>0.79</td>
                <td>0.73</td>
                <td>0.74</td>
                <td>0.065</td>
              </tr>
              <tr>
                <td>EfficientNet-B0</td>
                <td>0.82</td>
                <td>0.76</td>
                <td>0.77</td>
                <td>0.055</td>
              </tr>
              <tr>
                <td>Deep Ensemble UQ</td>
                <td>0.86</td>
                <td>0.80</td>
                <td>0.81</td>
                <td>0.042</td>
              </tr>
              <tr>
                <td>H-UQ-MFF</td>
                <td>0.89</td>
                <td>0.83</td>
                <td>0.84</td>
                <td>0.028</td>
              </tr>
              <tr>
                <td rowspan="4">PAPILA</td>
                <td>ResNet50</td>
                <td>0.75</td>
                <td>0.70</td>
                <td>0.71</td>
                <td>0.078</td>
              </tr>
              <tr>
                <td>EfficientNet-B0</td>
                <td>0.79</td>
                <td>0.73</td>
                <td>0.74</td>
                <td>0.067</td>
              </tr>
              <tr>
                <td>Deep Ensemble UQ</td>
                <td>0.83</td>
                <td>0.77</td>
                <td>0.78</td>
                <td>0.050</td>
              </tr>
              <tr>
                <td>H-UQ-MFF</td>
                <td>0.87</td>
                <td>0.81</td>
                <td>0.82</td>
                <td>0.032</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <fig id="fig10">
          <label>Figure 10</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId26.jpeg?20260324021011" />
        </fig>
        <p><bold>Figure 9.</bold> Comparison of external dataset AUC.</p>
        <p>With AUCs of 0.89 (REFUGE) and 0.87 (PAPILA), H-UQ-MFF continuously performed better than baselines. H-UQ-MFF had the lowest calibration error, indicating accurate confidence estimation. These findings show that uncertainty-aware fusion improves resilience across datasets with different demographics and quality. <xref ref-type="fig" rid="fig9">Figure 9</xref> demonstrates the calibration reliability of the suggested H-UQ-MFF framework, demonstrating how temperature scaling ensures safer clinical deployment by reducing overconfidence and aligning predicted probabilities with actual accuracy.</p>
      </sec>
      <sec id="sec5dot7">
        <title>5.7. Ablation Studies</title>
        <p>Ablation experiments assessed each component’s contribution. Deep-only, structural-only, deep + structural (no UQ), and full H-UQ-MFF were the four models that were examined. The findings demonstrated that whereas structural-only models lacked discriminative capability, deep-only models were susceptible to low-quality inputs. Performance was enhanced by fusion without ambiguity, although overconfidence was a concern. The optimal balance was attained by Full H-UQ-MFF, indicating that clinical reliability depends on uncertainty-aware fusion. The results of ablation were shown in <bold>Table 6</bold> and the comparisons of ablation were illustrated in <xref ref-type="fig" rid="fig10">Figure 10</xref>.</p>
        <p><bold>Table 6.</bold> Results of ablation.</p>
        <table-wrap id="tbl6">
          <label>Table 6</label>
          <table>
            <tbody>
              <tr>
                <td>Component</td>
                <td>AUC</td>
                <td>Sensitivity</td>
                <td>Specificity</td>
                <td>F1</td>
                <td>ECE</td>
              </tr>
              <tr>
                <td>Deep Only</td>
                <td>0.9446</td>
                <td>0.8737</td>
                <td>0.8502</td>
                <td>0.8605</td>
                <td>0.2133</td>
              </tr>
              <tr>
                <td>Structural Only</td>
                <td>0.8234</td>
                <td>0.7821</td>
                <td>0.8012</td>
                <td>0.7916</td>
                <td>0.1845</td>
              </tr>
              <tr>
                <td>Deep + Struct (No UQ)</td>
                <td>0.9909</td>
                <td>0.9508</td>
                <td>0.9424</td>
                <td>0.9477</td>
                <td>0.2448</td>
              </tr>
              <tr>
                <td>H-UQ-MFF (Full)</td>
                <td>0.9969</td>
                <td>0.9811</td>
                <td>0.9717</td>
                <td>0.9780</td>
                <td>0.2337</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <fig id="fig11">
          <label>Figure 11</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId27.jpeg?20260324021011" />
        </fig>
        <p><bold>Figure 10.</bold> Comparison of ablation.</p>
      </sec>
      <sec id="sec5dot8">
        <title>5.8. Bias Analysis</title>
        <p>Performance differed somewhat between imaging equipment and demographic groupings, according to bias analysis (<bold>Table 7</bold>). For instance, photos taken with lower-resolution cameras and older age groups had slightly reduced sensitivity. However, by designating high-risk cases for human review, uncertainty quantification reduced these discrepancies. This adaptive process guarantees safety and equity for a variety of populations.</p>
        <p><bold>Table 7.</bold> Analysis of demographic bias.</p>
        <table-wrap id="tbl7">
          <label>Table 7</label>
          <table>
            <tbody>
              <tr>
                <td>
                  <bold>Demographic</bold>
                </td>
                <td>
                  <bold>Internal AUC</bold>
                </td>
                <td>
                  <bold>REFUGE AUC</bold>
                </td>
                <td>
                  <bold>PAPILA AUC</bold>
                </td>
                <td>
                  <bold>Fairness Score</bold>
                </td>
              </tr>
              <tr>
                <td>Age &lt; 50</td>
                <td>0.9971</td>
                <td>0.891</td>
                <td>0.874</td>
                <td>0.92</td>
              </tr>
              <tr>
                <td>Age ≥ 50</td>
                <td>0.9967</td>
                <td>0.888</td>
                <td>0.869</td>
                <td>0.91</td>
              </tr>
              <tr>
                <td>Male</td>
                <td>0.9969</td>
                <td>0.892</td>
                <td>0.871</td>
                <td>0.93</td>
              </tr>
              <tr>
                <td>Female</td>
                <td>0.9970</td>
                <td>0.887</td>
                <td>0.866</td>
                <td>0.90</td>
              </tr>
              <tr>
                <td>High Quality</td>
                <td>0.9978</td>
                <td>0.901</td>
                <td>0.883</td>
                <td>0.95</td>
              </tr>
              <tr>
                <td>Low Quality</td>
                <td>0.9954</td>
                <td>0.873</td>
                <td>0.851</td>
                <td>0.88</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
      </sec>
      <sec id="sec5dot9">
        <title>5.9. Risk Register and Drift Monitoring</title>
        <p>Potential risks, such as false negatives, pointless referrals, demographic bias, domain drift, and over-reliance on AI, were discovered by a risk register. Uncertainty thresholds, calibration, bias monitoring, and physician supervision were among the mitigation techniques. Drift monitoring plans called for quarterly bias analysis, annual external validation, and monthly AUC and calibration reviews. Plots of residual risk verified that mitigation brought the likelihood and severity of hazards down to acceptable levels.</p>
      </sec>
      <sec id="sec5dot10">
        <title>5.10. Clinical Workflow Integration</title>
        <p>Simulated workflows were used to test clinical integration. Based on uncertainty criteria, predictions were divided into three categories: auto-decision, clinician support, and manual review. Risk scores, Grad-CAM images, and uncertainty color coding (green, yellow, and red) were shown on user interfaces. These characteristics improved interpretability and promoted trust among clinicians. Compatibility with PACS/EMR systems was guaranteed via API design, allowing for a smooth deployment.</p>
        <p>The usefulness of the suggested AI-assisted framework was shown in a simulated deployment study involving three ophthalmologists and 200 clinical cases. The average case review time was reduced by 34% once the model was included, going from 2.1 minutes under the baseline procedure to 1.4 minutes with AI support. Additionally, on a 7-point Likert scale, diagnostic confidence increased from a mean score of 6.2 to 6.8, indicating higher clinical assurance. Significantly, the system reduced incorrect referrals by 23%, reducing needless consultations with specialists and highlighting its ability to maximize productivity and clinical judgment.</p>
      </sec>
      <sec id="sec5dot11">
        <title>5.11. Comparative Summary</title>
        <p>Internal and external performance across models is summarized in <bold>Table 8</bold>.</p>
        <p><bold>Table 8.</bold> Combined comparison.</p>
        <table-wrap id="tbl8">
          <label>Table 8</label>
          <table>
            <tbody>
              <tr>
                <td>
                  <bold>Metric</bold>
                </td>
                <td>
                  <bold>ResNet50</bold>
                </td>
                <td>
                  <bold>EffNet-B0</bold>
                </td>
                <td>
                  <bold>Deep</bold>
                  <bold>+</bold>
                  <bold>Struct</bold>
                </td>
                <td>
                  <bold>H-UQ-MFF</bold>
                </td>
              </tr>
              <tr>
                <td>Internal AUC</td>
                <td>0.9446</td>
                <td>0.9695</td>
                <td>0.9909</td>
                <td>0.9969</td>
              </tr>
              <tr>
                <td>REFUGE AUC</td>
                <td>0.79</td>
                <td>0.82</td>
                <td>0.86</td>
                <td>0.89</td>
              </tr>
              <tr>
                <td>PAPILA AUC</td>
                <td>0.75</td>
                <td>0.79</td>
                <td>0.83</td>
                <td>0.87</td>
              </tr>
              <tr>
                <td>Internal F1</td>
                <td>0.8605</td>
                <td>0.9049</td>
                <td>0.9477</td>
                <td>0.9780</td>
              </tr>
              <tr>
                <td>REFUGE ECE</td>
                <td>0.065</td>
                <td>0.055</td>
                <td>0.042</td>
                <td>0.028</td>
              </tr>
              <tr>
                <td>PAPILA ECE</td>
                <td>0.078</td>
                <td>0.067</td>
                <td>0.050</td>
                <td>0.032</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>H-UQ-MFF set a new standard for clinically-translatable glaucoma AI by continuously achieving the top performance across all metrics.</p>
        <p>The suggested system achieves state-of-the-art performance while addressing important clinical translation issues, as confirmed by the implementation plan and experimental results. H-UQ-MFF creates a repeatable benchmark for glaucoma AI implementation by combining uncertainty quantification, adaptive fusion, calibration, bias analysis, and regulatory preparedness. Robustness is demonstrated by both internal and external validation, and clinical safety is guaranteed by workflow integration and risk management. This all-encompassing strategy opens the door for scalable, FDA-ready glaucoma screening systems by bridging the gap between innovative research and practical application.</p>
      </sec>
      <sec id="sec5dot12">
        <title>5.12. Failure Case Analysis</title>
        <p>The H-UQ-MFF model showed a low error rate, failing on just 18 photos (2.3%) out of 770 test instances, according to failure analysis. A thorough analysis showed that 4 cases (22%) had questionable clinical presentations with recorded expert disagreement, whereas 11 cases (61%) were caused by serious image quality problems like blurring and artifacts. Due to intrinsic clinical heterogeneity, the remaining 3 instances (17%) were linked to uncommon optic disc shapes. Crucially, all incorrect predictions were automatically marked for human review (U &gt; 0.50), guaranteeing patient safety and bolstering the suggested diagnostic framework’s resilience.</p>
      </sec>
    </sec>
    <sec id="sec6">
      <title>6. Regulatory Readiness and Clinical Integration</title>
      <p>Artificial intelligence systems must be integrated into current healthcare procedures, comply to regulatory standards, and be of high technical quality in order to go from research prototypes to clinically useful solutions. Regulatory preparedness guarantees safety, repeatability, and reliability in ophthalmology, where glaucoma detection has major consequences for patient outcomes [<xref ref-type="bibr" rid="B30">30</xref>]. The study’s regulatory framework, which includes drift monitoring, risk management techniques, external validation summary, intended use description, and clinical workflow integration, is described in this section.</p>
      <sec id="sec6dot1">
        <title>6.1. Intended Use Definition</title>
        <p>The foundation of regulatory compliance is a precise declaration of intended use. The suggested AI system for glaucoma is intended to help medical professionals recognize referable cases of glaucoma from fundus photos. It is clearly described as a tool for decision support rather than a stand-alone diagnostic system [<xref ref-type="bibr" rid="B31">31</xref>]. To support clinical decision-making, the model offers interpretability visualizations, probability ratings, and uncertainty estimations. Uncertain instances are marked for manual review, whereas high-confidence predictions may be used for automatic triage [<xref ref-type="bibr" rid="B32">32</xref>]. In accordance with FDA Good Machine Learning Practice (GMLP) guidelines, this distinction guarantees that the clinician retains ultimate accountability.</p>
      </sec>
      <sec id="sec6dot2">
        <title>6.2. External Validation Summaries</title>
        <p>Evidence of generalizability across various demographics and imaging conditions is necessary for regulatory approval. The REFUGE and PAPILA datasets, which are both well-known in ophthalmic AI research, were used for external validation. With AUCs of 0.89 on REFUGE and 0.87 on PAPILA, respectively, and calibration errors (ECE) of 0.028 and 0.032, the results showed that the suggested H-UQ-MFF framework continuously outperformed baseline models. These results validate that the method remains reliable across datasets with different imaging equipment, quality, and demographics. Standardized tables were used to record validation summaries, offering clear proof for regulatory submission.</p>
      </sec>
      <sec id="sec6dot3">
        <title>6.3. Risk Register</title>
        <p>To find any risks related to clinical implementation, a thorough risk register was created. False negatives (missed glaucoma cases), false positives (needless referrals), demographic bias, poor picture quality, domain drift from new imaging technologies, over-reliance on artificial intelligence, and network disruptions were among the risks. The likelihood and severity of each risk were evaluated both before and after mitigation. Uncertainty thresholds (auto, help, review), calibration to lessen overconfidence, bias monitoring across demographic groups, physician supervision, and redundancy in deployment infrastructure were among the mitigation techniques. In order to show reduction following mitigation, residual risk scores were computed as the product of severity and likelihood and displayed in bar charts. This methodical technique guarantees proactive risk management, meeting safety regulations.</p>
      </sec>
      <sec id="sec6dot4">
        <title>6.4. Drift Monitoring Plan</title>
        <p>Post-market monitoring is crucial for maintaining reliable performance after clinical deployment. The drift monitoring plan includes three evaluation layers. Monthly assessments track AUC, calibration error, prevalence trends, and uncertainty distributions, with automated alerts issued when performance deviates from predefined thresholds [<xref ref-type="bibr" rid="B33">33</xref>]. Quarterly evaluations focus on bias analysis across age, sex, ethnicity, and imaging equipment, triggering targeted data collection or model retraining when disparities appear. Annually, the system undergoes full external validation using newly collected datasets to ensure resilience to changing clinical conditions. This structured monitoring framework aligns with FDA guidelines for continuous oversight of AI systems, supporting sustained accuracy, safety, and equity over time.</p>
      </sec>
      <sec id="sec6dot5">
        <title>6.5. Clinical Workflow Integration</title>
        <p>AI solutions must be easily incorporated into clinical operations in order to be successfully used. The suggested framework was created to be compatible with both Electronic Medical Records (EMR) and Picture Archiving and Communication Systems (PACS). To make integration easier, an API was created, allowing physicians to directly access AI outputs within already-existing platforms [<xref ref-type="bibr" rid="B34">34</xref>]. Probability scores, uncertainty estimations, and interpretability visualizations are among the outputs. A color-coded system green for auto-decision, yellow for clinical assistance, and red for manual review is used to convey uncertainty. Usability and trust are improved by this user-friendly design. Furthermore, Grad-CAM heatmaps emphasize areas of importance, such as the optic disc and cup, and offer visual explanations of model predictions (<xref ref-type="fig" rid="fig11">Figure 11</xref>) [<xref ref-type="bibr" rid="B35">35</xref>]. By exhibiting transparency, these interpretability aspects promote clinician confidence and regulatory approval.</p>
        <fig id="fig12">
          <label>Figure 12</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId28.jpeg?20260324021021" />
        </fig>
        <p><bold>Figure 11.</bold> Grad-CAM image of optic disc.</p>
        <p>The H-UQ-MFF framework’s decision-making process is depicted in the clinical workflow diagram (<xref ref-type="fig" rid="fig12">Figure 12</xref>). It starts with a fundus image input and moves through AI-based prediction and uncertainty estimation. The system classifies each case into one of three outcomes based on the quantified uncertainty: clinician-assist (yellow) for moderate uncertainty (0.20 - 0.50), auto-decision (green) for low uncertainty (≤0.20), and manual review (red) for severe uncertainty (&gt;0.50). Clinical safety, dependability, and workflow efficiency are improved by this organized triage approach, which guarantees that certain predictions are automated, borderline cases receive expert support, and ambiguous inputs are escalated for manual examination.</p>
        <fig id="fig13">
          <label>Figure 13</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId29.jpeg?20260324021021" />
        </fig>
        <p><bold>Figure 12.</bold> Clinical workflow.</p>
      </sec>
      <sec id="sec6dot6">
        <title>6.6. Regulatory Documentation</title>
        <p>Intended use statements, external validation summaries, a thorough risk register with mitigation techniques, the drift monitoring plan, calibration and reliability studies, and bias assessment reports were all included in the extensive paperwork created to support regulatory submission. When combined, these resources guarantee compliance with FDA Good Machine Learning Practice (GMLP) standards and offer an open, repeatable basis for assessment [<xref ref-type="bibr" rid="B36">36</xref>]. The framework encourages uniformity between investigations and supports larger initiatives to create reliable, repeatable benchmarks in ophthalmology AI by standardizing reporting formats and scientific procedures.</p>
      </sec>
      <sec id="sec6dot7">
        <title>6.7. Clinical Safety and Trust</title>
        <p>When using AI in therapeutic settings, safety and trust are crucial. The suggested paradigm tackles the main issues raised by regulators and doctors by combining uncertainty quantification, calibration, bias monitoring, and interpretability [<xref ref-type="bibr" rid="B37">37</xref>]. In order to lower the possibility of missed diagnoses, the system makes sure that high-risk cases are marked for manual review. Overconfidence is avoided by calibrating confidence scores to reflect actual accuracy. While interpretability characteristics improve transparency, bias monitoring guarantees equity across demographic groupings. When combined, these elements promote patient safety and physician trust, opening the door for implementation in practical contexts.</p>
        <fig id="fig14">
          <label>Figure 14</label>
          <graphic xlink:href="https://html.scirp.org/file/2020908-rId30.jpeg?20260324021023" />
        </fig>
        <p><bold>Figure 13.</bold> Uncertainty distribution of image quality.</p>
        <p><xref ref-type="fig" rid="fig13">Figure 13</xref>’s uncertainty distribution plots show clear trends at various image quality levels. Uncertainty ratings for high-quality photos, shown by the green curve, are closely clustered around 0.1, indicating dependable and confident forecasts. In comparison to high-quality inputs, medium-quality photos, shown by the orange curve, have a wider dispersion centered on 0.4, indicating moderate uncertainty and decreased confidence. The red curve, which represents low-quality photos, on the other hand, shows a large spread with a peak close to 0.7, indicating substantial uncertainty and significantly reduced model confidence. These results support the reasoning behind the H-UQ-MFF fusion technique, which ensures safety and reliability in real-world clinical deployment by correctly shifting dependence from deep features to structural features and prompting clinician review in response to rising uncertainty.</p>
        <p>Clinical integration and regulatory preparedness are essential for putting AI research into reality. The intended use definition, external validation, risk management, drift monitoring, workflow integration, and documentation are all included in the comprehensive approach established by the proposed framework [<xref ref-type="bibr" rid="B38">38</xref>]. The system delivers clinical safety and reproducibility in addition to state-of-the-art performance by meeting both technological and regulatory standards. This all-encompassing strategy guarantees that glaucoma AI may advance from research prototypes to a reliable tool in ophthalmic practice, ultimately enhancing patient outcomes through early identification and care.</p>
      </sec>
    </sec>
    <sec id="sec7">
      <title>7. Discussion</title>
      <p>The outcomes of the suggested H-UQ-MFF (Hybrid Uncertainty-Aware Multi-Feature Fusion) framework show both clinical significance and technical superiority. This part presents an interpretation of the results, a comparison with current baselines, a discussion of the implications for clinical treatment, and an outline of the limits and future research prospects [<xref ref-type="bibr" rid="B39">39</xref>].</p>
      <sec id="sec7dot1">
        <title>7.1. Interpretation of Results</title>
        <p>H-UQ-MFF outperformed ResNet50, EfficientNet-B0, and Deep + Struct baselines with an AUC of 0.9969, sensitivity of 0.9811, specificity of 0.9717, and F1-score of 0.9780, according to internal validation. These findings demonstrate that uncertainty-aware fusion improves discriminative power without sacrificing reliability. Adaptive weighting is crucial, as evidenced by the improvements of about +0.03 in sensitivity, specificity, accuracy, and F1 when compared to fusion without uncertainty. Generalizability was further validated by external validation, which outperformed Deep Ensemble UQ baselines with AUCs of 0.89 on REFUGE and 0.87 on PAPILA. With ECE values of 0.028 and 0.032, H-UQ-MFF had the lowest calibration error, guaranteeing reliable confidence scores. These results show that uncertainty quantification increases safety in addition to performance. The model reduces the hazards related to low-quality inputs and out-of-distribution samples by dynamically varying emphasis on deep versus structural features [<xref ref-type="bibr" rid="B40">40</xref>]. In clinical situations, when variations in image quality and demography are unavoidable, this adaptive process is especially helpful.</p>
      </sec>
      <sec id="sec7dot2">
        <title>7.2. Comparison with Baselines</title>
        <p>H-UQ-MFF significantly outperformed ResNet50 and EfficientNet-B0 in terms of accuracy and calibration. Deep-only models have good discriminative capability, but they were susceptible to low-quality inputs. Although they were not sensitive, structural-only models provided stability [<xref ref-type="bibr" rid="B41">41</xref>]. Performance was enhanced by fusion without ambiguity, although overconfidence was a concern. The optimal balance was attained using Full H-UQ-MFF, demonstrating the importance of uncertainty-aware fusion for clinical dependability. Top-performing models with AUCs around 0.90, sensitivity close to 0.85, and little calibration analysis were reported by the AIROGS challenge. By including uncertainty quantification, calibration, and regulatory readiness, our framework outperforms these benchmarks and achieves higher AUCs both internally and externally. As a result, H-UQ-MFF becomes the new benchmark for glaucoma AI translation. <bold>Table 9</bold> summarizes the comparison of existing and previous work.</p>
      </sec>
      <sec id="sec7dot3">
        <title>7.3. Clinical Implications</title>
        <p>These discoveries have important clinical ramifications. First, by ensuring that </p>
        <p><bold>Table 9.</bold> Comparison of existing and previous work.</p>
        <table-wrap id="tbl9">
          <label>Table 9</label>
          <table>
            <tbody>
              <tr>
                <td>
                  <bold>Method</bold>
                </td>
                <td>
                  <bold>Dataset</bold>
                </td>
                <td>
                  <bold>Internal AUC</bold>
                </td>
                <td>
                  <bold>External AUC</bold>
                </td>
                <td>
                  <bold>Uncertainty</bold>
                </td>
                <td>
                  <bold>Clinical</bold>
                </td>
              </tr>
              <tr>
                <td>AIROGS Best (2023)</td>
                <td>Rotterdam</td>
                <td>0.90</td>
                <td>No</td>
                <td>No</td>
                <td>No</td>
              </tr>
              <tr>
                <td>H-UQ-MFF</td>
                <td>EyePACS-light-V2</td>
                <td>0.9969</td>
                <td>Yes</td>
                <td>Yes</td>
                <td>Yes</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>high-risk cases are marked for human review, the inclusion of uncertainty quantification lowers the possibility of missed diagnosis. Second, calibration prevents overconfidence and builds trust by bringing confidence levels into line with actual accuracy [<xref ref-type="bibr" rid="B42">42</xref>]. Third, bias analysis ensures equitable performance by verifying fairness across demographic groupings. Fourth, usability and clinician confidence are improved by workflow integration with PACS/EMR systems and interpretability capabilities like Grad-CAM heatmaps.</p>
        <p>The system is in line with real-world workflows by classifying predictions into auto-decision, clinician support, and manual review based on uncertainty criteria [<xref ref-type="bibr" rid="B43">43</xref>]-[<xref ref-type="bibr" rid="B49">49</xref>]. While clinical assistance guarantees that doubtful situations receive professional care, automated triage can lessen the burden. High-risk errors are prevented by manual review. The technology is feasible for clinical deployment thanks to this tiered approach, which strikes a balance between efficiency and safety.</p>
      </sec>
      <sec id="sec7dot4">
        <title>7.4. Cost-Effectiveness Analysis</title>
        <p>When compared to conventional manual evaluation, the incorporation of AI-assisted glaucoma screening shows significant financial advantages. There is a considerable increase in operational efficiency as the cost per screening is lowered from roughly $45 to $12. A 15% increase in early detection rates and a 73% decrease in total screening expenses are confirmed by a return on investment (ROI) analysis. These results emphasize the benefits of both cost reductions and improved clinical results, highlighting the need to implement the H-UQ-MFF framework in extensive ocular screening programs.</p>
      </sec>
      <sec id="sec7dot5">
        <title>7.5. Regulatory Pathway Timeline</title>
        <p>To guarantee the timely and compliant implementation of the suggested framework, a systematic regulatory roadmap has been created. A 510(k) filing is planned for Q2 2026 after the FDA pre-submission in Q4 2025. Clearance is expected in Q4 2026 based on previous schedules for comparable AI-enabled medical equipment. Transparency and predictability are provided by this stepwise approach, which supports the framework’s shift from innovative research to regulated clinical application and aligns it with regulatory expectations.</p>
      </sec>
      <sec id="sec7dot6">
        <title>7.6. Limitations</title>
        <p>The framework has drawbacks despite its advantages. First, the REFUGE and PAPILA datasets—which, despite their diversity, might not accurately reflect global variability were used for external validation. Larger, multi-center datasets require more validation. Second, the extraction of structural features depended on basic thresholding methods that can be affected by the quality of the image. Accuracy could be increased by using more sophisticated segmentation techniques. Third, Monte Carlo dropout, an efficient but computationally demanding method, was used to quantify uncertainty. Efficiency improvements may be possible with alternative techniques like Bayesian neural networks or deep ensembles. Fourth, temperature scaling was used for calibration, which would not completely correct miscalibration in severe circumstances. Further research could examine sophisticated calibration methods like Bayesian binning or Dirichlet calibration.</p>
      </sec>
      <sec id="sec7dot7">
        <title>7.7. Future Directions</title>
        <p>Expanding validation to a wider range of populations and imaging modalities, such as optical coherence tomography (OCT), should be the main goal of future research. Performance could be further improved by integrating multimodal data, such as clinical metadata (age, intraocular pressure, family history). Investigating federated learning strategies would solve privacy issues by enabling training across institutions without exchanging raw data. Quantization and pruning could help create lightweight models that are easier to deploy in environments with restricted resources. Lastly, long-term research is required to evaluate how AI-assisted glaucoma screening affects patient outcomes, such as early identification rates and the start of therapy.</p>
      </sec>
      <sec id="sec7dot8">
        <title>7.8. Broader Impact</title>
        <p>The suggested approach has wider implications for medical AI than just glaucoma. A framework for converting AI systems across disciplines is provided by the integration of uncertainty quantification, adaptive fusion, calibration, and regulatory preparedness. The framework creates a repeatable route for the safe and efficient application of AI by tackling both technological and clinical issues. By ensuring that discoveries advance beyond research prototypes to enhance patient care, this advances the more general objective of standardizing medical AI translation.</p>
        <p>In its final stage, the discussion shows that H-UQ-MFF addresses important clinical translation issues while achieving state-of-the-art performance. The framework exhibits better accuracy, calibration, and generalizability as compared to baselines. Improved safety, equity, and workflow integration are among the clinical implications. While future prospects focus on multimodal integration, federated learning, and longitudinal investigations, limitations point to areas for development. Beyond glaucoma, the wider impact offers a model for medical AI translation. Together, these efforts close the gap between cutting-edge research and practical application by establishing H-UQ-MFF as a standard for clinically-translatable glaucoma AI.</p>
      </sec>
    </sec>
    <sec id="sec8">
      <title>8. Conclusions</title>
      <p>Using the EyePACS-AIROGS-light-V2 dataset and evaluating performance on external datasets like REFUGE and PAPILA, this work offers a thorough and clinically focused methodology for implementing glaucoma AI systems. Fundamentally, the proposed Hybrid Uncertainty-Aware Multi-Feature Fusion (H-UQ-MFF) model integrates structural and texture-based ocular indicators with deep learning features. The weighting of these attributes is dynamically guided by predictive uncertainty, guaranteeing strong performance even when inputs are of low quality or outside of the distribution. The system outperforms baseline deep learning models in terms of accuracy, sensitivity, specificity, and calibration through thorough internal and external validation.</p>
      <p>The framework makes a number of significant contributions. First, it integrates clinically significant structural parameters, such as cup-to-disc ratio, rim thickness, ISNT rule, and texture descriptors, with deep features extracted from ResNet50 to provide strong discriminative capability while maintaining interpretability. Second, by modifying feature contributions according to predicted variance, the uncertainty-aware fusion technique improves clinical safety. Third, by matching expected probabilities with actual performance, temperature scaling greatly increases reliability. Equitable performance is ensured by external validation and bias evaluation, which further show generalizability across demographic groups and imaging equipment. The framework includes FDA-aligned review processes, such as intended use definitions, risk registries, calibration analyses, and drift monitoring plans, in addition to technological innovation. Uncertainty thresholds, PACS/EMR compatibility, and Grad-CAM explanations are examples of workflow integration elements that facilitate easy adoption in clinical contexts. All things considered, our work creates a repeatable process for converting glaucoma AI from research to practice, establishing a new standard for clinical preparedness, safety, and interpretability. To increase scalability and therapeutic impact, future objectives include multi-center validation, federated learning, multimodal integration, model compression, and longitudinal outcome studies.</p>
    </sec>
    <sec id="sec9">
      <title>Appendix</title>
      <p>The risk registers, bias monitoring, drift tracking, external validation, and FDA-aligned evaluation methods that are necessary to guarantee the safe and compliant clinical translation of the H-UQ-MFF framework are all included in <bold>Table A1</bold>’s regulatory-ready deployment checklist.</p>
      <p><bold>Table A1</bold><bold>.</bold>Regulatory-ready deployment checklist.</p>
      <table-wrap id="tbl10">
        <label>Table 10</label>
        <table>
          <tbody>
            <tr>
              <td>Category</td>
              <td>Artifacts/Details</td>
            </tr>
            <tr>
              <td>Intended Use</td>
              <td>- AI-assisted glaucoma screening (referable vs. non-referable classification)- Not a standalone diagnostic tool; designed to augment clinician decision-making</td>
            </tr>
            <tr>
              <td>Risk Register</td>
              <td>- Risks identified: domain drift, demographic bias, false negatives- Mitigation: drift monitoring, quarterly bias audits, threshold tuning to minimize false negatives</td>
            </tr>
            <tr>
              <td>Monitoring Plan</td>
              <td>- Monthly calibration checks (AUC, ECE)- Quarterly bias analysis across demographic subgroups- Annual external validation (REFUGE, PAPILA)- Post-market surveillance with clinician feedback</td>
            </tr>
            <tr>
              <td>Deployment Artifacts</td>
              <td>- API integration notes for PACS/EMR compatibility- Interpretability modules: Grad-CAM overlays, uncertainty color coding (green/yellow/red)- Documentation of uncertainty thresholds (auto, assist, manual review)- User training materials for clinicians</td>
            </tr>
            <tr>
              <td>Post-Market Procedures</td>
              <td>- Drift tracking dashboard with automated alerts- Risk table updates every 6 months- Feedback collection from clinical users to refine thresholds and workflows</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
    </sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      <ref id="B1">
        <label>1.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Roy, A., <italic>et</italic><italic>al.</italic> (2025) DeepEyeNet: Adaptive Genetic Bayesian Algorithm Based Hybrid ConvNeXtTiny Framework for Multi-Feature Glaucoma Eye Diagnosis. arXiv: 2501.11168.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Roy, A.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>DeepEyeNet: Adaptive Genetic Bayesian Algorithm Based Hybrid ConvNeXtTiny Framework for Multi-Feature Glaucoma Eye Diagnosis</article-title>
            <fpage>2501</fpage>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B2">
        <label>2.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Ming, H.K., Tze, V.W., Shiong, B.L.C. and Then, P.H.H. (2025) Evaluation of Post-Hoc Explainability Methods for Glaucoma Classification Using Fundus Images. In: Arai, K., Ed., <italic>Intelligent Computing</italic>, Springer, 650-667. https://doi.org/10.1007/978-3-031-92605-1_40 <pub-id pub-id-type="doi">10.1007/978-3-031-92605-1_40</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-3-031-92605-1_40">https://doi.org/10.1007/978-3-031-92605-1_40</ext-link></mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Ming, H.K.</string-name>
              <string-name>Tze, V.W.</string-name>
              <string-name>Shiong, B.L.C.</string-name>
              <string-name>Then, P.H.H.</string-name>
              <string-name>Arai, K.</string-name>
              <string-name>Computing, S</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Evaluation of Post-Hoc Explainability Methods for Glaucoma Classification Using Fundus Images</article-title>
            <source>In: Arai</source>
            <volume>650</volume>
            <pub-id pub-id-type="doi">10.1007/978-3-031-92605-1_40</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B3">
        <label>3.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Kumar, S., Dixit, V. and Gupta, M. (2025) Deep Learning Approaches for Retinal Disease Diagnosis: Insights from Fundus and OCT Analysis. <italic>EPJ</italic><italic>Web</italic><italic>of</italic><italic>Confer</italic><italic>ences</italic>, 328, Article ID: 01041. https://doi.org/10.1051/epjconf/202532801041 <pub-id pub-id-type="doi">10.1051/epjconf/202532801041</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1051/epjconf/202532801041">https://doi.org/10.1051/epjconf/202532801041</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Kumar, S.</string-name>
              <string-name>Dixit, V.</string-name>
              <string-name>Gupta, M.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Deep Learning Approaches for Retinal Disease Diagnosis: Insights from Fundus and OCT Analysis</article-title>
            <source>EPJ Web of Conferences</source>
            <volume>328</volume>
            <fpage>01041</fpage>
            <elocation-id>ID</elocation-id>
            <pub-id pub-id-type="doi">10.1051/epjconf/202532801041</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B4">
        <label>4.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">de Vente, C., Vermeer, K.A., Jaccard, N., Wang, H., Sun, H., Khader, F., <italic>et</italic><italic>al.</italic> (2024) AIROGS: Artificial Intelligence for Robust Glaucoma Screening Challenge. <italic>IEEE</italic><italic>Transactions</italic><italic>on</italic><italic>Medical</italic><italic>Imaging</italic>, 43, 542-557. https://doi.org/10.1109/tmi.2023.3313786 <pub-id pub-id-type="doi">10.1109/tmi.2023.3313786</pub-id><pub-id pub-id-type="pmid">37713220</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/tmi.2023.3313786">https://doi.org/10.1109/tmi.2023.3313786</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Vente, C.</string-name>
              <string-name>Vermeer, K.A.</string-name>
              <string-name>Jaccard, N.</string-name>
              <string-name>Wang, H.</string-name>
              <string-name>Sun, H.</string-name>
              <string-name>Khader, F.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>AIROGS: Artificial Intelligence for Robust Glaucoma Screening Challenge</article-title>
            <source>IEEE Transactions on Medical Imaging</source>
            <volume>43</volume>
            <pub-id pub-id-type="doi">10.1109/tmi.2023.3313786</pub-id>
            <pub-id pub-id-type="pmid">37713220</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B5">
        <label>5.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Feng, Y., Wu, C. and Zhou, Y. (2024) DG2Net: A MLP-Based Dynamixing Gate and Depthwise Group Norm Network for Classification of Glaucoma. In: Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, C.L., Bhattacharya, S. and Pal, U., Eds., <italic>Pattern Recognition</italic>, Springer, 295-308. https://doi.org/10.1007/978-3-031-78383-8_20 <pub-id pub-id-type="doi">10.1007/978-3-031-78383-8_20</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-3-031-78383-8_20">https://doi.org/10.1007/978-3-031-78383-8_20</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Feng, Y.</string-name>
              <string-name>Wu, C.</string-name>
              <string-name>Zhou, Y.</string-name>
              <string-name>Antonacopoulos, A.</string-name>
              <string-name>Chaudhuri, S.</string-name>
              <string-name>Chellappa, R.</string-name>
              <string-name>Liu, C.L.</string-name>
              <string-name>Bhattacharya, S.</string-name>
              <string-name>Pal, U.</string-name>
              <string-name>Recognition, S</string-name>
            </person-group>
            <year>2024</year>
            <article-title>DG2Net: A MLP-Based Dynamixing Gate and Depthwise Group Norm Network for Classification of Glaucoma</article-title>
            <source>In: Antonacopoulos</source>
            <volume>295</volume>
            <pub-id pub-id-type="doi">10.1007/978-3-031-78383-8_20</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B6">
        <label>6.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Wang, H. (2025) Regulating AI Medical Devices: A Clinically Anchored, Adaptive Approach. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5575550</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Wang, H.</string-name>
              <string-name>Anchored, A</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Regulating AI Medical Devices: A Clinically Anchored, Adaptive Approach</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B7">
        <label>7.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Roy, M. (2025) Effect of Screen Time on Glaucoma. https://openreview.net/pdf?id=BuK3fZNS9U</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Roy, M.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Effect of Screen Time on Glaucoma</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B8">
        <label>8.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Nguyen, D.M., <italic>et</italic><italic>al.</italic> (2025) Deep Learning for Ophthalmology: The State-Of-The-Art and Future Trends. arXiv: 2501.04073.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Nguyen, D.M.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Deep Learning for Ophthalmology: The State-Of-The-Art and Future Trends</article-title>
            <fpage>2501</fpage>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B9">
        <label>9.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Alvi, M.D.M., Ubaid, D., Ul Wara, K., Zaidi, F.R.S., Anwar, S.M., Javed, A., <italic>et al</italic>. (2025) Evaluating Machine Learning Classifiers for Automated Glaucoma Detection Using Fundus Images. <italic>Biomedical Materials &amp; Devices</italic>. https://doi.org/10.1007/s44174-025-00569-x <pub-id pub-id-type="doi">10.1007/s44174-025-00569-x</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s44174-025-00569-x">https://doi.org/10.1007/s44174-025-00569-x</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Alvi, M.D.M.</string-name>
              <string-name>Ubaid, D.</string-name>
              <string-name>Wara, K.</string-name>
              <string-name>Zaidi, F.R.S.</string-name>
              <string-name>Anwar, S.M.</string-name>
              <string-name>Javed, A.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Evaluating Machine Learning Classifiers for Automated Glaucoma Detection Using Fundus Images</article-title>
            <pub-id pub-id-type="doi">10.1007/s44174-025-00569-x</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B10">
        <label>10.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Luo, X., <italic>et</italic><italic>al.</italic> (2025) A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models. arXiv: 2508.03734.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Luo, X.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models</article-title>
            <fpage>2508</fpage>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B11">
        <label>11.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Baggu, A., Polisetti, V.A. and Nidamanuri, J. (2024) A Novel Attention Informed Voting Ensemble Framework for Retinal Disease Classification. 2024 <italic>International Conference on Emerging Techniques in Computational Intelligence</italic>( <italic>ICETCI</italic>), Hyderabad, 22-24 August 2024, 230-237. https://doi.org/10.1109/icetci62771.2024.10704188 <pub-id pub-id-type="doi">10.1109/icetci62771.2024.10704188</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/icetci62771.2024.10704188">https://doi.org/10.1109/icetci62771.2024.10704188</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Baggu, A.</string-name>
              <string-name>Polisetti, V.A.</string-name>
              <string-name>Nidamanuri, J.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>A Novel Attention Informed Voting Ensemble Framework for Retinal Disease Classification</article-title>
            <source>2024 International Conference on Emerging Techniques in Computational Intelligence (ICETCI)</source>
            <volume>22</volume>
            <pub-id pub-id-type="doi">10.1109/icetci62771.2024.10704188</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B12">
        <label>12.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Remyes, D., Nasef, D., Remyes, S., Tawfellos, J., Sher, M., Nasef, D., <italic>et</italic><italic>al.</italic> (2025) Clinical Applicability and Cross-Dataset Validation of Machine Learning Models for Binary Glaucoma Detection. <italic>Information</italic>, 16, Article 432. https://doi.org/10.3390/info16060432 <pub-id pub-id-type="doi">10.3390/info16060432</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/info16060432">https://doi.org/10.3390/info16060432</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Remyes, D.</string-name>
              <string-name>Nasef, D.</string-name>
              <string-name>Remyes, S.</string-name>
              <string-name>Tawfellos, J.</string-name>
              <string-name>Sher, M.</string-name>
              <string-name>Nasef, D.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Clinical Applicability and Cross-Dataset Validation of Machine Learning Models for Binary Glaucoma Detection</article-title>
            <source>Information</source>
            <volume>16</volume>
            <elocation-id>432</elocation-id>
            <pub-id pub-id-type="doi">10.3390/info16060432</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B13">
        <label>13.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">de Sousa, J.G., Brito, L.E.S., Dalília, A., Veloso, R.R., de Carvalho Filho, A.O. and de Araújo, F.H.D. (2025) GlaucoAware: Desenvolvimento de uma ferramenta de classificação de glaucoma em imagens do fundo ocular utilizando CNNs. <italic>Revista de Sistemas e Computação</italic>- <italic>RSC</italic>, 15, 55-62.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Sousa, J.G.</string-name>
              <string-name>Brito, L.E.S.</string-name>
              <string-name>Veloso, R.R.</string-name>
              <string-name>Filho, A.O.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>GlaucoAware: Desenvolvimento de uma ferramenta de classificação de glaucoma em imagens do fundo ocular utilizando CNNs</article-title>
            <source>Revista de Sistemas e Computação-RSC</source>
            <volume>15</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B14">
        <label>14.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Das, P., Nirmala, S.R. and Medhi, J.P. (2015) Diagnosis of Glaucoma Using CDR and NRR Area in Retina Images. <italic>Network</italic><italic>Modeling</italic><italic>Analysis</italic><italic>in</italic><italic>Health</italic><italic>Informatics</italic><italic>and</italic><italic>Bioinformatics</italic>, 5, Article No. 3. https://doi.org/10.1007/s13721-015-0110-5 <pub-id pub-id-type="doi">10.1007/s13721-015-0110-5</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s13721-015-0110-5">https://doi.org/10.1007/s13721-015-0110-5</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Das, P.</string-name>
              <string-name>Nirmala, S.R.</string-name>
              <string-name>Medhi, J.P.</string-name>
            </person-group>
            <year>2015</year>
            <article-title>Diagnosis of Glaucoma Using CDR and NRR Area in Retina Images</article-title>
            <source>Network Modeling Analysis in Health Informatics and Bioinformatics</source>
            <volume>5</volume>
            <elocation-id>No</elocation-id>
            <pub-id pub-id-type="doi">10.1007/s13721-015-0110-5</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B15">
        <label>15.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">MacCormick, I.J.C., Williams, B.M., Zheng, Y., Li, K., Al-Bander, B., Czanner, S., <italic>et</italic><italic>al.</italic> (2019) Accurate, Fast, Data Efficient and Interpretable Glaucoma Diagnosis with Automated Spatial Analysis of the Whole Cup to Disc Profile. <italic>PLOS ONE</italic>, 14, e0209409. https://doi.org/10.1371/journal.pone.0209409 <pub-id pub-id-type="doi">10.1371/journal.pone.0209409</pub-id><pub-id pub-id-type="pmid">30629635</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0209409">https://doi.org/10.1371/journal.pone.0209409</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>MacCormick, I.J.C.</string-name>
              <string-name>Williams, B.M.</string-name>
              <string-name>Zheng, Y.</string-name>
              <string-name>Li, K.</string-name>
              <string-name>Al-Bander, B.</string-name>
              <string-name>Czanner, S.</string-name>
              <string-name>Accurate, F</string-name>
            </person-group>
            <year>2019</year>
            <article-title>Accurate, Fast, Data Efficient and Interpretable Glaucoma Diagnosis with Automated Spatial Analysis of the Whole Cup to Disc Profile</article-title>
            <source>PLOS ONE</source>
            <volume>14</volume>
            <pub-id pub-id-type="doi">10.1371/journal.pone.0209409</pub-id>
            <pub-id pub-id-type="pmid">30629635</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B16">
        <label>16.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Law, S.K., Kornmann, H.L., Nilforushan, N., Moghimi, S. and Caprioli, J. (2016) Evaluation of the “IS” Rule to Differentiate Glaucomatous Eyes from Normal. <italic>Journal</italic><italic>of</italic><italic>Glaucoma</italic>, 25, 27-32. https://doi.org/10.1097/ijg.0000000000000072 <pub-id pub-id-type="doi">10.1097/ijg.0000000000000072</pub-id><pub-id pub-id-type="pmid">24844540</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1097/ijg.0000000000000072">https://doi.org/10.1097/ijg.0000000000000072</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Law, S.K.</string-name>
              <string-name>Kornmann, H.L.</string-name>
              <string-name>Nilforushan, N.</string-name>
              <string-name>Moghimi, S.</string-name>
              <string-name>Caprioli, J.</string-name>
            </person-group>
            <year>2016</year>
            <article-title>Evaluation of the “IS” Rule to Differentiate Glaucomatous Eyes from Normal</article-title>
            <source>Journal of Glaucoma</source>
            <volume>25</volume>
            <pub-id pub-id-type="doi">10.1097/ijg.0000000000000072</pub-id>
            <pub-id pub-id-type="pmid">24844540</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B17">
        <label>17.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Das, P., Nirmala, S.R. and Medhi, J.P. (2016) Detection of Glaucoma Using Neuroretinal Rim Information. 2016 <italic>International Conference on Accessibility to Digital World</italic> ( <italic>ICADW</italic>), Guwahati, 16-18 December 2016, 181-186. https://doi.org/10.1109/icadw.2016.7942538 <pub-id pub-id-type="doi">10.1109/icadw.2016.7942538</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/icadw.2016.7942538">https://doi.org/10.1109/icadw.2016.7942538</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Das, P.</string-name>
              <string-name>Nirmala, S.R.</string-name>
              <string-name>Medhi, J.P.</string-name>
            </person-group>
            <year>2016</year>
            <article-title>Detection of Glaucoma Using Neuroretinal Rim Information</article-title>
            <source>2016 International Conference on Accessibility to Digital World (ICADW)</source>
            <volume>16</volume>
            <pub-id pub-id-type="doi">10.1109/icadw.2016.7942538</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B18">
        <label>18.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">MacCormick, I.J.C., <italic>et al.</italic> (2019) Accurate, Fast, Data Efficient and Interpretable Glaucoma Diagnosis with Automated Spatial Analysis of the Whole Cup to Disc Profile. <italic>PLOS ONE</italic>, 14, e0209409.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>MacCormick, I.J.C.</string-name>
              <string-name>Accurate, F</string-name>
            </person-group>
            <year>2019</year>
            <article-title>Accurate, Fast, Data Efficient and Interpretable Glaucoma Diagnosis with Automated Spatial Analysis of the Whole Cup to Disc Profile</article-title>
            <source>PLOS ONE</source>
            <volume>14</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B19">
        <label>19.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Neto, A., Camara, J. and Cunha, A. (2022) Evaluations of Deep Learning Approaches for Glaucoma Screening Using Retinal Images from Mobile Device. <italic>Sensors</italic>, 22, Article 1449. https://doi.org/10.3390/s22041449 <pub-id pub-id-type="doi">10.3390/s22041449</pub-id><pub-id pub-id-type="pmid">35214351</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/s22041449">https://doi.org/10.3390/s22041449</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Neto, A.</string-name>
              <string-name>Camara, J.</string-name>
              <string-name>Cunha, A.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Evaluations of Deep Learning Approaches for Glaucoma Screening Using Retinal Images from Mobile Device</article-title>
            <source>Sensors</source>
            <volume>22</volume>
            <elocation-id>1449</elocation-id>
            <pub-id pub-id-type="doi">10.3390/s22041449</pub-id>
            <pub-id pub-id-type="pmid">35214351</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B20">
        <label>20.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Choudhary, K. and Tiwari, S. (2015) ANN Glaucoma Detection Using Cup-to-Disk Ratio and Neuroretinal Rim. <italic>International</italic><italic>Journal</italic><italic>of</italic><italic>Computer</italic><italic>Applications</italic>, 111, 8-14. https://doi.org/10.5120/19581-1233 <pub-id pub-id-type="doi">10.5120/19581-1233</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5120/19581-1233">https://doi.org/10.5120/19581-1233</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Choudhary, K.</string-name>
              <string-name>Tiwari, S.</string-name>
            </person-group>
            <year>2015</year>
            <article-title>ANN Glaucoma Detection Using Cup-to-Disk Ratio and Neuroretinal Rim</article-title>
            <source>International Journal of Computer Applications</source>
            <volume>111</volume>
            <pub-id pub-id-type="doi">10.5120/19581-1233</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B21">
        <label>21.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Kiefer, R. (2024) Glaucoma Dataset: EyePACS-AIROGS-Light-V2. Kaggle.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Kiefer, R.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Glaucoma Dataset: EyePACS-AIROGS-Light-V2</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B22">
        <label>22.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Valdez-Rodríguez, J.E., Felipe-Riverón, E.M. and Calvo, H. (2021) Optic Disc Preprocessing for Reliable Glaucoma Detection in Small Datasets. <italic>Mathematics</italic>, 9, Article 2237. https://doi.org/10.3390/math9182237 <pub-id pub-id-type="doi">10.3390/math9182237</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/math9182237">https://doi.org/10.3390/math9182237</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Calvo, H.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Optic Disc Preprocessing for Reliable Glaucoma Detection in Small Datasets</article-title>
            <source>Mathematics</source>
            <volume>9</volume>
            <elocation-id>2237</elocation-id>
            <pub-id pub-id-type="doi">10.3390/math9182237</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B23">
        <label>23.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Veena, H.N., Muruganandham, A. and Senthil Kumaran, T. (2022) A Novel Optic Disc and Optic Cup Segmentation Technique to Diagnose Glaucoma Using Deep Learning Convolutional Neural Network over Retinal Fundus Images. <italic>Journal</italic><italic>of</italic><italic>King</italic><italic>Saud</italic><italic>University</italic>— <italic>Computer</italic><italic>and</italic><italic>Information</italic><italic>Sciences</italic>, 34, 6187-6198. https://doi.org/10.1016/j.jksuci.2021.02.003 <pub-id pub-id-type="doi">10.1016/j.jksuci.2021.02.003</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.jksuci.2021.02.003">https://doi.org/10.1016/j.jksuci.2021.02.003</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Veena, H.N.</string-name>
              <string-name>Muruganandham, A.</string-name>
              <string-name>Kumaran, T.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>A Novel Optic Disc and Optic Cup Segmentation Technique to Diagnose Glaucoma Using Deep Learning Convolutional Neural Network over Retinal Fundus Images</article-title>
            <source>Journal of King Saud University—Computer and Information Sciences</source>
            <volume>34</volume>
            <pub-id pub-id-type="doi">10.1016/j.jksuci.2021.02.003</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B24">
        <label>24.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Kumar, S. and Kumar, B. (2022) Automatic Early Glaucoma Detection by Extracting Parapapillary Atrophy and Optic Disc from Fundus Image Using SVM. <italic>Multimedia</italic><italic>Tools</italic><italic>and</italic><italic>Applications</italic>, 81, 13513-13535. https://doi.org/10.1007/s11042-021-11023-7 <pub-id pub-id-type="doi">10.1007/s11042-021-11023-7</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11042-021-11023-7">https://doi.org/10.1007/s11042-021-11023-7</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Kumar, S.</string-name>
              <string-name>Kumar, B.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Automatic Early Glaucoma Detection by Extracting Parapapillary Atrophy and Optic Disc from Fundus Image Using SVM</article-title>
            <source>Multimedia Tools and Applications</source>
            <volume>81</volume>
            <pub-id pub-id-type="doi">10.1007/s11042-021-11023-7</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B25">
        <label>25.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Elmannai, H., Hamdi, M., Meshoul, S., Alhussan, A.A., Ayadi, M. and Ksibi, A. (2024) An Improved Deep Learning Framework for Automated Optic Disc Localization and Glaucoma Detection. <italic>Computer</italic><italic>Modeling</italic><italic>in</italic><italic>Engineering</italic><italic>&amp;</italic><italic>Sciences</italic>, 140, 1429-1457. https://doi.org/10.32604/cmes.2024.048557 <pub-id pub-id-type="doi">10.32604/cmes.2024.048557</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.32604/cmes.2024.048557">https://doi.org/10.32604/cmes.2024.048557</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Elmannai, H.</string-name>
              <string-name>Hamdi, M.</string-name>
              <string-name>Meshoul, S.</string-name>
              <string-name>Alhussan, A.A.</string-name>
              <string-name>Ayadi, M.</string-name>
              <string-name>Ksibi, A.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>An Improved Deep Learning Framework for Automated Optic Disc Localization and Glaucoma Detection</article-title>
            <source>Computer Modeling in Engineering &amp; Sciences</source>
            <volume>140</volume>
            <pub-id pub-id-type="doi">10.32604/cmes.2024.048557</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B26">
        <label>26.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Joshi, S., Partibane, B., Hatamleh, W.A., Tarazi, H., Yadav, C.S. and Krah, D. (2022) Glaucoma Detection Using Image Processing and Supervised Learning for Classification. <italic>Journal</italic><italic>of</italic><italic>Healthcare</italic><italic>Engineering</italic>, 2022, Article ID: 2988262. https://doi.org/10.1155/2022/2988262 <pub-id pub-id-type="doi">10.1155/2022/2988262</pub-id><pub-id pub-id-type="pmid">35273784</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1155/2022/2988262">https://doi.org/10.1155/2022/2988262</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Joshi, S.</string-name>
              <string-name>Partibane, B.</string-name>
              <string-name>Hatamleh, W.A.</string-name>
              <string-name>Tarazi, H.</string-name>
              <string-name>Yadav, C.S.</string-name>
              <string-name>Krah, D.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Glaucoma Detection Using Image Processing and Supervised Learning for Classification</article-title>
            <source>Journal of Healthcare Engineering</source>
            <volume>2022</volume>
            <fpage>298826</fpage>
            <elocation-id>ID</elocation-id>
            <pub-id pub-id-type="doi">10.1155/2022/2988262</pub-id>
            <pub-id pub-id-type="pmid">35273784</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B27">
        <label>27.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Virbukaitė, S. and Bernatavičienė, J. (2023) Impact of Eye Fundus Image Preprocessing on Key Objects Segmentation for Glaucoma Identification. <italic>Nonlinear</italic><italic>Analysis</italic>: <italic>Modelling</italic><italic>and</italic><italic>Control</italic>, 29, 96-110. https://doi.org/10.15388/namc.2024.29.33669 <pub-id pub-id-type="doi">10.15388/namc.2024.29.33669</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.15388/namc.2024.29.33669">https://doi.org/10.15388/namc.2024.29.33669</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <year>2023</year>
            <article-title>Impact of Eye Fundus Image Preprocessing on Key Objects Segmentation for Glaucoma Identification</article-title>
            <source>Nonlinear Analysis: Modelling and Control</source>
            <volume>29</volume>
            <pub-id pub-id-type="doi">10.15388/namc.2024.29.33669</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B28">
        <label>28.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Khalil, T., Akram, M.U., Raja, H., Jameel, A. and Basit, I. (2018) Detection of Glaucoma Using Cup to Disc Ratio from Spectral Domain Optical Coherence Tomography Images. <italic>IEEE</italic><italic>Access</italic>, 6, 4560-4576. https://doi.org/10.1109/access.2018.2791427 <pub-id pub-id-type="doi">10.1109/access.2018.2791427</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/access.2018.2791427">https://doi.org/10.1109/access.2018.2791427</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Khalil, T.</string-name>
              <string-name>Akram, M.U.</string-name>
              <string-name>Raja, H.</string-name>
              <string-name>Jameel, A.</string-name>
              <string-name>Basit, I.</string-name>
            </person-group>
            <year>2018</year>
            <article-title>Detection of Glaucoma Using Cup to Disc Ratio from Spectral Domain Optical Coherence Tomography Images</article-title>
            <source>IEEE Access</source>
            <volume>6</volume>
            <pub-id pub-id-type="doi">10.1109/access.2018.2791427</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B29">
        <label>29.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Mayya, V., S, S.K., Kulkarni, U., Surya, D.K. and Acharya, U.R. (2022) An Empirical Study of Preprocessing Techniques with Convolutional Neural Networks for Accurate Detection of Chronic Ocular Diseases Using Fundus Images. <italic>Applied</italic><italic>Intelligence</italic>, 53, 1548-1566. https://doi.org/10.1007/s10489-022-03490-8 <pub-id pub-id-type="doi">10.1007/s10489-022-03490-8</pub-id><pub-id pub-id-type="pmid">35528131</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10489-022-03490-8">https://doi.org/10.1007/s10489-022-03490-8</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Mayya, V.</string-name>
              <string-name>Kulkarni, U.</string-name>
              <string-name>Surya, D.K.</string-name>
              <string-name>Acharya, U.R.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>An Empirical Study of Preprocessing Techniques with Convolutional Neural Networks for Accurate Detection of Chronic Ocular Diseases Using Fundus Images</article-title>
            <source>Applied Intelligence</source>
            <volume>53</volume>
            <pub-id pub-id-type="doi">10.1007/s10489-022-03490-8</pub-id>
            <pub-id pub-id-type="pmid">35528131</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B30">
        <label>30.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Zhu, Y., Salowe, R., Chow, C., Li, S., Bastani, O. and O’Brien, J.M. (2024) Advancing Glaucoma Care: Integrating Artificial Intelligence in Diagnosis, Management, and Progression Detection. <italic>Bioengineering</italic>, 11, Article 122. https://doi.org/10.3390/bioengineering11020122 <pub-id pub-id-type="doi">10.3390/bioengineering11020122</pub-id><pub-id pub-id-type="pmid">38391608</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/bioengineering11020122">https://doi.org/10.3390/bioengineering11020122</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Zhu, Y.</string-name>
              <string-name>Salowe, R.</string-name>
              <string-name>Chow, C.</string-name>
              <string-name>Li, S.</string-name>
              <string-name>Bastani, O.</string-name>
              <string-name>Brien, J.M.</string-name>
              <string-name>Diagnosis, M</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Advancing Glaucoma Care: Integrating Artificial Intelligence in Diagnosis, Management, and Progression Detection</article-title>
            <source>Bioengineering</source>
            <volume>11</volume>
            <elocation-id>122</elocation-id>
            <pub-id pub-id-type="doi">10.3390/bioengineering11020122</pub-id>
            <pub-id pub-id-type="pmid">38391608</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B31">
        <label>31.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Goldmann, N., Skalicky, S.E., Weinreb, R.N., Paletta Guedes, R.A., Baudouin, C., Zhang, X., <italic>et</italic><italic>al.</italic> (2023) Defining Functional Requirements for a Patient-Centric Computerized Glaucoma Treatment and Care Ecosystem. <italic>Journal</italic><italic>of</italic><italic>Medical</italic><italic>Artificial</italic><italic>Intelligence</italic>, 6, Article 3. https://doi.org/10.21037/jmai-22-33 <pub-id pub-id-type="doi">10.21037/jmai-22-33</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.21037/jmai-22-33">https://doi.org/10.21037/jmai-22-33</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Goldmann, N.</string-name>
              <string-name>Skalicky, S.E.</string-name>
              <string-name>Weinreb, R.N.</string-name>
              <string-name>Guedes, R.A.</string-name>
              <string-name>Baudouin, C.</string-name>
              <string-name>Zhang, X.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Defining Functional Requirements for a Patient-Centric Computerized Glaucoma Treatment and Care Ecosystem</article-title>
            <source>Journal of Medical Artificial Intelligence</source>
            <volume>6</volume>
            <elocation-id>3</elocation-id>
            <pub-id pub-id-type="doi">10.21037/jmai-22-33</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B32">
        <label>32.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Wong, K. (2025) Innovative Vision Glasses for Glaucoma Detection and Management. In: Rath, M. and Samal, T., Eds., <italic>Key</italic><italic>Issues</italic><italic>in</italic><italic>Network</italic><italic>Protocols</italic><italic>and</italic><italic>Security</italic>, IntechOpen. https://doi.org/10.5772/intechopen.1006973 <pub-id pub-id-type="doi">10.5772/intechopen.1006973</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5772/intechopen.1006973">https://doi.org/10.5772/intechopen.1006973</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Wong, K.</string-name>
              <string-name>Rath, M.</string-name>
              <string-name>Samal, T.</string-name>
              <string-name>Security, I</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Innovative Vision Glasses for Glaucoma Detection and Management</article-title>
            <source>In: Rath</source>
            <pub-id pub-id-type="doi">10.5772/intechopen.1006973</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B33">
        <label>33.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Zeppieri, M., Gagliano, C., Tognetto, D., Musa, M., Avitabile, A., D’Esposito, F., <italic>et</italic><italic>al.</italic> (2025) Augmented Decisions: AI-Enhanced Accuracy in Glaucoma Diagnosis and Treatment. <italic>Journal</italic><italic>of</italic><italic>Clinical</italic><italic>Medicine</italic>, 14, Article 6519. https://doi.org/10.3390/jcm14186519 <pub-id pub-id-type="doi">10.3390/jcm14186519</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/jcm14186519">https://doi.org/10.3390/jcm14186519</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Zeppieri, M.</string-name>
              <string-name>Gagliano, C.</string-name>
              <string-name>Tognetto, D.</string-name>
              <string-name>Musa, M.</string-name>
              <string-name>Avitabile, A.</string-name>
              <string-name>Esposito, F.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Augmented Decisions: AI-Enhanced Accuracy in Glaucoma Diagnosis and Treatment</article-title>
            <source>Journal of Clinical Medicine</source>
            <volume>14</volume>
            <elocation-id>6519</elocation-id>
            <pub-id pub-id-type="doi">10.3390/jcm14186519</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B34">
        <label>34.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Tappeiner, C. (2025) Artificial Intelligence in Ophthalmology: Acceptance, Clinical Integration, and Educational Needs in Switzerland. <italic>Journal</italic><italic>of</italic><italic>Clinical</italic><italic>Medicine</italic>, 14, Article 6307. https://doi.org/10.3390/jcm14176307 <pub-id pub-id-type="doi">10.3390/jcm14176307</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/jcm14176307">https://doi.org/10.3390/jcm14176307</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Tappeiner, C.</string-name>
              <string-name>Acceptance, C</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Artificial Intelligence in Ophthalmology: Acceptance, Clinical Integration, and Educational Needs in Switzerland</article-title>
            <source>Journal of Clinical Medicine</source>
            <volume>14</volume>
            <elocation-id>6307</elocation-id>
            <pub-id pub-id-type="doi">10.3390/jcm14176307</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B35">
        <label>35.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Pathmakumara, H.C. and Perera, G. (2025) Explainable Deep Learning for Glaucoma Detection: A DenseNet121-Based Classification with Grad-CAM Visualization. medRxiv. https://doi.org/10.1101/2025.10.08.25337634 <pub-id pub-id-type="doi">10.1101/2025.10.08.25337634</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1101/2025.10.08.25337634">https://doi.org/10.1101/2025.10.08.25337634</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Pathmakumara, H.C.</string-name>
              <string-name>Perera, G.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Explainable Deep Learning for Glaucoma Detection: A DenseNet121-Based Classification with Grad-CAM Visualization</article-title>
            <pub-id pub-id-type="doi">10.1101/2025.10.08.25337634</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B36">
        <label>36.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Santone, A., Cesarelli, M., Colasuonno, E., Bevilacqua, V. and Mercaldo, F. (2024) A Method for Ocular Disease Diagnosis through Visual Prediction Explainability. <italic>Electronics</italic>, 13, Article 2706. https://doi.org/10.3390/electronics13142706 <pub-id pub-id-type="doi">10.3390/electronics13142706</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/electronics13142706">https://doi.org/10.3390/electronics13142706</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Santone, A.</string-name>
              <string-name>Cesarelli, M.</string-name>
              <string-name>Colasuonno, E.</string-name>
              <string-name>Bevilacqua, V.</string-name>
              <string-name>Mercaldo, F.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>A Method for Ocular Disease Diagnosis through Visual Prediction Explainability</article-title>
            <source>Electronics</source>
            <volume>13</volume>
            <elocation-id>2706</elocation-id>
            <pub-id pub-id-type="doi">10.3390/electronics13142706</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B37">
        <label>37.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Ennab, M. and Mcheick, H. (2025) Advancing AI Interpretability in Medical Imaging: A Comparative Analysis of Pixel-Level Interpretability and Grad-Cam Models. <italic>Machine</italic><italic>Learning</italic><italic>and</italic><italic>Knowledge</italic><italic>Extraction</italic>, 7, Article 12. https://doi.org/10.3390/make7010012 <pub-id pub-id-type="doi">10.3390/make7010012</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/make7010012">https://doi.org/10.3390/make7010012</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Ennab, M.</string-name>
              <string-name>Mcheick, H.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Advancing AI Interpretability in Medical Imaging: A Comparative Analysis of Pixel-Level Interpretability and Grad-Cam Models</article-title>
            <source>Machine Learning and Knowledge Extraction</source>
            <volume>7</volume>
            <elocation-id>12</elocation-id>
            <pub-id pub-id-type="doi">10.3390/make7010012</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B38">
        <label>38.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Lemij, H.G., de Vente, C., Sánchez, C., Cuadros, J., Jaccard, N. and Vermeer, K. (2022) Glaucomatous Features in Fundus Photographs of Eyes with ‘Referable Glaucoma’ of a Large Population Based Labeled Data Set for Training an Artificial Intelligence (AI) Algorithm for Glaucoma Screening. <italic>Investigative</italic><italic>Ophthalmology</italic><italic>&amp;</italic><italic>Visual</italic><italic>Science</italic>, 63, 2041-A0482.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Lemij, H.G.</string-name>
              <string-name>Vente, C.</string-name>
              <string-name>Cuadros, J.</string-name>
              <string-name>Jaccard, N.</string-name>
              <string-name>Vermeer, K.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Glaucomatous Features in Fundus Photographs of Eyes with ‘Referable Glaucoma’ of a Large Population Based Labeled Data Set for Training an Artificial Intelligence (AI) Algorithm for Glaucoma Screening</article-title>
            <source>Investigative Ophthalmology &amp; Visual Science</source>
            <volume>63</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B39">
        <label>39.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Farhad, M.A., Razaque, A., Mukhanov, S.B., Hassan, D.S.M. and Mohan Rai, H. (2025) Enhanced Lesion Localization and Classification in Ocular Tumor Detection Using Grad-Cam and Transfer Learning. <italic>IEEE</italic><italic>Access</italic>, 13, 167762-167777. https://doi.org/10.1109/access.2025.3610183 <pub-id pub-id-type="doi">10.1109/access.2025.3610183</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/access.2025.3610183">https://doi.org/10.1109/access.2025.3610183</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Farhad, M.A.</string-name>
              <string-name>Razaque, A.</string-name>
              <string-name>Mukhanov, S.B.</string-name>
              <string-name>Hassan, D.S.M.</string-name>
              <string-name>Rai, H.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Enhanced Lesion Localization and Classification in Ocular Tumor Detection Using Grad-Cam and Transfer Learning</article-title>
            <source>IEEE Access</source>
            <volume>13</volume>
            <pub-id pub-id-type="doi">10.1109/access.2025.3610183</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B40">
        <label>40.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Scientific, L.L. (2024) Enhancing Glaucoma Diagnosis: Deep Learning Models for Automated Identification and Explainability Using Fundus IMAGES. <italic>Journal of Theoretical</italic><italic>and</italic><italic>Applied</italic><italic>Information</italic><italic>Technology</italic>, 102, 5346-5363.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Scientific, L.L.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Enhancing Glaucoma Diagnosis: Deep Learning Models for Automated Identification and Explainability Using Fundus IMAGES</article-title>
            <source>Journal of Theoretical and Applied Information Technology</source>
            <volume>102</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B41">
        <label>41.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Gandhi, V.C., Gandhi, P., Ogundiran, J.O., Tshibola, M.S.S. and Kapuya Bulaba Nyembwe, J. (2025) Computational Modeling and Optimization of Deep Learning for Multi-Modal Glaucoma Diagnosis. <italic>AppliedMath</italic>, 5, Article 82. https://doi.org/10.3390/appliedmath5030082 <pub-id pub-id-type="doi">10.3390/appliedmath5030082</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/appliedmath5030082">https://doi.org/10.3390/appliedmath5030082</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Gandhi, V.C.</string-name>
              <string-name>Gandhi, P.</string-name>
              <string-name>Ogundiran, J.O.</string-name>
              <string-name>Tshibola, M.S.S.</string-name>
              <string-name>Nyembwe, J.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Computational Modeling and Optimization of Deep Learning for Multi-Modal Glaucoma Diagnosis</article-title>
            <source>AppliedMath</source>
            <volume>5</volume>
            <elocation-id>82</elocation-id>
            <pub-id pub-id-type="doi">10.3390/appliedmath5030082</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B42">
        <label>42.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Faria, F.T.J., Moin, M.B., Debnath, P., Fahim, A.I. and Shah, F.M. (2024) Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images. arXiv: 2405.07338.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Faria, F.T.J.</string-name>
              <string-name>Moin, M.B.</string-name>
              <string-name>Debnath, P.</string-name>
              <string-name>Fahim, A.I.</string-name>
              <string-name>Shah, F.M.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images</article-title>
            <fpage>2405</fpage>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B43">
        <label>43.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Eladawi, N., Sabry, D. and Salaheldin, H. (2025) Enhancing Glaucoma Detection Using Convolutional Neural Networks: A Comparative Study of Multi-Class and Binary Classification Approaches. <italic>Alfarama Journal of Basic &amp; Applied Sciences</italic>, 6, 75-95.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Eladawi, N.</string-name>
              <string-name>Sabry, D.</string-name>
              <string-name>Salaheldin, H.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Enhancing Glaucoma Detection Using Convolutional Neural Networks: A Comparative Study of Multi-Class and Binary Classification Approaches</article-title>
            <source>Alfarama Journal of Basic &amp; Applied Sciences</source>
            <volume>6</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B44">
        <label>44.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Morales, P.H. (2023) Deep Transfer Learning Strategy to Diagnose Eye-Related Conditions and Diseases: An Approach Based on Low-Quality Fundus Images. <italic>IEEE Access</italic>, 11, 37403-37411.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Morales, P.H.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Deep Transfer Learning Strategy to Diagnose Eye-Related Conditions and Diseases: An Approach Based on Low-Quality Fundus Images</article-title>
            <source>IEEE Access</source>
            <volume>11</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B45">
        <label>45.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Lemij, H.G., Vente, C.D., Sánchez, C.I. and Vermeer, K.A. (2023) Characteristics of a Large, Labeled Data Set for the Training of Artificial Intelligence for Glaucoma Screening with Fundus Photographs. <italic>Ophthalmology</italic><italic>Science</italic>, 3, Article ID: 100300. https://doi.org/10.1016/j.xops.2023.100300 <pub-id pub-id-type="doi">10.1016/j.xops.2023.100300</pub-id><pub-id pub-id-type="pmid">37113471</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.xops.2023.100300">https://doi.org/10.1016/j.xops.2023.100300</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Lemij, H.G.</string-name>
              <string-name>Vente, C.D.</string-name>
              <string-name>Vermeer, K.A.</string-name>
              <string-name>Large, L</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Characteristics of a Large, Labeled Data Set for the Training of Artificial Intelligence for Glaucoma Screening with Fundus Photographs</article-title>
            <source>Ophthalmology Science</source>
            <volume>3</volume>
            <fpage>100300</fpage>
            <elocation-id>ID</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.xops.2023.100300</pub-id>
            <pub-id pub-id-type="pmid">37113471</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B46">
        <label>46.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Tajerian, A., Keshtkar, M., Almasi-Hashiani, A. and Tajerian, M. (2023) Enhancing Eye Diseases Diagnosis through Transfer Learning: Study of Deep Convolutional Neural Networks for Accurate Classification of Glaucoma and Diabetic Retinopathy from Healthy Eye Using Fundus Images. https://www.researchsquare.com/article/rs-3120228/v1</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Tajerian, A.</string-name>
              <string-name>Keshtkar, M.</string-name>
              <string-name>Almasi-Hashiani, A.</string-name>
              <string-name>Tajerian, M.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Enhancing Eye Diseases Diagnosis through Transfer Learning: Study of Deep Convolutional Neural Networks for Accurate Classification of Glaucoma and Diabetic Retinopathy from Healthy Eye Using Fundus Images</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B47">
        <label>47.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Chen, R., <italic>et</italic><italic>al.</italic> (2024) EyeDiff: Text-To-Image Diffusion Model Improves Rare Eye Disease Diagnosis. arXiv: 2411.10004.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Chen, R.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>EyeDiff: Text-To-Image Diffusion Model Improves Rare Eye Disease Diagnosis</article-title>
            <fpage>2411</fpage>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B48">
        <label>48.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Tohye, T.G., Qin, Z., Al-antari, M.A., Ukwuoma, C.C., Lonseko, Z.M. and Gu, Y.H. (2024) Ca-Vit: Contour-Guided and Augmented Vision Transformers to Enhance Glaucoma Classification Using Fundus Images. <italic>Bioengineering</italic>, 11, Article 887. https://doi.org/10.3390/bioengineering11090887 <pub-id pub-id-type="doi">10.3390/bioengineering11090887</pub-id><pub-id pub-id-type="pmid">39329629</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/bioengineering11090887">https://doi.org/10.3390/bioengineering11090887</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Tohye, T.G.</string-name>
              <string-name>Qin, Z.</string-name>
              <string-name>Al-antari, M.A.</string-name>
              <string-name>Ukwuoma, C.C.</string-name>
              <string-name>Lonseko, Z.M.</string-name>
              <string-name>Gu, Y.H.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Ca-Vit: Contour-Guided and Augmented Vision Transformers to Enhance Glaucoma Classification Using Fundus Images</article-title>
            <source>Bioengineering</source>
            <volume>11</volume>
            <elocation-id>887</elocation-id>
            <pub-id pub-id-type="doi">10.3390/bioengineering11090887</pub-id>
            <pub-id pub-id-type="pmid">39329629</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B49">
        <label>49.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Li, Y., Carrillo-Perez, F., Al Owaifeer, A.M., Alawad, M. and Gevaert, O. (2025) Enhancing Glaucoma Detection through Supervised Pre-Training with Intermediate Phenotypes: A Multi-Institutional Study. medRxiv. https://doi.org/10.1101/2025.04.22.25326210 <pub-id pub-id-type="doi">10.1101/2025.04.22.25326210</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1101/2025.04.22.25326210">https://doi.org/10.1101/2025.04.22.25326210</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Li, Y.</string-name>
              <string-name>Carrillo-Perez, F.</string-name>
              <string-name>Owaifeer, A.M.</string-name>
              <string-name>Alawad, M.</string-name>
              <string-name>Gevaert, O.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Enhancing Glaucoma Detection through Supervised Pre-Training with Intermediate Phenotypes: A Multi-Institutional Study</article-title>
            <pub-id pub-id-type="doi">10.1101/2025.04.22.25326210</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
    </ref-list>
  </back>
</article>