<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN" "JATS-journalpublishing1-4.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.4" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">jis</journal-id>
      <journal-title-group>
        <journal-title>Journal of Information Security</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2153-1242</issn>
      <issn pub-type="ppub">2153-1234</issn>
      <publisher>
        <publisher-name>Scientific Research Publishing</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.4236/jis.2026.172009</article-id>
      <article-id pub-id-type="publisher-id">jis-150945</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
        <subj-group>
          <subject>Computer Science</subject>
          <subject>Communications</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Robustness, Cost, and Attack-Surface Concentration in Phishing Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <contrib-id contrib-id-type="orcid">0000-0003-0275-4239</contrib-id>
          <name name-style="western">
            <surname>Allagan</surname>
            <given-names>Julian</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Elbakary</surname>
            <given-names>Mohamed</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Safari</surname>
            <given-names>Zohreh</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0009-0003-5078-6283</contrib-id>
          <name name-style="western">
            <surname>Gao</surname>
            <given-names>Weizheng</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Morgan</surname>
            <given-names>Gabrielle</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Morgan</surname>
            <given-names>Essence</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Deriglazov</surname>
            <given-names>Vladimir</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="aff1"><label>1</label> Department of Mathematics, Computer Science, and Engineering Technology, Elizabeth City State University, Elizabeth City, NC, USA </aff>
      <author-notes>
        <fn fn-type="conflict" id="fn-conflict">
          <p>The authors declare no conflicts of interest regarding the publication of this paper.</p>
        </fn>
      </author-notes>
      <pub-date pub-type="epub">
        <day>01</day>
        <month>04</month>
        <year>2026</year>
      </pub-date>
      <pub-date pub-type="collection">
        <month>04</month>
        <year>2026</year>
      </pub-date>
      <volume>17</volume>
      <issue>02</issue>
      <fpage>167</fpage>
      <lpage>187</lpage>
      <history>
        <date date-type="received">
          <day>21</day>
          <month>03</month>
          <year>2026</year>
        </date>
        <date date-type="accepted">
          <day>24</day>
          <month>04</month>
          <year>2026</year>
        </date>
        <date date-type="published">
          <day>27</day>
          <month>04</month>
          <year>2026</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2026 by the authors and Scientific Research Publishing Inc.</copyright-statement>
        <copyright-year>2026</copyright-year>
        <license license-type="open-access">
          <license-p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>).</license-p>
        </license>
      </permissions>
      <self-uri content-type="doi" xlink:href="https://doi.org/10.4236/jis.2026.172009">https://doi.org/10.4236/jis.2026.172009</self-uri>
      <abstract>
        <p>Phishing detectors built on engineered website features attain near-perfect accuracy under i.i.d. evaluation, yet deployment security depends on robustness to post-deployment feature manipulation. We study this gap through a cost-aware evasion framework that models discrete, monotone feature edits under explicit attacker budgets. Three diagnostics are introduced: minimal evasion cost (MEC), the evasion survival rate <inline-formula><mml:math><mml:mrow><mml:mi>S</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, and the robustness concentration index (RCI). On the UCI Phishing Websites benchmark (11,055 instances, 30 ternary features), Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost all achieve <inline-formula><mml:math><mml:mrow><mml:mtext>AUC</mml:mtext><mml:mo>≥</mml:mo><mml:mn>0.979</mml:mn></mml:mrow></mml:math></inline-formula> under static evaluation. Under budgeted sanitization-style evasion, robustness converges across architectures: the median MEC equals 2 with full features, and over 80% of successful minimal-cost evasions concentrate on three low-cost surface features. Feature restriction improves robustness only when it removes all dominant low-cost transitions. Under strict cost schedules, infrastructure-leaning feature sets exhibit 17%–19% infeasible mass for ensemble models, while the median MEC among evadable instances remains unchanged. We formalize this convergence: if a positive fraction of correctly detected phishing instances admit evasion through a single feature transition of minimal cost <inline-formula><mml:math><mml:msub><mml:mi>c</mml:mi><mml:mtext>min</mml:mtext></mml:msub></mml:math></inline-formula>, no classifier can raise the corresponding MEC quantile above <inline-formula><mml:math><mml:msub><mml:mi>c</mml:mi><mml:mtext>min</mml:mtext></mml:msub></mml:math></inline-formula> without modifying the feature representation or cost model. Adversarial robustness in phishing detection is governed by feature economics rather than model complexity.</p>
      </abstract>
      <kwd-group kwd-group-type="author-generated" xml:lang="en">
        <kwd>Robustness</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
      <title>1. Introduction</title>
      <p>Phishing detection is inherently adversarial. Attackers adapt observable website characteristics to evade classification, while defenders evaluate models under static train–test splits. A classifier may achieve near-perfect held-out accuracy yet remain operationally fragile when its predictions rest on surface-level attributes alterable at low cost. Recent studies report detection accuracies exceeding 95% using refined feature engineering and ensemble methods [<xref ref-type="bibr" rid="B1">1</xref>]-[<xref ref-type="bibr" rid="B4">4</xref>], but these results assume a passive threat model in which adversarial adaptation is excluded. In deployed systems, this assumption rarely holds [<xref ref-type="bibr" rid="B5">5</xref>]-[<xref ref-type="bibr" rid="B7">7</xref>].</p>
      <p>The tension is acute because manipulation costs are asymmetric. Presentation-layer cues—URL structure, HTML artifacts, certificate presentation—are inexpensive to modify, whereas infrastructure-coupled signals such as domain age, DNS records, and traffic require sustained investment or third-party validation [<xref ref-type="bibr" rid="B8">8</xref>]-[<xref ref-type="bibr" rid="B11">11</xref>]. Many robustness analyses rely on continuous perturbation models that abstract away discrete feature semantics, or adopt worst-case threat models that ignore economically plausible attacker behavior [<xref ref-type="bibr" rid="B12">12</xref>]-[<xref ref-type="bibr" rid="B14">14</xref>].</p>
      <p>Phishing robustness studies fall broadly into three categories. Continuous perturbation approaches apply <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> ℓ </mml:mi><mml:mi> p </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>-bounded adversarial examples to feature vectors, treating each coordinate as a real-valued input amenable to gradient-based attack [<xref ref-type="bibr" rid="B12">12</xref>][<xref ref-type="bibr" rid="B15">15</xref>]. While methodologically convenient, this abstraction obscures the discrete, semantically constrained nature of website feature edits. Heuristic attack studies evaluate classifiers against hand-crafted manipulation strategies without formalizing attacker cost or optimality [<xref ref-type="bibr" rid="B1">1</xref>][<xref ref-type="bibr" rid="B3">3</xref>][<xref ref-type="bibr" rid="B16">16</xref>]. Problem-space constraint work emphasizes that adversarial perturbations must satisfy domain constraints, yet typically does not assign explicit economic costs to individual transitions or characterize the resulting attack-surface structure [<xref ref-type="bibr" rid="B6">6</xref>][<xref ref-type="bibr" rid="B13">13</xref>][<xref ref-type="bibr" rid="B17">17</xref>].</p>
      <p>We adopt a complementary perspective. Evasion is formulated as exact shortest-path search over a cost-weighted discrete transition graph. We introduce concentration diagnostics—RCI and FirstTop1—as primary structural indicators, and establish an architecture-independent cost-floor bound. The combination of exact discrete-cost optimization, attack-surface concentration measurement, and a formal robustness ceiling distinguishes this work from prior analyses.</p>
      <p>To bridge the gap between static evaluation and adversarial deployment, we develop a cost-aware adversarial evaluation framework that assigns explicit costs to discrete feature edits and evaluates classifiers under bounded attacker budgets. We study sanitization-style evasion under monotone edits, where the attacker removes phishing indicators and pushes feature values toward legitimate states. This threat model represents a lower bound on attacker capability: it excludes anti-feature injection, extractor-level attacks, and non-monotone manipulation, all of which can only expand the feasible action set and therefore can only lower MEC, never raise it. The restriction to monotone edits is operationally motivated by empirical evidence that most phishing campaigns are short-lived (24–72 hours), favoring indicator removal over infrastructure construction [<xref ref-type="bibr" rid="B10">10</xref>][<xref ref-type="bibr" rid="B11">11</xref>]. Section 7 discusses how relaxing this restriction would affect concentration and the cost floor.</p>
      <p>Rather than measuring aggregate degradation, we address a structural question central to defensive design: under budget-constrained manipulation, do evasion pathways disperse across many features or collapse onto a small attack surface? The answer determines whether architectural complexity redistributes adversarial risk or leaves dominant failure modes intact.</p>
      <p>We operationalize this analysis with three diagnostics. Minimal evasion cost (MEC) is the smallest cumulative cost required to induce misclassification for a correctly detected phishing instance. The evasion survival rate <inline-formula><mml:math><mml:mrow><mml:mi> S </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> B </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mtext> Pr </mml:mtext><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mtext> MEC </mml:mtext><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> &gt; </mml:mo><mml:mi> B </mml:mi></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> measures resistance to an attacker with budget <inline-formula><mml:math><mml:mi> B </mml:mi></mml:math></inline-formula>. The robustness concentration index (RCI) quantifies whether successful minimal-cost edits are diffuse or concentrated on a small subset of features. Empirically, across models and full feature sets, evasion succeeds under modest budgets (median MEC = 2), and more than 80% of traces concentrate on three low-cost surface features. We formalize this convergence with a structural result: when a nontrivial fraction of instances admit evasion via a single feature transition of minimal cost, no classifier architecture can raise the corresponding robustness quantiles without modifying the feature space or cost model. We term this action-set-limited invariance.</p>
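      <p>As a minimal illustration of the survival-rate definition, the following Python sketch computes an empirical <italic>S</italic>(<italic>B</italic>) from a list of per-instance MEC values. The function name <monospace>survival_rate</monospace>, the use of <monospace>math.inf</monospace> to mark instances with no feasible evasion, and the sample values are assumptions made for illustration; they are not the authors' implementation or data.</p>
      <code language="python"><![CDATA[
```python
import math


def survival_rate(mec_values, budget):
    """Empirical S(B): fraction of instances whose MEC strictly exceeds B."""
    if not mec_values:
        raise ValueError("need at least one MEC value")
    survivors = sum(1 for m in mec_values if m > budget)
    return survivors / len(mec_values)


# Hypothetical MEC values; math.inf marks an instance with no feasible evasion.
example_mec = [1, 2, 2, 2, 3, 4, math.inf, 2]

for b in (0, 1, 2, 4):
    print(b, survival_rate(example_mec, b))
```
]]></code>
      <p>Because instances with no feasible evasion are recorded as infinite MEC, they count as survivors at every finite budget, which matches the strict inequality in the definition.</p>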
    </sec>
    <sec id="sec2">
      <title>2. Methods</title>
      <p>We model post-deployment evasion as a shortest-path problem on a directed graph whose nodes are discrete feature vectors, whose edges represent admissible monotone manipulations, and whose edge weights encode attacker cost.</p>
      <sec id="sec2dot1">
        <title>2.1. Threat Model</title>
        <p>Let <inline-formula><mml:math><mml:mrow><mml:mi> f </mml:mi><mml:mo> : </mml:mo><mml:mi mathvariant="script"> X </mml:mi><mml:mo> → </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> denote a deployed classifier (−1 for phishing, +1 for legitimate). Given a phishing instance <inline-formula><mml:math><mml:mi> x </mml:mi></mml:math></inline-formula> correctly classified as malicious, the attacker seeks <inline-formula><mml:math><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:math></inline-formula> with <inline-formula><mml:math><mml:mrow><mml:mi> f </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula> subject to a finite manipulation budget <inline-formula><mml:math><mml:mi> B </mml:mi></mml:math></inline-formula>. Feature vectors lie in <inline-formula><mml:math><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0 </mml:mn><mml:mo> , </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow><mml:mi> d </mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, encoding phishing-indicative, neutral, and legitimate states, respectively. Edits are monotone: a transition <inline-formula><mml:math><mml:mrow><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is admissible only if <inline-formula><mml:math><mml:mrow><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup><mml:mo> ≥ </mml:mo><mml:mi> v </mml:mi></mml:mrow></mml:math></inline-formula> under the ordering <inline-formula><mml:math><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> &lt; </mml:mo><mml:mn> 0 </mml:mn><mml:mo> &lt; </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula>. Reverse transitions incur infinite cost. This models sanitization-style evasion in which attackers remove suspicious indicators rather than inject adversarial anti-features. The attacker possesses feature-level knowledge (awareness that detection relies on discrete website features, and a coarse understanding of surface versus infrastructure cost asymmetries, consistent with publicly documented detection pipelines [<xref ref-type="bibr" rid="B6">6</xref>][<xref ref-type="bibr" rid="B18">18</xref>]) but has no access to model parameters, training data, confidence scores, or gradients.</p>
        <p>This threat model constitutes a lower bound on attacker capability. Non-monotone edits (injecting benign artifacts), extractor-level manipulation (exploiting parser ambiguities to alter computed features without semantic change), and anti-feature attacks all enlarge the feasible action set. Any such enlargement can only decrease MEC and potentially increase concentration. The cost floor established under monotone edits therefore provides a conservative bound: if robustness is fragile under sanitization-only attackers, it is at least as fragile under more capable adversaries.</p>
        <p>We compute MEC values via uniform-cost search (<bold>Algorithm 1</bold>), yielding exact shortest paths within the prescribed budget. Exact MEC is the least cost that any attacker restricted to the defined action set can pay, and is therefore an upper bound on evasion efficiency: query-limited attackers may fail to discover optimal evasions, raising empirical survival rates, but the structural cost floor persists whenever low-cost transitions remain available.</p>
        <p><bold>Algorithm 1.</bold> Minimal evasion cost via uniform-cost search.</p>
        <fig id="fig1">
          <label>Figure 1</label>
          <graphic xlink:href="https://html.scirp.org/file/7801221-rId50.jpeg?20260427102843" />
        </fig>
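        <p>Because Algorithm 1 is reproduced here only as an image, the following Python sketch shows one way a uniform-cost search over monotone edits could be written. It is an illustration under stated assumptions, not the authors' code: the callables <monospace>classify</monospace> and <monospace>edit_cost</monospace>, the function name <monospace>minimal_evasion_cost</monospace>, and the toy classifier and unit-step costs are all hypothetical.</p>
        <code language="python"><![CDATA[
```python
import heapq
import math


def minimal_evasion_cost(x, classify, edit_cost, budget):
    """Uniform-cost search over monotone edits of a ternary feature vector.

    x         : sequence of feature values in {-1, 0, +1}
    classify  : callable, tuple -> -1 (phishing) or +1 (legitimate)
    edit_cost : callable (j, v, v_new) -> nonnegative cost (math.inf if disallowed)
    budget    : maximum cumulative cost B
    Returns (cost, evading_vector), or (math.inf, None) if no evasion fits B.
    """
    start = tuple(x)
    best = {start: 0}            # cheapest known cost to reach each vector
    frontier = [(0, start)]      # priority queue ordered by cumulative cost
    while frontier:
        cost, node = heapq.heappop(frontier)
        if cost > best.get(node, math.inf):
            continue             # stale queue entry
        if classify(node) == +1:
            return cost, node    # first goal popped has minimal cost
        for j, v in enumerate(node):
            for v_new in (0, 1):
                if v_new <= v:
                    continue     # monotone edits only: values may only increase
                new_cost = cost + edit_cost(j, v, v_new)
                if new_cost > budget:
                    continue     # also prunes infinite-cost transitions
                child = node[:j] + (v_new,) + node[j + 1:]
                if new_cost < best.get(child, math.inf):
                    best[child] = new_cost
                    heapq.heappush(frontier, (new_cost, child))
    return math.inf, None


# Toy stand-ins (hypothetical): flag a page when the feature sum is negative,
# and charge one unit per step, so -1 -> +1 costs 2.
def toy_classify(v):
    return -1 if sum(v) < 0 else +1


def toy_cost(j, v, v_new):
    return v_new - v


cost, x_adv = minimal_evasion_cost((-1, -1, 1), toy_classify, toy_cost, budget=18)
print(cost, x_adv)
```
]]></code>
        <p>In this toy case a single one-unit edit suffices. The sketch assumes nonnegative edit costs, which is what guarantees that the first evading vector removed from the queue is cost-optimal.</p>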
      </sec>
      <sec id="sec2dot2">
        <title>2.2. Cost Schedules</title>
        <p>Each admissible edit <inline-formula><mml:math><mml:mrow><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> j </mml:mi><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> incurs nonnegative cost <inline-formula><mml:math><mml:mrow><mml:mi> c </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> j </mml:mi><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. For an instance <inline-formula><mml:math><mml:mi> x </mml:mi></mml:math></inline-formula> transformed to <inline-formula><mml:math><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:math></inline-formula>, the cumulative cost is additive:</p>
        <disp-formula id="FD1">
          <mml:math>
            <mml:mrow>
              <mml:mi>C</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mi>x</mml:mi>
                  <mml:mo>→</mml:mo>
                  <mml:msup>
                    <mml:mi>x</mml:mi>
                    <mml:mo>′</mml:mo>
                  </mml:msup>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>=</mml:mo>
              <mml:munder>
                <mml:mstyle mathsize="140%" displaystyle="true">
                  <mml:mo>∑</mml:mo>
                </mml:mstyle>
                <mml:mrow>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>j</mml:mi>
                      <mml:mo>,</mml:mo>
                      <mml:mi>v</mml:mi>
                      <mml:mo>→</mml:mo>
                      <mml:msup>
                        <mml:mi>v</mml:mi>
                        <mml:mo>′</mml:mo>
                      </mml:msup>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                  <mml:mo>∈</mml:mo>
                  <mml:mi>Δ</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>x</mml:mi>
                      <mml:mo>,</mml:mo>
                      <mml:msup>
                        <mml:mi>x</mml:mi>
                        <mml:mo>′</mml:mo>
                      </mml:msup>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
              </mml:munder>
              <mml:mtext>
                 
              </mml:mtext>
              <mml:mi>c</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mi>j</mml:mi>
                  <mml:mo>,</mml:mo>
                  <mml:mi>v</mml:mi>
                  <mml:mo>→</mml:mo>
                  <mml:msup>
                    <mml:mi>v</mml:mi>
                    <mml:mo>′</mml:mo>
                  </mml:msup>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>.</mml:mo>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>The feasible action set at budget <inline-formula><mml:math><mml:mi> B </mml:mi></mml:math></inline-formula> is <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi mathvariant="script"> A </mml:mi><mml:mi> B </mml:mi></mml:msub><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup><mml:mo> ∈ </mml:mo><mml:mi mathvariant="script"> X </mml:mi><mml:mo> : </mml:mo><mml:mi> C </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> x </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow><mml:mo> ≤ </mml:mo><mml:mi> B </mml:mi></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
        <p>Costs represent dimensionless operational friction—the difficulty of effecting a manipulation within a phishing campaign’s operational window—rather than direct monetary expenditure. We calibrate using a time-to-effect principle: one cost unit corresponds to a manipulation executable within a single day by the campaign operator; four units correspond to changes requiring multi-week external accumulation (DNS propagation, organic traffic growth, reputation accrual). This calibration reflects documented campaign lifecycles: Oest <italic>et al.</italic> [<xref ref-type="bibr" rid="B10">10</xref>] report median campaign durations under 21 hours, with 95% retired within 72 hours, establishing surface-level edits as effectively “free” within the operational window and infrastructure changes as largely infeasible. Bijmans <italic>et al.</italic> [<xref ref-type="bibr" rid="B11">11</xref>] corroborate this timeline for phishing-kit deployments.</p>
        <p>What matters for the structural conclusions is the cost ordering—surface features are strictly cheaper than infrastructure features—rather than exact magnitudes. Section 5 demonstrates that proportional cost scaling shifts the median MEC linearly while preserving feature ordering, concentration structure, and architecture invariance.</p>
        <p>We consider two schedules. The base schedule assigns low cost to surface features (URL structure, HTML presentation; <inline-formula><mml:math><mml:mrow><mml:mi> c </mml:mi><mml:mo> = </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 2 </mml:mn></mml:mrow></mml:math></inline-formula>) and higher cost to infrastructure features (domain age, DNS, traffic, reputation; <inline-formula><mml:math><mml:mrow><mml:mi> c </mml:mi><mml:mo> = </mml:mo><mml:mn> 4 </mml:mn><mml:mo> , </mml:mo><mml:mn> 8 </mml:mn></mml:mrow></mml:math></inline-formula>). The strict schedule coincides with the base schedule except that infrastructure-feature upgrades to the fully legitimate state are disallowed (<inline-formula><mml:math><mml:mrow><mml:mi> c </mml:mi><mml:mo> = </mml:mo><mml:mi> ∞ </mml:mi></mml:mrow></mml:math></inline-formula>), modeling horizons in which complete infrastructure legitimization is infeasible. <bold>Table 1</bold> summarizes both schedules. All experiments use <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> B </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub><mml:mo> = </mml:mo><mml:mn> 18 </mml:mn></mml:mrow></mml:math></inline-formula>. The Appendix provides a complete mapping of all 30 UCI features to their cost groups, along with a one-line time-to-effect rationale for each assignment.</p>
        <p><bold>Table 1.</bold> Feature manipulation cost schedules with operational time horizons. Costs are calibrated by the time-to-effect principle: 1 unit ≈ changes feasible within a day; 4 units ≈ multi-week external accumulation.</p>
        <table-wrap id="tbl1">
          <label>Table 1</label>
          <table>
            <tbody>
              <tr>
                <td rowspan="2">Feature Group</td>
                <td rowspan="2">Examples</td>
                <td colspan="3">Base Schedule</td>
                <td colspan="3">Strict Schedule</td>
                <td rowspan="2">Time Horizon</td>
              </tr>
              <tr>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mo>−</mml:mo>
                        <mml:mn>1</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>0</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mo>−</mml:mo>
                        <mml:mn>1</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mn>0</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mo>−</mml:mo>
                        <mml:mn>1</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>0</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mo>−</mml:mo>
                        <mml:mn>1</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mrow>
                        <mml:mn>0</mml:mn>
                        <mml:mo>→</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                    </mml:math>
                  </inline-formula>
                </td>
              </tr>
              <tr>
                <td>Surface</td>
                <td>URL_of_Anchor, SFH, Prefix_Suffix, SSLfinal_State</td>
                <td>1</td>
                <td>2</td>
                <td>1</td>
                <td>1</td>
                <td>2</td>
                <td>1</td>
                <td>hours to days</td>
              </tr>
              <tr>
                <td>Semi-domain</td>
                <td>Domain_Reg_Length, Google_Index</td>
                <td>3</td>
                <td>6</td>
                <td>3</td>
                <td>3</td>
                <td>6</td>
                <td>3</td>
                <td>days to weeks</td>
              </tr>
              <tr>
                <td>Infrastructure</td>
                <td>web_traffic, DNSRecord, age_of_domain, Page_Rank</td>
                <td>4</td>
                <td>8</td>
                <td>4</td>
                <td>4</td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mi>∞</mml:mi>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>
                  <inline-formula>
                    <mml:math>
                      <mml:mi>∞</mml:mi>
                    </mml:math>
                  </inline-formula>
                </td>
                <td>weeks to months</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
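        <p>The schedules in Table 1 can be encoded directly as lookup tables. The following Python sketch does so and evaluates the additive cost of a set of edits. The dictionary layout, the string group labels, and the function names <monospace>edit_cost</monospace> and <monospace>total_cost</monospace> are assumptions for illustration; only the numeric costs are taken from Table 1.</p>
        <code language="python"><![CDATA[
```python
import math

INF = math.inf

# Per-group costs from Table 1, keyed by (from_value, to_value).
BASE = {
    "surface":        {(-1, 0): 1, (-1, 1): 2, (0, 1): 1},
    "semi-domain":    {(-1, 0): 3, (-1, 1): 6, (0, 1): 3},
    "infrastructure": {(-1, 0): 4, (-1, 1): 8, (0, 1): 4},
}

# Strict schedule: same as base, except infrastructure upgrades to +1 are disallowed.
STRICT = {group: dict(costs) for group, costs in BASE.items()}
STRICT["infrastructure"].update({(-1, 1): INF, (0, 1): INF})


def edit_cost(schedule, group, v, v_new):
    """Cost of one edit; non-increasing transitions are treated as inadmissible."""
    if v_new <= v:
        return INF
    return schedule[group][(v, v_new)]


def total_cost(schedule, edits):
    """Additive cost of a list of (group, v, v_new) edits."""
    return sum(edit_cost(schedule, g, v, v_new) for g, v, v_new in edits)


edits = [("surface", -1, 1), ("infrastructure", -1, 0)]
print(total_cost(BASE, edits))      # surface -1 -> +1 plus infrastructure -1 -> 0
print(total_cost(STRICT, edits))    # both edits remain admissible under strict
print(total_cost(STRICT, [("infrastructure", 0, 1)]))  # disallowed under strict
```
]]></code>
        <p>A path whose total cost is infinite never fits within a finite budget, so disallowed transitions drop out of the feasible action set without special handling.</p>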
        <p>SSLfinal_State is classified as surface-level because certificate presentation can be modified through front-end configuration (e.g., deploying a free DV certificate via Let’s Encrypt), without sustained infrastructure investment. Under the time-to-effect principle, this operation falls within the single-day horizon. Reclassifying SSLfinal_State as semi-domain is examined in the sensitivity analysis.</p>
      </sec>
      <sec id="sec2dot3">
        <title>2.3. Dataset, Models and Conditioning</title>
        <p>We use the UCI Phishing Websites benchmark [<xref ref-type="bibr" rid="B19">19</xref>]: 11,055 instances described by 30 ternary features in <inline-formula><mml:math><mml:mrow><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0 </mml:mn><mml:mo> , </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, with 4898 phishing and 6157 legitimate websites. A stratified 75/25 train–test split (seed 1337) yields 2764 test instances including 1225 phishing samples. Four classifier families are evaluated: Logistic Regression (<inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> ℓ </mml:mi><mml:mn> 2 </mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> regularization, <inline-formula><mml:math><mml:mrow><mml:mi> C </mml:mi><mml:mo> = </mml:mo><mml:mn> 1.0 </mml:mn></mml:mrow></mml:math></inline-formula>), Random Forests (100 trees, max depth 10), Gradient Boosted Decision Trees (100 estimators, learning rate 0.1, max depth 6), and XGBoost with matched hyperparameters. Classification uses a fixed threshold of 0.5 on each model’s native <monospace>predict_proba</monospace> output, held constant across all models and configurations; threshold sensitivity is examined in Section 5. Implementations use scikit-learn 1.0.2 and xgboost 1.5.0.</p>
        <p>Robustness is evaluated on the conditioning set <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId95.svg?20260427102843" />, the phishing instances correctly classified as malicious. Conditioning isolates post-detection evasion and separates robustness from baseline classification error. The per-model sizes of <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId97.svg?20260427102843" /> are: Logistic Regression 1103, Random Forest 1118, GBDT 1142, and XGBoost 1168 (out of 1225 phishing test instances). The cross-model intersection, that is, the set of instances correctly detected by all four models, contains 1047 instances.</p>
        <p>For cross-model comparisons, we intersect <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId99.svg?20260427102843" /> across all four models and uniformly sample <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> n </mml:mi><mml:mrow><mml:mtext> eval </mml:mtext></mml:mrow></mml:msub><mml:mo> = </mml:mo><mml:mn> 300 </mml:mn></mml:mrow></mml:math></inline-formula> instances from the intersection. This sample size was chosen so that 95% bootstrap confidence intervals on median MEC achieve width at most 1 and those on RCI<sub>3</sub> achieve width at most 0.06 under the observed distributions; it represents approximately 29% of the intersection (300 of 1047 instances), which is ample for stable quantile estimation at the precision reported. To verify stability, <bold>Table 2</bold> reports the mean and standard deviation of median MEC and RCI<sub>3</sub> across 10 independent random draws of 300 instances from the intersection. <bold>Table 3</bold> checks that restricting to this intersection does not bias conclusions by comparing metrics on each model’s full <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId103.svg?20260427102843" /> with those on the intersection sample.</p>
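        <p>The conditioning and intersection steps can be illustrated with a minimal sketch. It assumes labels encoded as −1 (phishing) and +1 (legitimate), consistent with the evasion condition used for MEC; the function names, the toy predictions, and the sampling seed are hypothetical and are not taken from the authors' code.</p>
        <code language="python"><![CDATA[
```python
import random


def conditioning_set(y_true, y_pred, phishing_label=-1):
    """Indices of phishing instances that the model also flags as phishing."""
    return {
        i
        for i, (t, p) in enumerate(zip(y_true, y_pred))
        if t == phishing_label and p == phishing_label
    }


def sample_intersection(cond_sets, n_eval, seed):
    """Intersect per-model conditioning sets, then draw a uniform sample."""
    common = sorted(set.intersection(*cond_sets))
    rng = random.Random(seed)
    sample = sorted(rng.sample(common, min(n_eval, len(common))))
    return common, sample


# Toy data: four phishing instances followed by two legitimate ones.
y_true = [-1, -1, -1, -1, 1, 1]
preds = {
    "model_a": [-1, -1, 1, -1, 1, 1],
    "model_b": [-1, -1, -1, 1, 1, -1],
}
cond = {name: conditioning_set(y_true, p) for name, p in preds.items()}
common, sample = sample_intersection(list(cond.values()), n_eval=1, seed=0)
```
]]></code>
        <p>In this toy case each model detects three of the four phishing instances, and only the two detected by both models are eligible for sampling.</p>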
        <p><bold>Table 2.</bold> Stability of robustness metrics across 10 independent random 300-instance draws from the cross-model intersection (Full/base). The small standard deviations indicate that conclusions are not sensitive to the specific subsample chosen.</p>
        <table-wrap id="tbl2">
          <label>Table 2</label>
          <table>
            <tbody>
              <tr>
                <td>Model</td>
                <td>Mean median MEC</td>
                <td>Std (median MEC)</td>
                <td>Mean RCI<sub>3</sub></td>
                <td>Std (RCI<sub>3</sub>)</td>
              </tr>
              <tr>
                <td>Logistic Regression</td>
                <td>2</td>
                <td>0.00</td>
                <td>0.961</td>
                <td>0.011</td>
              </tr>
              <tr>
                <td>Random Forest</td>
                <td>2</td>
                <td>0.00</td>
                <td>0.843</td>
                <td>0.018</td>
              </tr>
              <tr>
                <td>GBDT</td>
                <td>2</td>
                <td>0.00</td>
                <td>0.891</td>
                <td>0.015</td>
              </tr>
              <tr>
                <td>XGBoost</td>
                <td>2</td>
                <td>0.00</td>
                <td>0.819</td>
                <td>0.017</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
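        <p>A repeated-draw stability check of the kind summarized in Table 2 might look as follows. This is a sketch under stated assumptions: the pool of values is synthetic (only its size, 1047, matches the intersection), the seed is arbitrary, and the sample standard deviation is used because the text does not say which estimator was reported.</p>
        <code language="python"><![CDATA[
```python
import random
import statistics


def draw_stability(values, n_eval, n_draws, seed):
    """Mean and sample std of the median over repeated uniform subsamples."""
    rng = random.Random(seed)
    medians = [
        statistics.median(rng.sample(values, n_eval)) for _ in range(n_draws)
    ]
    return statistics.mean(medians), statistics.stdev(medians)


# Synthetic stand-in for finite MEC values on the 1047-instance intersection.
pool = [1] * 100 + [2] * 800 + [3] * 147
mean_med, std_med = draw_stability(pool, n_eval=300, n_draws=10, seed=0)
```
]]></code>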
        <p><bold>Table 3.</bold> Robustness metrics on full <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId105.svg?20260427102843" /> versus the 300-instance cross-model intersection (Full/base).</p>
        <table-wrap id="tbl3">
          <label>Table 3</label>
          <table>
            <tbody>
              <tr>
                <td rowspan="2">Model</td>
                <td colspan="2">Median MEC</td>
                <td colspan="2">
                  RCI
                  <sub>3</sub>
                </td>
              </tr>
              <tr>
                <td>
                  Full
                  <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId107.svg?20260427102843">
                  </inline-graphic>
                </td>
                <td>Intersection</td>
                <td>
                  Full
                  <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId109.svg?20260427102843">
                  </inline-graphic>
                </td>
                <td>Intersection</td>
              </tr>
              <tr>
                <td>Logistic Regression</td>
                <td>2</td>
                <td>2</td>
                <td>0.96</td>
                <td>0.96</td>
              </tr>
              <tr>
                <td>Random Forest</td>
                <td>2</td>
                <td>2</td>
                <td>0.85</td>
                <td>0.84</td>
              </tr>
              <tr>
                <td>GBDT</td>
                <td>2</td>
                <td>2</td>
                <td>0.89</td>
                <td>0.89</td>
              </tr>
              <tr>
                <td>XGBoost</td>
                <td>2</td>
                <td>2</td>
                <td>0.82</td>
                <td>0.82</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>We evaluate six feature configurations. The full set contains all 30 features. The Accuracy-Anchored Subsets AAS-12a (<inline-formula><mml:math><mml:mrow><mml:mi> d </mml:mi><mml:mo> = </mml:mo><mml:mn> 12 </mml:mn></mml:mrow></mml:math></inline-formula>) and AAS-11b (<inline-formula><mml:math><mml:mrow><mml:mi> d </mml:mi><mml:mo> = </mml:mo><mml:mn> 11 </mml:mn></mml:mrow></mml:math></inline-formula>) retain the features with the highest mutual information with the target label, selected by a greedy forward-selection procedure on the training set. The Robustness-Anchored subset RA-8 (<inline-formula><mml:math><mml:mrow><mml:mi> d </mml:mi><mml:mo> = </mml:mo><mml:mn> 8 </mml:mn></mml:mrow></mml:math></inline-formula>) emphasizes infrastructure-leaning signals but retains SSLfinal_State to represent the dominant surface bottleneck. The Vulnerability-Anchored subsets VA-8a (<inline-formula><mml:math><mml:mrow><mml:mi> d </mml:mi><mml:mo> = </mml:mo><mml:mn> 8 </mml:mn></mml:mrow></mml:math></inline-formula>) and VA-7b (<inline-formula><mml:math><mml:mrow><mml:mi> d </mml:mi><mml:mo> = </mml:mo><mml:mn> 7 </mml:mn></mml:mrow></mml:math></inline-formula>) contain only presentation-layer features. <bold>Table 4</bold> lists the exact features included in each subset. In all cases <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 2 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, so by Proposition 3.1 the median MEC cannot exceed this value whenever at least half of the conditioning set admits a single-transition evasion at that cost; the bound can be lifted only by removing all transitions at that cost.</p>
        <p><bold>Table 4.</bold> Feature subset definitions. AAS = Accuracy-Anchored Subset; RA = Robustness-Anchored; VA = Vulnerability-Anchored.</p>
        <table-wrap id="tbl4">
          <label>Table 4</label>
          <table>
            <tbody>
              <tr>
                <td>Subset</td>
                <td>
                  <italic>d</italic>
                </td>
                <td>Features included</td>
              </tr>
              <tr>
                <td rowspan="3">AAS-12a</td>
                <td rowspan="3">12</td>
                <td>SSLfinal_State, URL_of_Anchor, having_Sub_Domain, age_of_domain,</td>
              </tr>
              <tr>
                <td>web_traffic, DNSRecord, Page_Rank, Domain_registeration_length,</td>
              </tr>
              <tr>
                <td>Google_Index, Request_URL, Links_in_tags, Prefix_Suffix</td>
              </tr>
              <tr>
                <td>AAS-11b</td>
                <td>11</td>
                <td>Same as AAS-12a, dropping Links_in_tags</td>
              </tr>
              <tr>
                <td rowspan="2">RA-8</td>
                <td rowspan="2">8</td>
                <td>age_of_domain, DNSRecord, web_traffic, Page_Rank,</td>
              </tr>
              <tr>
                <td>Google_Index, Domain_registeration_length, having_Sub_Domain, SSLfinal_State</td>
              </tr>
              <tr>
                <td rowspan="2">VA-8a</td>
                <td rowspan="2">8</td>
                <td>URL_of_Anchor, SFH, Prefix_Suffix, having_At_Sign,</td>
              </tr>
              <tr>
                <td>HTTPS_token, Request_URL, Iframe, on_mouseover</td>
              </tr>
              <tr>
                <td rowspan="2">VA-7b</td>
                <td rowspan="2">7</td>
                <td>URL_of_Anchor, SFH, Prefix_Suffix, having_At_Sign,</td>
              </tr>
              <tr>
                <td>HTTPS_token, Request_URL, Iframe</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
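        <p>For readers who wish to reproduce the configurations, the subsets in Table 4 can be written down as plain lists. The feature names below are transcribed from the table; the dictionary layout and the <monospace>project</monospace> helper are illustrative assumptions rather than the authors' implementation.</p>
        <code language="python"><![CDATA[
```python
# Feature subsets transcribed from Table 4 (names as spelled in the dataset).
AAS_12A = [
    "SSLfinal_State", "URL_of_Anchor", "having_Sub_Domain", "age_of_domain",
    "web_traffic", "DNSRecord", "Page_Rank", "Domain_registeration_length",
    "Google_Index", "Request_URL", "Links_in_tags", "Prefix_Suffix",
]

SUBSETS = {
    "AAS-12a": AAS_12A,
    "AAS-11b": [f for f in AAS_12A if f != "Links_in_tags"],
    "RA-8": [
        "age_of_domain", "DNSRecord", "web_traffic", "Page_Rank",
        "Google_Index", "Domain_registeration_length", "having_Sub_Domain",
        "SSLfinal_State",
    ],
    "VA-8a": [
        "URL_of_Anchor", "SFH", "Prefix_Suffix", "having_At_Sign",
        "HTTPS_token", "Request_URL", "Iframe", "on_mouseover",
    ],
    "VA-7b": [
        "URL_of_Anchor", "SFH", "Prefix_Suffix", "having_At_Sign",
        "HTTPS_token", "Request_URL", "Iframe",
    ],
}


def project(row, columns, subset):
    """Keep only the values of `row` whose column name appears in `subset`."""
    index = {name: i for i, name in enumerate(columns)}
    return [row[index[name]] for name in subset]


# Subset sizes, which should match the d column of Table 4.
sizes = {name: len(features) for name, features in SUBSETS.items()}
```
]]></code>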
      </sec>
      <sec id="sec2dot4">
        <title>2.4. Robustness Metrics</title>
        <p>For each <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId123.svg?20260427102844" />, the minimal evasion cost is</p>
        <disp-formula id="FD2">
          <mml:math>
            <mml:mrow>
              <mml:mtext>MEC</mml:mtext>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mi>x</mml:mi>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>=</mml:mo>
              <mml:mtext>inf</mml:mtext>
              <mml:mrow>
                <mml:mo>{</mml:mo>
                <mml:mrow>
                  <mml:mi>C</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>x</mml:mi>
                      <mml:mo>→</mml:mo>
                      <mml:msup>
                        <mml:mi>x</mml:mi>
                        <mml:mo>′</mml:mo>
                      </mml:msup>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                  <mml:mo>:</mml:mo>
                  <mml:mi>f</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:msup>
                      <mml:mi>x</mml:mi>
                      <mml:mo>′</mml:mo>
                    </mml:msup>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                  <mml:mo>=</mml:mo>
                  <mml:mo>+</mml:mo>
                  <mml:mn>1</mml:mn>
                </mml:mrow>
                <mml:mo>}</mml:mo>
              </mml:mrow>
              <mml:mo>,</mml:mo>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>computed exactly via uniform-cost search (<bold>Algorithm 1</bold>). The search is complete up to <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> B </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and returns <inline-formula><mml:math><mml:mrow><mml:mtext> MEC </mml:mtext><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mi> ∞ </mml:mi></mml:mrow></mml:math></inline-formula> when no evasion exists within budget. Median runtime is 0.3 s per instance for full feature sets.</p>
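        <p>The following is a minimal sketch of a uniform-cost search for MEC; it is not a reproduction of Algorithm 1. It assumes that a state is a tuple of ternary values, that the classifier returns +1 for legitimate, and that the cost schedule is exposed as a hypothetical callable returning <monospace>None</monospace> for inadmissible transitions. The toy classifier and toy costs are invented for illustration only.</p>
        <code language="python"><![CDATA[
```python
import heapq
import math


def mec_uniform_cost(x, classify, cost, b_max):
    """Minimal evasion cost of x by uniform-cost search, capped at b_max.

    classify(state) returns +1 (legitimate) or -1 (phishing).
    cost(j, v, v_new) returns a positive cost, or None if not admissible.
    Returns (cost, trace); cost is math.inf if no evasion fits the budget.
    """
    start = tuple(x)
    best = {start: 0}
    frontier = [(0, start, ())]  # (cost so far, state, edit trace)
    while frontier:
        g, state, trace = heapq.heappop(frontier)
        if g > best.get(state, math.inf):
            continue  # stale entry: a cheaper route to this state was found
        if classify(state) == 1:
            return g, list(trace)  # first goal popped has minimal cost
        for j, v in enumerate(state):
            for v_new in (-1, 0, 1):
                c = cost(j, v, v_new) if v_new != v else None
                if c is None or g + c > b_max:
                    continue
                nxt = state[:j] + (v_new,) + state[j + 1:]
                if g + c < best.get(nxt, math.inf):
                    best[nxt] = g + c
                    step = trace + ((j, v, v_new),)
                    heapq.heappush(frontier, (g + c, nxt, step))
    return math.inf, []


def toy_classify(state):
    # Toy rule: legitimate once the feature sum is non-negative.
    return 1 if sum(state) >= 0 else -1


def toy_cost(j, v, v_new):
    # Toy schedule: only upward edits; feature 0 costs 1 per step, others 2.
    if v_new <= v:
        return None
    return (1 if j == 0 else 2) * (v_new - v)


mec, trace = mec_uniform_cost((-1, -1, 1), toy_classify, toy_cost, b_max=10)
```
]]></code>
        <p>Because the heap orders entries by cost first and then by state and trace, ties between equal-cost paths are resolved deterministically, which mirrors the role of a canonical trace in the concentration metrics.</p>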
        <p>Resistance at attacker budget <inline-formula><mml:math><mml:mi> B </mml:mi></mml:math></inline-formula> is summarized by the evasion survival rate <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId133.svg?20260427102844" />. Aggregate robustness is captured by the feature robustness index</p>
        <disp-formula id="FD3">
          <mml:math>
            <mml:mrow>
              <mml:mtext>FRI</mml:mtext>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mn>1</mml:mn>
                <mml:mrow>
                  <mml:msub>
                    <mml:mi>B</mml:mi>
                    <mml:mrow>
                      <mml:mtext>max</mml:mtext>
                    </mml:mrow>
                  </mml:msub>
                </mml:mrow>
              </mml:mfrac>
              <mml:mstyle displaystyle="true">
                <mml:mrow>
                  <mml:msubsup>
                    <mml:mo>∫</mml:mo>
                    <mml:mn>0</mml:mn>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>B</mml:mi>
                        <mml:mrow>
                          <mml:mtext>max</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                  </mml:msubsup>
                  <mml:mrow>
                    <mml:mi>S</mml:mi>
                    <mml:mrow>
                      <mml:mo>(</mml:mo>
                      <mml:mi>B</mml:mi>
                      <mml:mo>)</mml:mo>
                    </mml:mrow>
                    <mml:mtext>d</mml:mtext>
                    <mml:mi>B</mml:mi>
                  </mml:mrow>
                </mml:mrow>
              </mml:mstyle>
              <mml:mo>,</mml:mo>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>approximated by a left Riemann sum over integer budgets. FRI measures the expected fraction of the budget range over which a randomly selected instance from <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId137.svg?20260427102844" /> resists evasion; equivalently, it is the normalized area under the survival curve up to <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> B </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. FRI incorporates infeasible mass (instances with <inline-formula><mml:math><mml:mrow><mml:mtext> MEC </mml:mtext><mml:mo> = </mml:mo><mml:mi> ∞ </mml:mi></mml:mrow></mml:math></inline-formula> contribute <inline-formula><mml:math><mml:mrow><mml:mi> S </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> B </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula> for all <inline-formula><mml:math><mml:mi> B </mml:mi></mml:math></inline-formula>), while median and quartile MEC are computed over finite values only. This separation distinguishes overall resistance from the cost distribution among evadable instances.</p>
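        <p>As a worked illustration of the survival curve and FRI, the sketch below assumes that an instance survives budget B when its MEC strictly exceeds B, and it evaluates the left Riemann sum at the integer budgets 0 through one below the maximum budget. The MEC values are toy numbers, and the function names are hypothetical.</p>
        <code language="python"><![CDATA[
```python
import math
import statistics


def survival(mecs, budget):
    """Fraction of instances whose minimal evasion cost exceeds the budget."""
    return sum(1 for m in mecs if m > budget) / len(mecs)


def fri_left_riemann(mecs, b_max):
    """Left Riemann sum of S(B) over B = 0..b_max-1, divided by b_max."""
    return sum(survival(mecs, b) for b in range(b_max)) / b_max


def finite_median(mecs):
    """Median MEC over evadable instances only (infinite values dropped)."""
    return statistics.median([m for m in mecs if math.isfinite(m)])


mecs = [1, 2, 2, 4, math.inf]  # toy values; inf marks a non-evadable instance
fri = fri_left_riemann(mecs, b_max=4)
median_mec = finite_median(mecs)
```
]]></code>
        <p>Here the survival values at budgets 0, 1, 2, and 3 are 1.0, 0.8, 0.4, and 0.4, so FRI is 2.6 / 4 = 0.65, while the median over the four finite values is 2.</p>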
        <p>To examine attack-surface structure, let <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> N </mml:mi><mml:mi> j </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denote the total number of edits applied to feature <inline-formula><mml:math><mml:mi> j </mml:mi></mml:math></inline-formula> across all successful minimal-cost traces. The robustness concentration index is </p>
        <disp-formula id="FD4">
          <mml:math>
            <mml:mrow>
              <mml:msub>
                <mml:mrow>
                  <mml:mtext>RCI</mml:mtext>
                </mml:mrow>
                <mml:mi>k</mml:mi>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mstyle displaystyle="true">
                    <mml:msub>
                      <mml:mo>∑</mml:mo>
                      <mml:mrow>
                        <mml:mi>j</mml:mi>
                        <mml:mo>∈</mml:mo>
                        <mml:mtext>Top-</mml:mtext>
                        <mml:mi>k</mml:mi>
                      </mml:mrow>
                    </mml:msub>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>N</mml:mi>
                        <mml:mi>j</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                  </mml:mstyle>
                </mml:mrow>
                <mml:mrow>
                  <mml:mstyle displaystyle="true">
                    <mml:msubsup>
                      <mml:mo>∑</mml:mo>
                      <mml:mrow>
                        <mml:mi>j</mml:mi>
                        <mml:mo>=</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                      <mml:mi>d</mml:mi>
                    </mml:msubsup>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>N</mml:mi>
                        <mml:mi>j</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                  </mml:mstyle>
                </mml:mrow>
              </mml:mfrac>
              <mml:mo>,</mml:mo>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>measuring the fraction of adversarial effort concentrated on the <inline-formula><mml:math><mml:mi> k </mml:mi></mml:math></inline-formula> most frequently edited features. When multiple optimal paths share identical cost, deterministic priority-queue tie-breaking selects a canonical trace. This affects the representative path used for concentration metrics but not MEC itself; recomputing RCI<sub>3</sub> under 10 randomized tie-break orders yields standard deviation below 0.02 in all configurations.</p>
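        <p>The concentration index can be computed directly from the canonical traces. The sketch below assumes each trace is a list of (feature, old value, new value) edits; the traces shown are invented for illustration, with feature names borrowed from Table 4.</p>
        <code language="python"><![CDATA[
```python
from collections import Counter


def rci(traces, k):
    """Share of all edits that fall on the k most frequently edited features."""
    counts = Counter(j for trace in traces for (j, _v, _v_new) in trace)
    total = sum(counts.values())
    if total == 0:
        return float("nan")  # no successful traces, index undefined
    top_k = sum(n for _j, n in counts.most_common(k))
    return top_k / total


# Toy traces (not paper data): six edits in total across four instances.
traces = [
    [("SSLfinal_State", -1, 1)],
    [("SSLfinal_State", -1, 0), ("URL_of_Anchor", -1, 0)],
    [("Prefix_Suffix", -1, 1)],
    [("SSLfinal_State", -1, 1), ("web_traffic", 0, 1)],
]
rci_1 = rci(traces, 1)
rci_3 = rci(traces, 3)
```
]]></code>
        <p>In this toy example SSLfinal_State accounts for three of the six edits, so the top-1 index is 0.5 and the top-3 index is 5/6.</p>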
        <p>To isolate first-step bottlenecks, let <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> j </mml:mi><mml:mn> 1 </mml:mn></mml:msub><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> denote the first-edited feature in the canonical minimal-cost trace. Define <inline-formula><mml:math><mml:mrow><mml:msup><mml:mi> j </mml:mi><mml:mo> ⋆ </mml:mo></mml:msup><mml:mo> = </mml:mo><mml:msub><mml:mrow><mml:mtext> argmax </mml:mtext></mml:mrow><mml:mi> j </mml:mi></mml:msub><mml:mrow><mml:mo> | </mml:mo><mml:mrow><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mi> x </mml:mi><mml:mo> : </mml:mo><mml:msub><mml:mi> j </mml:mi><mml:mn> 1 </mml:mn></mml:msub><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mi> j </mml:mi></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow><mml:mo> | </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> . The FirstTop1 index is </p>
        <disp-formula id="FD5">
          <mml:math>
            <mml:mrow>
              <mml:mtext>FirstTop</mml:mtext>
              <mml:mn>1</mml:mn>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mrow>
                    <mml:mo>|</mml:mo>
                    <mml:mrow>
                      <mml:mrow>
                        <mml:mo>{</mml:mo>
                        <mml:mrow>
                          <mml:mi>x</mml:mi>
                          <mml:mo>:</mml:mo>
                          <mml:msub>
                            <mml:mi>j</mml:mi>
                            <mml:mn>1</mml:mn>
                          </mml:msub>
                          <mml:mrow>
                            <mml:mo>(</mml:mo>
                            <mml:mi>x</mml:mi>
                            <mml:mo>)</mml:mo>
                          </mml:mrow>
                          <mml:mo>=</mml:mo>
                          <mml:msup>
                            <mml:mi>j</mml:mi>
                            <mml:mo>⋆</mml:mo>
                          </mml:msup>
                        </mml:mrow>
                        <mml:mo>}</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                    <mml:mo>|</mml:mo>
                  </mml:mrow>
                </mml:mrow>
                <mml:mrow>
                  <mml:mrow>
                    <mml:mo>|</mml:mo>
                    <mml:mrow>
                      <mml:mrow>
                        <mml:mo>{</mml:mo>
                        <mml:mrow>
                          <mml:mi>x</mml:mi>
                          <mml:mo>:</mml:mo>
                          <mml:mtext>MEC</mml:mtext>
                          <mml:mrow>
                            <mml:mo>(</mml:mo>
                            <mml:mi>x</mml:mi>
                            <mml:mo>)</mml:mo>
                          </mml:mrow>
                          <mml:mo>&lt;</mml:mo>
                          <mml:mi>∞</mml:mi>
                        </mml:mrow>
                        <mml:mo>}</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                    <mml:mo>|</mml:mo>
                  </mml:mrow>
                </mml:mrow>
              </mml:mfrac>
              <mml:mo>,</mml:mo>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>capturing single-feature bottlenecks at the initial decision step.</p>
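        <p>FirstTop1 admits an equally short sketch. It assumes that non-evadable instances (infinite MEC) are represented by an empty trace, so that the denominator counts only instances with a finite-cost trace; the traces are again toy data.</p>
        <code language="python"><![CDATA[
```python
from collections import Counter


def first_top1(traces):
    """Return (j_star, share) of the most common first-edited feature."""
    first_edits = Counter(trace[0][0] for trace in traces if trace)
    evadable = sum(first_edits.values())
    if evadable == 0:
        return None, float("nan")  # no evadable instance in the sample
    j_star, hits = first_edits.most_common(1)[0]
    return j_star, hits / evadable


# Toy traces (not paper data); the empty list marks a non-evadable instance.
traces = [
    [("SSLfinal_State", -1, 1)],
    [("SSLfinal_State", -1, 0), ("URL_of_Anchor", -1, 0)],
    [("Prefix_Suffix", -1, 1)],
    [("SSLfinal_State", -1, 1), ("web_traffic", 0, 1)],
    [],
]
j_star, share = first_top1(traces)
```
]]></code>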
        <p><xref ref-type="fig" rid="fig10">Figure 1</xref> summarizes the evaluation pipeline and robustness diagnostics.</p>
        <fig id="fig10">
          <label>Figure 1</label>
          <caption>
            <p>Cost-aware adversarial robustness framework with MEC, survival curves, and attack-surface concentration.</p>
          </caption>
          <graphic xlink:href="https://html.scirp.org/file/7801221-rId161.jpeg?20260427102843" />
        </fig>
      </sec>
      <sec id="sec2dot5">
        <title>2.5. Query-Limited Greedy Attacker</title>
        <p>To assess whether exact MEC meaningfully bounds realistic attacker efficiency, we also evaluate a query-limited greedy adversary. At each step, the attacker enumerates all admissible single-feature monotone edits from the current configuration <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mrow><mml:mtext> curr </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and selects the edit <inline-formula><mml:math><mml:mrow><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:msup><mml:mi> j </mml:mi><mml:mo> ⋆ </mml:mo></mml:msup><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> that produces the largest increase in the classifier’s output score toward the legitimate class, <inline-formula><mml:math><mml:mrow><mml:mover accent="true"><mml:mi> p </mml:mi><mml:mo> ^ </mml:mo></mml:mover><mml:mrow><mml:mo> ( </mml:mo><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup><mml:mo> ) </mml:mo></mml:mrow><mml:mo> − </mml:mo><mml:mover accent="true"><mml:mi> p </mml:mi><mml:mo> ^ </mml:mo></mml:mover><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mrow><mml:mtext> curr </mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, breaking ties in favor of the lowest-cost edit. Each call to the classifier’s score function counts as one query; evaluating all <inline-formula><mml:math><mml:mrow><mml:mi> O </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> d </mml:mi><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> candidate edits at a given step therefore consumes <inline-formula><mml:math><mml:mrow><mml:mi> O </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> d </mml:mi><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> queries. 
The attacker halts when <inline-formula><mml:math><mml:mrow><mml:mi> f </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:msup><mml:mi> x </mml:mi><mml:mo> ′ </mml:mo></mml:msup><mml:mo> ) </mml:mo></mml:mrow><mml:mo> = </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula> (evasion succeeds), when the cumulative manipulation cost exceeds <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> B </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (budget exhausted), or when the total query count reaches <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> Q </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 50 </mml:mn><mml:mo> , </mml:mo><mml:mn> 100 </mml:mn><mml:mo> , </mml:mo><mml:mn> 500 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, whichever occurs first. Because the greedy attacker follows a myopic per-step criterion and never revisits previously explored configurations, it may terminate without discovering a feasible evasion even when one exists, or it may find one at a cost above the exact MEC. Its empirical survival rates are therefore at least as high as those obtained from exact MEC. <bold>Table 9</bold> reports these comparisons.</p>
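        <p>A minimal sketch of such a query-limited greedy attacker is given below. Several details are assumptions rather than the authors' specification: the score is taken to be the estimated probability of the legitimate class with evasion at 0.5 or above, the initial scoring call is counted as a query, edits that would exceed the budget are skipped, and the attacker also stops when no edit improves the score. The toy score and cost functions are invented for illustration.</p>
        <code language="python"><![CDATA[
```python
def greedy_attack(x, score, cost, b_max, q_max, threshold=0.5):
    """Myopic attacker: at each step take the edit with the largest score gain.

    score(state) is the estimated legitimate-class probability (one query).
    cost(j, v, v_new) returns a positive cost, or None if not admissible.
    Returns (evaded, total_cost, queries_used, trace).
    """
    state, spent, queries, trace = tuple(x), 0, 1, []
    current = score(state)  # one query for the starting configuration
    while current < threshold:
        best = None  # (gain, cost, feature, new value, new score)
        for j, v in enumerate(state):
            for v_new in (-1, 0, 1):
                c = cost(j, v, v_new) if v_new != v else None
                if c is None or spent + c > b_max:
                    continue  # inadmissible edit or over budget
                if queries >= q_max:
                    return False, spent, queries, trace  # query cap reached
                s = score(state[:j] + (v_new,) + state[j + 1:])
                queries += 1
                gain = s - current
                # Larger gain wins; equal gains go to the cheaper edit.
                if best is None or (gain, -c) > (best[0], -best[1]):
                    best = (gain, c, j, v_new, s)
        if best is None or best[0] <= 0:
            return False, spent, queries, trace  # no improving edit left
        _gain, c, j, v_new, current = best
        trace.append((j, state[j], v_new))
        state = state[:j] + (v_new,) + state[j + 1:]
        spent += c
    return True, spent, queries, trace


def toy_score(state):
    # Toy score for three ternary features, rising linearly with their sum.
    return (sum(state) + 3) / 6


def toy_cost(j, v, v_new):
    # Toy schedule: only upward edits; feature 0 costs 1 per step, others 2.
    if v_new <= v:
        return None
    return (1 if j == 0 else 2) * (v_new - v)


result = greedy_attack((-1, -1, 1), toy_score, toy_cost, b_max=10, q_max=50)
capped = greedy_attack((-1, -1, 1), toy_score, toy_cost, b_max=10, q_max=3)
```
]]></code>
        <p>On this toy instance the greedy attacker jumps feature 0 from −1 to +1 at cost 2 because that edit gives the largest score gain, although a single step to 0 at cost 1 already evades; with a cap of three queries it stops before finishing the first enumeration.</p>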
      </sec>
    </sec>
    <sec id="sec3">
      <title>3. Feature Economics and Robustness Limits</title>
      <p>We now establish a structural limit imposed by feature-level manipulation costs. The result identifies a cost floor that bounds achievable robustness independently of model architecture.</p>
      <p><bold>Proposition 3.1 (Cost floor).</bold> <italic>Let</italic> <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mi> min </mml:mi></mml:mrow></mml:msub><mml:mo> = </mml:mo><mml:msub><mml:mrow><mml:mi> min </mml:mi></mml:mrow><mml:mrow><mml:mi> j </mml:mi><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> , </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow></mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> j </mml:mi><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> <italic>be the minimum cost among all admissible single-feature transitions. Fix a classifier</italic> <inline-formula><mml:math><mml:mi> f </mml:mi></mml:math></inline-formula> <italic>and let</italic> <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId182.svg?20260427102845" /> <italic>denote the set of phishing instances correctly detected by</italic> <inline-formula><mml:math><mml:mi> f </mml:mi></mml:math></inline-formula><italic>. If a fraction</italic> <inline-formula><mml:math><mml:mrow><mml:mi> α </mml:mi><mml:mo> &gt; </mml:mo><mml:mn> 0 </mml:mn></mml:mrow></mml:math></inline-formula> <italic>of instances in</italic> <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId188.svg?20260427102845" /> <italic>admit evasion via a single transition of cost</italic> <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula><italic>, then</italic></p>
      <disp-formula>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId192.svg?20260427102845" />
      </disp-formula>
      <p><italic>In particular, if</italic> <inline-formula><mml:math><mml:mrow><mml:mi> α </mml:mi><mml:mo> ≥ </mml:mo><mml:mfrac><mml:mn> 1 </mml:mn><mml:mn> 2 </mml:mn></mml:mfrac></mml:mrow></mml:math></inline-formula><italic>, then</italic> <inline-formula><mml:math><mml:mrow><mml:mtext> median </mml:mtext><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mtext> MEC </mml:mtext></mml:mrow><mml:mo> ) </mml:mo></mml:mrow><mml:mo> ≤ </mml:mo><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula><italic>. Hence the</italic> <inline-formula><mml:math><mml:mi> α </mml:mi></mml:math></inline-formula><italic>-quantile of the MEC distribution cannot exceed</italic> <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> <italic>without modifying the feature space or cost schedule.</italic></p>
      <p><italic>Proof.</italic> For each <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId202.svg?20260427102845" /> admitting a single-feature evasion <inline-formula><mml:math><mml:mrow><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> j </mml:mi><mml:mo> , </mml:mo><mml:mi> v </mml:mi><mml:mo> → </mml:mo><mml:msup><mml:mi> v </mml:mi><mml:mo> ′ </mml:mo></mml:msup></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> of cost <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, one has <inline-formula><mml:math><mml:mrow><mml:mtext> MEC </mml:mtext><mml:mrow><mml:mo> ( </mml:mo><mml:mi> x </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> ≤ </mml:mo><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> by definition of the infimum. Since these instances constitute at least an <inline-formula><mml:math><mml:mi> α </mml:mi></mml:math></inline-formula> fraction of <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId212.svg?20260427102845" />, the distributional bound follows directly. The median statement is an immediate consequence of the definition: when <inline-formula><mml:math><mml:mrow><mml:mi> α </mml:mi><mml:mo> ≥ </mml:mo><mml:mfrac><mml:mn> 1 </mml:mn><mml:mn> 2 </mml:mn></mml:mfrac></mml:mrow></mml:math></inline-formula>, at least half the probability mass lies at or below <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. <inline-graphic xlink:href="https://html.scirp.org/file/7801221-rId218.jpeg?20260427102844" /></p>
      <p>The force of Proposition 3.1 is not in the proof technique—which is elementary—but in the structural invariance it implies. Regardless of how a classifier partitions feature space, any instance that lies within a single cheap transition of a legitimate-classified region is evadable at cost <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> . Whether that fraction <inline-formula><mml:math><mml:mi> α </mml:mi></mml:math></inline-formula> is large depends on the interaction between the cost landscape and the classifier’s decision boundary, and the empirical contribution of this work is to show that <inline-formula><mml:math><mml:mi> α </mml:mi></mml:math></inline-formula> is indeed large across all tested architectures.</p>
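      <p>A small numerical check makes the cost floor concrete. The MEC values below are toy numbers, not results from this study, and the helper name is hypothetical; the sketch only confirms that when k of n instances have MEC at or below the minimum transition cost, the k-th smallest MEC (the empirical quantile at level k/n) cannot exceed that cost.</p>
      <code language="python"><![CDATA[
```python
import math
import statistics


def cost_floor_check(mecs, c_min):
    """Return (alpha, k-th smallest MEC), where k counts MEC values <= c_min."""
    k = sum(1 for m in mecs if m <= c_min)
    alpha = k / len(mecs)
    kth_smallest = sorted(mecs)[k - 1] if k else None
    return alpha, kth_smallest


c_min = 1
mecs = [1, 1, 1, 3, 5, math.inf]  # toy MEC values; inf marks no evasion
alpha, alpha_quantile = cost_floor_check(mecs, c_min)
lower_median = statistics.median_low(mecs)
```
]]></code>
      <p>Here half of the instances are evadable at the minimum cost, so both the empirical quantile at level one half and the lower median equal that cost, whatever the remaining values are.</p>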
      <p><bold>Corollary</bold><bold>3.1</bold><bold>(</bold><bold>Action-set-limited</bold><bold>invariance</bold><bold>)</bold><bold>.</bold><italic>Fix</italic><inline-formula><mml:math><mml:mrow><mml:mi mathvariant="script"> X </mml:mi><mml:mo> = </mml:mo><mml:msup><mml:mrow><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0 </mml:mn><mml:mo> , </mml:mo><mml:mo> + </mml:mo><mml:mn> 1 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow><mml:mi> d </mml:mi></mml:msup></mml:mrow></mml:math></inline-formula><italic>and</italic><italic>a</italic><italic>monotone</italic><italic>cost</italic><italic>function</italic><inline-formula><mml:math><mml:mi> c </mml:mi></mml:math></inline-formula><italic>with</italic><italic>minimum</italic><italic>transition</italic><italic>cost</italic><inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mi> min </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula><italic>.</italic><italic>Let</italic><inline-formula><mml:math><mml:mrow><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:msub><mml:mi> f </mml:mi><mml:mn> 1 </mml:mn></mml:msub><mml:mo> , </mml:mo><mml:mo> ⋯ </mml:mo><mml:mo> , </mml:mo><mml:msub><mml:mi> f </mml:mi><mml:mi> K </mml:mi></mml:msub></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula><italic>be</italic><italic>classifiers</italic><italic>evaluated</italic><italic>on</italic><italic>a</italic><italic>common</italic><italic>conditioning</italic><italic>set</italic></p>
      <fig id="fig17">
        <label>Figure 17</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId233.svg?20260427102845" />
      </fig>
      <p>, <italic>and suppose that for each </italic><inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> f </mml:mi><mml:mi> k </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula><italic> at least an </italic><inline-formula><mml:math><mml:mrow><mml:mi> α </mml:mi><mml:mo> &gt; </mml:mo><mml:mn> 0 </mml:mn></mml:mrow></mml:math></inline-formula><italic> fraction of</italic></p>
      <fig id="fig18">
        <label>Figure 18</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId239.svg?20260427102845" />
      </fig>
      <p><italic>admits single-transition evasion at cost </italic><inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mi> min </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula><italic>. Then for every </italic><inline-formula><mml:math><mml:mi> k </mml:mi></mml:math></inline-formula>:</p>
      <fig id="fig19">
        <label>Figure 19</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId244.svg?20260427102845" />
      </fig>
      <p><italic>Architectural variation alone cannot exceed this bound. Invariance breaks when the feature representation changes</italic> (<italic>removing or hardening features</italic>), <italic>when the cost schedule is modified</italic> (<italic>raising </italic><inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>), <italic>or when the feature extractor is made robust to manipulation</italic> (<italic>reducing the attacker’s effective action set</italic>)<italic>.</italic></p>
      <p><italic>Proof.</italic> Under a common action set and a shared conditioning set, Proposition 3.1 applies identically to each <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> f </mml:mi><mml:mi> k </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <fig id="fig20">
        <label>Figure 20</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId218.jpeg?20260427102844" />
      </fig>
      <p><bold>Table 5</bold> reports the empirical fraction <inline-formula><mml:math><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi> α </mml:mi><mml:mo> ^ </mml:mo></mml:mover><mml:mrow><mml:msub><mml:mi> c </mml:mi><mml:mrow><mml:mtext> min </mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> per classifier and feature configuration, confirming that the cost floor binds in practice and that the invariance argument is supported individually for each architecture rather than only in aggregate.</p>
      <p><bold>Table 5</bold><bold>.</bold> Empirical mass at the cost floor </p>
      <fig id="fig21">
        <label>Figure 21</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId252.svg?20260427102845" />
      </fig>
      <p>, reported per classifier and feature configuration. The architecture-invariance argument of Corollary 3.1 is supported individually across all models.</p>
      <table-wrap id="tbl5">
        <label>Table 5</label>
        <table>
          <tbody>
            <tr>
              <td>Feature set</td>
              <td>Schedule</td>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>c</mml:mi>
                        <mml:mrow>
                          <mml:mtext>min</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mover accent="true">
                      <mml:mi>α</mml:mi>
                      <mml:mo>^</mml:mo>
                    </mml:mover>
                  </mml:math>
                </inline-formula>
                (LR)
              </td>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mover accent="true">
                      <mml:mi>α</mml:mi>
                      <mml:mo>^</mml:mo>
                    </mml:mover>
                  </mml:math>
                </inline-formula>
                (RF)
              </td>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mover accent="true">
                      <mml:mi>α</mml:mi>
                      <mml:mo>^</mml:mo>
                    </mml:mover>
                  </mml:math>
                </inline-formula>
                (GBDT)
              </td>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mover accent="true">
                      <mml:mi>α</mml:mi>
                      <mml:mo>^</mml:mo>
                    </mml:mover>
                  </mml:math>
                </inline-formula>
                (XGB)
              </td>
              <td>Median MEC</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>base</td>
              <td>1</td>
              <td>0.31</td>
              <td>0.29</td>
              <td>0.30</td>
              <td>0.28</td>
              <td>2</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>strict</td>
              <td>1</td>
              <td>0.33</td>
              <td>0.31</td>
              <td>0.32</td>
              <td>0.30</td>
              <td>2</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>base</td>
              <td>1</td>
              <td>0.28</td>
              <td>0.27</td>
              <td>0.28</td>
              <td>0.27</td>
              <td>2</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>strict</td>
              <td>1</td>
              <td>0.15</td>
              <td>0.06</td>
              <td>0.07</td>
              <td>0.05</td>
              <td>2</td>
            </tr>
            <tr>
              <td>VA-7b</td>
              <td>base</td>
              <td>1</td>
              <td>0.62</td>
              <td>0.61</td>
              <td>0.60</td>
              <td>0.60</td>
              <td>1</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
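      <p>The relationship between the mass at the cost floor and the median MEC can be illustrated with a minimal Python sketch. It assumes a hypothetical list <monospace>mec</monospace> of per-instance MEC values, with <monospace>None</monospace> marking instances that cannot be evaded within budget; the function names and the toy data are illustrative assumptions and are not taken from the authors' implementation.</p>
      <code language="python"><![CDATA[
```python
from statistics import median


def mass_at_floor(mec, c_min=1):
    """Fraction of all instances whose evasion cost equals the floor c_min."""
    if not mec:
        raise ValueError("mec must be non-empty")
    return sum(1 for m in mec if m == c_min) / len(mec)


def median_mec(mec):
    """Median cost over evadable instances (None = infeasible within budget)."""
    feasible = [m for m in mec if m is not None]
    return median(feasible)


# Toy data, made up for illustration: 3 of 10 instances sit at the floor.
toy = [1, 1, 1, 2, 2, 2, 2, 2, 3, 3]
alpha_hat = mass_at_floor(toy)
med = median_mec(toy)
print(alpha_hat, med)
```
]]></code>
      <p>In this toy sample the mass at cost 1 is below one half, so the median sits at the next cost level, which mirrors the pattern reported for the Full and RA-8 configurations.</p>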
    </sec>
    <sec id="sec4">
      <title>4. Results</title>
      <p><bold>Table 6</bold> reports held-out classification performance on the full feature set. All models achieve strong discrimination (AUC between 0.979 and 0.995), suggesting reliable deployment under static evaluation. The adversarial analysis below demonstrates that this conclusion does not survive once feature manipulation is permitted.</p>
      <p><bold>Table 6</bold><bold>.</bold> Held-out classification performance (Full feature set, threshold = 0.5). </p>
      <table-wrap id="tbl6">
        <label>Table 6</label>
        <table>
          <tbody>
            <tr>
              <td>Model</td>
              <td>Accuracy</td>
              <td>AUC</td>
              <td>Phishing TPR</td>
            </tr>
            <tr>
              <td>Logistic Regression</td>
              <td>0.927</td>
              <td>0.979</td>
              <td>0.900</td>
            </tr>
            <tr>
              <td>Random Forest</td>
              <td>0.950</td>
              <td>0.993</td>
              <td>0.913</td>
            </tr>
            <tr>
              <td>GBDT</td>
              <td>0.953</td>
              <td>0.990</td>
              <td>0.932</td>
            </tr>
            <tr>
              <td>XGBoost</td>
              <td>0.965</td>
              <td>0.995</td>
              <td>0.953</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p><bold>Table 7</bold> presents the central robustness results. Two regularities dominate across all configurations.</p>
      <p><bold>Table 7.</bold> Robustness across feature sets and schedules. MEC is the median MEC with its interquartile range [Q1, Q3]; FT1 denotes FirstTop1; the NoEv (NoEvasion) column reports the infeasible mass within <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> B </mml:mi><mml:mrow><mml:mtext> max </mml:mtext></mml:mrow></mml:msub><mml:mo> = </mml:mo><mml:mn> 18 </mml:mn></mml:mrow></mml:math></inline-formula>.</p>
      <table-wrap id="tbl7">
        <label>Table 7</label>
        <table>
          <tbody>
            <tr>
              <td>Features</td>
              <td>Sched.</td>
              <td>Model</td>
              <td>Acc</td>
              <td>FRI</td>
              <td>MEC</td>
              <td>[Q1, Q3]</td>
              <td>
                RCI
                <sub>3</sub>
              </td>
              <td>FT1</td>
              <td>NoEv</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>base</td>
              <td>Logit</td>
              <td>0.927</td>
              <td>0.076</td>
              <td>2</td>
              <td>[1, 2]</td>
              <td>0.961</td>
              <td>0.850</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>base</td>
              <td>RF</td>
              <td>0.950</td>
              <td>0.092</td>
              <td>2</td>
              <td>[2, 3]</td>
              <td>0.843</td>
              <td>0.580</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>base</td>
              <td>GBDT</td>
              <td>0.953</td>
              <td>0.076</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>0.892</td>
              <td>0.370</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>base</td>
              <td>XGB</td>
              <td>0.965</td>
              <td>0.092</td>
              <td>2</td>
              <td>[2, 3]</td>
              <td>0.815</td>
              <td>0.440</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>strict</td>
              <td>Logit</td>
              <td>0.927</td>
              <td>0.077</td>
              <td>2</td>
              <td>[1, 2]</td>
              <td>0.975</td>
              <td>0.847</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>strict</td>
              <td>RF</td>
              <td>0.950</td>
              <td>0.093</td>
              <td>2</td>
              <td>[2, 3]</td>
              <td>0.843</td>
              <td>0.540</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>strict</td>
              <td>GBDT</td>
              <td>0.953</td>
              <td>0.075</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>0.854</td>
              <td>0.397</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>Full</td>
              <td>strict</td>
              <td>XGB</td>
              <td>0.965</td>
              <td>0.091</td>
              <td>2</td>
              <td>[2, 3]</td>
              <td>0.784</td>
              <td>0.413</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>base</td>
              <td>Logit</td>
              <td>0.869</td>
              <td>0.081</td>
              <td>2</td>
              <td>[1, 2]</td>
              <td>1.00</td>
              <td>0.993</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>base</td>
              <td>RF</td>
              <td>0.900</td>
              <td>0.104</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>0.972</td>
              <td>0.973</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>base</td>
              <td>GBDT</td>
              <td>0.899</td>
              <td>0.091</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>1.00</td>
              <td>0.993</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>base</td>
              <td>XGB</td>
              <td>0.904</td>
              <td>0.096</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>0.986</td>
              <td>0.990</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>strict</td>
              <td>Logit</td>
              <td>0.869</td>
              <td>0.086</td>
              <td>2</td>
              <td>[1, 2]</td>
              <td>0.980</td>
              <td>0.997</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>strict</td>
              <td>RF</td>
              <td>0.900</td>
              <td>0.247</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>1.00</td>
              <td>1.00</td>
              <td>18%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>strict</td>
              <td>GBDT</td>
              <td>0.899</td>
              <td>0.231</td>
              <td>2</td>
              <td>[1.75, 2]</td>
              <td>1.00</td>
              <td>1.00</td>
              <td>17%</td>
            </tr>
            <tr>
              <td>RA-8</td>
              <td>strict</td>
              <td>XGB</td>
              <td>0.904</td>
              <td>0.251</td>
              <td>2</td>
              <td>[2, 2]</td>
              <td>1.00</td>
              <td>1.00</td>
              <td>19%</td>
            </tr>
            <tr>
              <td>VA-7b</td>
              <td>base</td>
              <td>Logit</td>
              <td>0.862</td>
              <td>0.049</td>
              <td>1</td>
              <td>[1, 2]</td>
              <td>0.983</td>
              <td>0.897</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>VA-7b</td>
              <td>base</td>
              <td>RF</td>
              <td>0.871</td>
              <td>0.042</td>
              <td>1</td>
              <td>[1, 2]</td>
              <td>0.987</td>
              <td>0.827</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>VA-7b</td>
              <td>base</td>
              <td>GBDT</td>
              <td>0.869</td>
              <td>0.046</td>
              <td>1</td>
              <td>[1, 2]</td>
              <td>0.997</td>
              <td>0.827</td>
              <td>0%</td>
            </tr>
            <tr>
              <td>VA-7b</td>
              <td>base</td>
              <td>XGB</td>
              <td>0.869</td>
              <td>0.044</td>
              <td>1</td>
              <td>[1, 2]</td>
              <td>0.880</td>
              <td>0.827</td>
              <td>0%</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p>First, robustness is bounded by a low effective cost floor. On the full feature set, all architectures exhibit median MEC = 2 with narrow interquartile ranges and small FRI values. Although single-feature transitions of cost 1 exist, the empirical mass at cost 1 falls below one half, so the median binds at the next effective threshold. The convergence of linear, bagging, and boosting models to the same median MEC is consistent with the action-set-limited invariance of Corollary 3.1.</p>
      <p>Second, successful evasion concentrates sharply on a small feature subset. For the full feature set under the base schedule, RCI<sub>3</sub> exceeds 0.81 for every model and reaches 0.96 for logistic regression (<bold>Table 7</bold>). Evasion traces collapse onto low-cost, high-influence features rather than dispersing across the representation. The 95% bootstrap confidence intervals (200 resamples) indicate that these patterns are stable: the interval for median MEC is [2, 2] for every model except XGBoost ([2, 3]), RCI<sub>3</sub> lies within ±0.04 of its point estimate, and FRI within ±0.01 (<bold>Table 8</bold>).</p>
      <p><bold>Table 8</bold><bold>.</bold> 95% bootstrap confidence intervals (200 resamples, Full/base, 300-instance intersection). </p>
      <table-wrap id="tbl8">
        <label>Table 8</label>
        <table>
          <tbody>
            <tr>
              <td>Model</td>
              <td>Median MEC [95% CI]</td>
              <td>FRI [95% CI]</td>
              <td>
                RCI
                <sub>3</sub>
                [95% CI]
              </td>
            </tr>
            <tr>
              <td>Logistic Regression</td>
              <td>2 [2, 2]</td>
              <td>0.076 [0.068, 0.084]</td>
              <td>0.96 [0.94, 0.97]</td>
            </tr>
            <tr>
              <td>Random Forest</td>
              <td>2 [2, 2]</td>
              <td>0.092 [0.082, 0.101]</td>
              <td>0.84 [0.80, 0.87]</td>
            </tr>
            <tr>
              <td>GBDT</td>
              <td>2 [2, 2]</td>
              <td>0.076 [0.069, 0.083]</td>
              <td>0.89 [0.86, 0.92]</td>
            </tr>
            <tr>
              <td>XGBoost</td>
              <td>2 [2, 3]</td>
              <td>0.092 [0.083, 0.101]</td>
              <td>0.82 [0.78, 0.85]</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
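      <p>For readers who wish to reproduce intervals of this kind, the following Python sketch computes a percentile bootstrap interval for the median. The percentile method, the function name <monospace>bootstrap_ci</monospace>, the fixed seed, and the toy cost sample are assumptions made for illustration; the text specifies only the number of resamples (200) and the sample size (300).</p>
      <code language="python"><![CDATA[
```python
import random
from statistics import median


def bootstrap_ci(values, stat=median, n_resamples=200, level=0.95, seed=0):
    """Percentile bootstrap interval for stat(values)."""
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement, recompute the statistic, and sort the results.
    stats = sorted(
        stat([values[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_resamples)
    )
    tail = (1.0 - level) / 2.0
    lo = stats[int(tail * (n_resamples - 1))]
    hi = stats[int(round((1.0 - tail) * (n_resamples - 1)))]
    return lo, hi


# Toy sample of 300 costs (made up): 30% at cost 1, 50% at cost 2, 20% at cost 3.
toy = [1] * 90 + [2] * 150 + [3] * 60
ci = bootstrap_ci(toy)
print(ci)
```
]]></code>
      <p>Because the costs are discrete and the median is well separated from the neighboring cost levels in this toy sample, the interval collapses to a single value, which is the same behavior seen in the [2, 2] intervals of Table 8.</p>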
      <p>The RA-8 configuration makes the cost-floor mechanism explicit. Despite emphasizing infrastructure features, RA-8 retains SSLfinal_State, a low-cost surface coordinate. Median MEC remains 2, while concentration becomes nearly degenerate (<inline-formula><mml:math><mml:mrow><mml:msub><mml:mrow><mml:mtext> RCI </mml:mtext></mml:mrow><mml:mn> 3 </mml:mn></mml:msub><mml:mo> ≈ </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math><mml:mrow><mml:mtext> FirstTop </mml:mtext><mml:mn> 1 </mml:mn><mml:mo> ≈ </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula>). The surface-only VA-7b set exhibits the lowest robustness (median MEC = 1, FRI &lt; 0.05).</p>
      <p>Cost schedules matter only when they eliminate dominant cheap paths. This occurs in RA-8 under the strict schedule: ensemble models exhibit 17–19% infeasible mass, raising FRI to 0.23–0.25, while median MEC among evadable instances remains 2. The gain arises from blocked feasibility rather than uniformly higher evasion costs. Logistic regression remains fully evadable in RA-8/strict, indicating alternative low-cost paths in the linear boundary.</p>
      <p><xref ref-type="fig" rid="fig2">Figure 2</xref> displays evasion survival curves. VA-7b collapses immediately (<inline-formula><mml:math><mml:mrow><mml:mi> S </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> B </mml:mi><mml:mo> ) </mml:mo></mml:mrow><mml:mo> &lt; </mml:mo><mml:mn> 0.05 </mml:mn></mml:mrow></mml:math></inline-formula> by <inline-formula><mml:math><mml:mrow><mml:mi> B </mml:mi><mml:mo> = </mml:mo><mml:mn> 2 </mml:mn></mml:mrow></mml:math></inline-formula>). Full and RA-8/base decay to near zero by <inline-formula><mml:math><mml:mrow><mml:mi> B </mml:mi><mml:mo> = </mml:mo><mml:mn> 4 </mml:mn></mml:mrow></mml:math></inline-formula>. RA-8/strict exhibits a persistent plateau near 0.18, matching the infeasible mass in <bold>Table 7</bold>. The strict schedule generates a structural tail rather than shifting the central cost distribution.</p>
      <fig id="fig22">
        <label>Figure 22</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId276.jpeg?20260427102845" />
      </fig>
      <p><bold>Figure 2.</bold> Evasion survival curves. RA-8/strict exhibits a persistent plateau corresponding to instances whose dominant low-cost path is blocked. Confidence bands are omitted for clarity; they are narrow, with 95% bootstrap intervals for <inline-formula><mml:math><mml:mrow><mml:mi> S </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mn> 2 </mml:mn><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> spanning ±0.04 across configurations.</p>
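      <p>A survival curve of this kind can be sketched in a few lines of Python. The sketch assumes that S(B) is the share of instances not evaded within budget B, which is consistent with the reported plateau matching the infeasible mass but is not restated in this section; the function name <monospace>survival</monospace> and the toy costs are hypothetical.</p>
      <code language="python"><![CDATA[
```python
def survival(mec, budgets):
    """S(B): share of instances NOT evaded within budget B.

    None marks an instance with no feasible evasion path, which survives
    at every budget and therefore produces a plateau in the curve.
    """
    n = len(mec)
    return {
        b: sum(1 for m in mec if m is None or m > b) / n
        for b in budgets
    }


# Toy costs (made up): two of ten instances are infeasible.
toy = [1, 1, 2, 2, 2, 2, 3, 4, None, None]
curve = survival(toy, budgets=[0, 1, 2, 3, 4])
for b, s in curve.items():
    print(b, s)
```
]]></code>
      <p>In the toy data the curve stops decaying at the infeasible share, so raising the budget further does not reduce survival; this is the structural tail described for RA-8/strict.</p>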
      <p><xref ref-type="fig" rid="fig3">Figure 3</xref> displays first-edit concentration across feature sets. RA-8 concentrates nearly all optimal traces on a single initial edit (SSLfinal_State), while Full distributes first edits across a small but nontrivial subset. Even in the latter case, concentration remains substantial.</p>
      <fig id="fig23">
        <label>Figure 23</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId279.jpeg?20260427102845" />
      </fig>
      <p><bold>Figure 3.</bold> First-edit concentration by feature set and schedule. RA-8 exhibits near-total concentration on SSLfinal_State.</p>
      <p>Stratification by the bottleneck feature confirms the blocked-path mechanism in RA-8/strict. When SSLfinal_State begins at −1 or 0, low-cost upgrades remain available and evasion succeeds with median MEC between 1 and 2. When SSLfinal_State is already +1, the dominant path is blocked and a persistent infeasible tail appears (<xref ref-type="fig" rid="fig4">Figure 4</xref>).</p>
      <fig id="fig24">
        <label>Figure 24</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId280.jpeg?20260427102845" />
      </fig>
      <p><bold>Figure 4.</bold> RA-8/strict survival stratified by SSLfinal_State initial value. A persistent infeasible tail appears when the bottleneck feature is already at +1.</p>
      <p><xref ref-type="fig" rid="fig5">Figure 5</xref> compares i.i.d. accuracy with median MEC. All architectures align along a horizontal band at MEC = 2, confirming that higher accuracy does not yield higher median robustness when low-cost transitions remain available.</p>
      <fig id="fig25">
        <label>Figure 25</label>
        <graphic xlink:href="https://html.scirp.org/file/7801221-rId281.jpeg?20260427102845" />
      </fig>
      <p><bold>Figure 5.</bold> Accuracy versus median MEC. All architectures converge to the effective cost floor, consistent with Corollary 3.1.</p>
      <p><bold>Table 9</bold> compares exact MEC with greedy approximations under query budgets of 50, 100, and 500. Query limitations modestly increase survival, particularly at 50 queries, but the deviation from exact MEC is small and narrows rapidly. In Full/base, the maximum gap at <inline-formula><mml:math><mml:mrow><mml:mi> B </mml:mi><mml:mo> = </mml:mo><mml:mn> 2 </mml:mn></mml:mrow></mml:math></inline-formula> is 0.08. Exact MEC thus provides a meaningful upper bound on attacker capability: query-limited adversaries are less efficient but face the same structural cost-floor constraints.</p>
      <p><bold>Table 9</bold><bold>.</bold> Evasion survival <inline-formula><mml:math><mml:mrow><mml:mi> S </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mi> B </mml:mi><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> under query-limited greedy search versus exact MEC. The greedy attacker is described in Section 2.5. </p>
      <table-wrap id="tbl9">
        <label>Table 9</label>
        <table>
          <tbody>
            <tr>
              <td rowspan="2">Config</td>
              <td rowspan="2">Model</td>
              <td colspan="4">
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mrow>
                        <mml:mo>(</mml:mo>
                        <mml:mrow>
                          <mml:mi>B</mml:mi>
                          <mml:mo>=</mml:mo>
                          <mml:mn>2</mml:mn>
                        </mml:mrow>
                        <mml:mo>)</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td colspan="4">
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mrow>
                        <mml:mo>(</mml:mo>
                        <mml:mrow>
                          <mml:mi>B</mml:mi>
                          <mml:mo>=</mml:mo>
                          <mml:mn>4</mml:mn>
                        </mml:mrow>
                        <mml:mo>)</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
            </tr>
            <tr>
              <td>Q50</td>
              <td>Q100</td>
              <td>Q500</td>
              <td>Exact</td>
              <td>Q50</td>
              <td>Q100</td>
              <td>Q500</td>
              <td>Exact</td>
            </tr>
            <tr>
              <td>Full/base</td>
              <td>Logit</td>
              <td>0.28</td>
              <td>0.24</td>
              <td>0.23</td>
              <td>0.22</td>
              <td>0.05</td>
              <td>0.03</td>
              <td>0.02</td>
              <td>0.02</td>
            </tr>
            <tr>
              <td>Full/base</td>
              <td>GBDT</td>
              <td>0.30</td>
              <td>0.26</td>
              <td>0.23</td>
              <td>0.22</td>
              <td>0.08</td>
              <td>0.05</td>
              <td>0.03</td>
              <td>0.02</td>
            </tr>
            <tr>
              <td>Full/base</td>
              <td>XGB</td>
              <td>0.34</td>
              <td>0.30</td>
              <td>0.29</td>
              <td>0.28</td>
              <td>0.07</td>
              <td>0.05</td>
              <td>0.03</td>
              <td>0.03</td>
            </tr>
            <tr>
              <td>RA-8/str</td>
              <td>GBDT</td>
              <td>0.48</td>
              <td>0.44</td>
              <td>0.43</td>
              <td>0.42</td>
              <td>0.24</td>
              <td>0.21</td>
              <td>0.19</td>
              <td>0.19</td>
            </tr>
            <tr>
              <td>RA-8/str</td>
              <td>XGB</td>
              <td>0.50</td>
              <td>0.46</td>
              <td>0.44</td>
              <td>0.42</td>
              <td>0.26</td>
              <td>0.22</td>
              <td>0.20</td>
              <td>0.19</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
    </sec>
    <sec id="sec5">
      <title>5. Cost Sensitivity Analysis</title>
      <p>We evaluate robustness under three classes of cost perturbation to assess whether the conclusions depend on the specific cost magnitudes chosen. First, surface costs are scaled by <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> λ </mml:mi><mml:mrow><mml:mtext> surf </mml:mtext></mml:mrow></mml:msub><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 2 </mml:mn><mml:mo> , </mml:mo><mml:mn> 3 </mml:mn><mml:mo> , </mml:mo><mml:mn> 4 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. Second, semi-domain and infrastructure costs are scaled independently by <inline-formula><mml:math><mml:mrow><mml:mi> λ </mml:mi><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 0.5 </mml:mn><mml:mo> , </mml:mo><mml:mn> 1 </mml:mn><mml:mo> , </mml:mo><mml:mn> 2 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. Third, the cost assignment itself is perturbed in two ways: SSLfinal_State is reclassified from surface to semi-domain, and a rank-preserving perturbation multiplies each cost by an independent factor <inline-formula><mml:math><mml:mrow><mml:mi> U </mml:mi><mml:mo> ~ </mml:mo><mml:mtext> Uniform </mml:mtext><mml:mrow><mml:mo> [ </mml:mo><mml:mrow><mml:mn> 0.8 </mml:mn><mml:mo> , </mml:mo><mml:mn> 1.2 </mml:mn></mml:mrow><mml:mo> ] </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, repeated over 50 draws.</p>
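      <p>The three perturbation classes can be sketched in Python as follows. The tier costs (1, 3, 6), the placeholder feature names other than SSLfinal_State, and all function names are hypothetical values chosen for illustration; they are not the cost schedule used in the experiments. With tier costs this far apart, multiplicative noise in [0.8, 1.2] cannot reorder costs across tiers, which is one way a perturbation of this kind can be rank-preserving.</p>
      <code language="python"><![CDATA[
```python
import random

# Hypothetical tiers and base costs (illustrative values only).
TIER = {
    "SSLfinal_State": "surface",
    "surface_feature": "surface",
    "semi_feature": "semi",
    "infra_feature": "infra",
}
BASE = {"surface": 1.0, "semi": 3.0, "infra": 6.0}


def feature_costs(lam=None, tier_override=None):
    """Per-feature cost after optional reclassification and tier scaling."""
    lam = lam or {}
    tiers = {**TIER, **(tier_override or {})}
    return {f: BASE[t] * lam.get(t, 1.0) for f, t in tiers.items()}


def jitter(costs, rng, low=0.8, high=1.2):
    """Multiply each cost by an independent Uniform[low, high] factor."""
    return {f: c * rng.uniform(low, high) for f, c in costs.items()}


def order_preserved(costs, noisy):
    """True if every strict cost ordering in `costs` survives in `noisy`."""
    feats = list(costs)
    return all(
        (costs[a] < costs[b]) == (noisy[a] < noisy[b])
        for a in feats
        for b in feats
        if costs[a] != costs[b]
    )


rng = random.Random(0)
base = feature_costs()
doubled = feature_costs(lam={"surface": 2.0})                      # class 1
cheap_semi = feature_costs(lam={"semi": 0.5})                      # class 2
reclass = feature_costs(tier_override={"SSLfinal_State": "semi"})  # class 3a
draws = [jitter(base, rng) for _ in range(50)]                     # class 3b
stable = all(order_preserved(base, d) for d in draws)
print(doubled["SSLfinal_State"], reclass["SSLfinal_State"], stable)
```
]]></code>
      <p>Each perturbed cost table would then be passed to the same MEC search, so that any change in median MEC or concentration can be attributed to the cost change alone.</p>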
      <p><bold>Table 10</bold> reports results under surface scaling. The median MEC shifts proportionally, confirming linear cost-floor behavior: doubling surface costs increases the median from 2 to 4, while preserving the identity and ordering of the three most-edited features. Even at <inline-formula><mml:math><mml:mrow><mml:msub><mml:mi> λ </mml:mi><mml:mrow><mml:mtext> surf </mml:mtext></mml:mrow></mml:msub><mml:mo> = </mml:mo><mml:mn> 4 </mml:mn></mml:mrow></mml:math></inline-formula>, concentration remains high (<inline-formula><mml:math><mml:mrow><mml:msub><mml:mrow><mml:mtext> RCI </mml:mtext></mml:mrow><mml:mn> 3 </mml:mn></mml:msub><mml:mo> ≥ </mml:mo><mml:mn> 0.80 </mml:mn></mml:mrow></mml:math></inline-formula>).</p>
      <p><bold>Table 10</bold><bold>.</bold> Median MEC and concentration under surface cost scaling (Full/base, GBDT).</p>
      <table-wrap id="tbl10">
        <label>Table 10</label>
        <table>
          <tbody>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>λ</mml:mi>
                        <mml:mrow>
                          <mml:mtext>surf</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>Median MEC</td>
              <td>
                RCI
                <sub>3</sub>
              </td>
              <td>Top-3 features</td>
            </tr>
            <tr>
              <td>1 (base)</td>
              <td>2</td>
              <td>0.89</td>
              <td>URL_of_Anchor, SSLfinal_State, SFH</td>
            </tr>
            <tr>
              <td>2</td>
              <td>4</td>
              <td>0.87</td>
              <td>URL_of_Anchor, SSLfinal_State, SFH</td>
            </tr>
            <tr>
              <td>3</td>
              <td>6</td>
              <td>0.84</td>
              <td>URL_of_Anchor, SSLfinal_State, SFH</td>
            </tr>
            <tr>
              <td>4</td>
              <td>8</td>
              <td>0.82</td>
              <td>URL_of_Anchor, SSLfinal_State, SFH</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p><bold>Table 11</bold> reports extended perturbations. Scaling semi-domain or infrastructure costs does not alter median MEC because surface transitions remain dominant. Reclassifying SSLfinal_State increases median MEC to 3 in RA-8 (where it is the bottleneck) but leaves Full unchanged due to alternative surface paths. Under rank-preserving noise, mean RCI<sub>3</sub> = 0.88 ± 0.02, indicating stability to moderate cost uncertainty.</p>
      <p><bold>Table 11</bold><bold>.</bold> Extended cost sensitivity (Full/base, GBDT unless noted).</p>
      <table-wrap id="tbl11">
        <label>Table 11</label>
        <table>
          <tbody>
            <tr>
              <td>Perturbation</td>
              <td>Median MEC</td>
              <td>
                RCI
                <sub>3</sub>
              </td>
              <td>Notes</td>
            </tr>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>λ</mml:mi>
                        <mml:mrow>
                          <mml:mtext>semi</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                      <mml:mo>=</mml:mo>
                      <mml:mn>0.5</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>2</td>
              <td>0.89</td>
              <td>Surface paths remain cheapest</td>
            </tr>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>λ</mml:mi>
                        <mml:mrow>
                          <mml:mtext>semi</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                      <mml:mo>=</mml:mo>
                      <mml:mn>2</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>2</td>
              <td>0.89</td>
              <td>Semi-domain rarely on optimal path</td>
            </tr>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>λ</mml:mi>
                        <mml:mrow>
                          <mml:mtext>infra</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                      <mml:mo>=</mml:mo>
                      <mml:mn>0.5</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>2</td>
              <td>0.88</td>
              <td>Surface transitions dominate</td>
            </tr>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>λ</mml:mi>
                        <mml:mrow>
                          <mml:mtext>infra</mml:mtext>
                        </mml:mrow>
                      </mml:msub>
                      <mml:mo>=</mml:mo>
                      <mml:mn>2</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
              </td>
              <td>2</td>
              <td>0.90</td>
              <td>Infrastructure edits avoided</td>
            </tr>
            <tr>
              <td>
                SSL
                <inline-formula>
                  <mml:math>
                    <mml:mo>→</mml:mo>
                  </mml:math>
                </inline-formula>
                semi-domain (Full)
              </td>
              <td>2</td>
              <td>0.84</td>
              <td>Alternative surface paths used</td>
            </tr>
            <tr>
              <td>
                SSL
                <inline-formula>
                  <mml:math>
                    <mml:mo>→</mml:mo>
                  </mml:math>
                </inline-formula>
                semi-domain (RA-8)
              </td>
              <td>3</td>
              <td>0.95</td>
              <td>Bottleneck cost increases</td>
            </tr>
            <tr>
              <td>
                Random
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mi>U</mml:mi>
                      <mml:mrow>
                        <mml:mo>[</mml:mo>
                        <mml:mrow>
                          <mml:mn>0.8</mml:mn>
                          <mml:mo>,</mml:mo>
                          <mml:mn>1.2</mml:mn>
                        </mml:mrow>
                        <mml:mo>]</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
                (×50)
              </td>
              <td>2 ± 0</td>
              <td>0.88 ± 0.02</td>
              <td>Stable under moderate noise</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p>The main results use a fixed decision threshold of <inline-formula><mml:math><mml:mrow><mml:mi> τ </mml:mi><mml:mo> = </mml:mo><mml:mn> 0.5 </mml:mn></mml:mrow></mml:math></inline-formula> for all models. To verify that robustness conclusions are not conflated with cross-model calibration differences, we re-evaluate median MEC and RCI<sub>3</sub> under thresholds <inline-formula><mml:math><mml:mrow><mml:mi> τ </mml:mi><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 0.3 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.4 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.5 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.6 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.7 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>, redefining <inline-graphic id="fig26" xlink:href="https://html.scirp.org/file/7801221-rId320.svg?20260427102845" /> at each threshold. We additionally compare models at a matched operating point where thresholds are adjusted per model to achieve phishing TPR ≈ 0.95.</p>
      <p><bold>Table 12</bold> reports results for Full/base across all four models. Median MEC is 2 for <inline-formula><mml:math><mml:mrow><mml:mi> τ </mml:mi><mml:mo> ∈ </mml:mo><mml:mrow><mml:mo> { </mml:mo><mml:mrow><mml:mn> 0.4 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.5 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.6 </mml:mn><mml:mo> , </mml:mo><mml:mn> 0.7 </mml:mn></mml:mrow><mml:mo> } </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> across all models. At <inline-formula><mml:math><mml:mrow><mml:mi> τ </mml:mi><mml:mo> = </mml:mo><mml:mn> 0.3 </mml:mn></mml:mrow></mml:math></inline-formula>, conditioning on high-confidence phishing detections shifts the median to 3 for all models, consistent with the cost floor: these instances tend to be farther from the decision boundary, and a single cheap transition is insufficient for a larger fraction of them. At the matched TPR ≈ 0.95 operating point, median MEC remains 2 for all models. RCI<sub>3</sub> is stable across thresholds (range 0.80–0.97). These results confirm that the robustness convergence is not an artifact of threshold choice.</p>
      <p><bold>Table 12</bold><bold>.</bold> Median MEC under varying classification thresholds (Full/base). Matched TPR row adjusts each model’s threshold to achieve phishing TPR ≈ 0.95.</p>
      <table-wrap id="tbl12">
        <label>Table 12</label>
        <table>
          <tbody>
            <tr>
              <td>
                <inline-formula>
                  <mml:math>
                    <mml:mi>τ</mml:mi>
                  </mml:math>
                </inline-formula>
              </td>
              <td>LR</td>
              <td>RF</td>
              <td>GBDT</td>
              <td>XGB</td>
            </tr>
            <tr>
              <td>0.30</td>
              <td>3</td>
              <td>3</td>
              <td>3</td>
              <td>3</td>
            </tr>
            <tr>
              <td>0.40</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
            </tr>
            <tr>
              <td>0.50</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
            </tr>
            <tr>
              <td>0.60</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
            </tr>
            <tr>
              <td>0.70</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
            </tr>
            <tr>
              <td>Matched TPR ≈ 0.95</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
              <td>2</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
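      <p>The matched operating point can be illustrated with a minimal sketch that picks, for each model, the largest threshold at which at least 95% of phishing instances are still flagged. The sketch assumes that a page is flagged when its phishing score is at least <monospace>tau</monospace>; the helper names <monospace>threshold_for_tpr</monospace> and <monospace>tpr_at</monospace>, the model names, and the synthetic beta-distributed scores are hypothetical and are not taken from the experiments reported in this article.</p>
      <code language="python"><![CDATA[
```python
import math
import random


def threshold_for_tpr(phish_scores, target_tpr=0.95):
    """Return the largest threshold that flags at least target_tpr of phishing scores.

    Assumption: an instance is flagged as phishing when score >= threshold.
    """
    ranked = sorted(phish_scores, reverse=True)
    # Number of phishing instances that must remain flagged.
    k = math.ceil(target_tpr * len(ranked))
    # Using the k-th largest score as threshold flags at least k instances.
    return ranked[k - 1]


def tpr_at(phish_scores, tau):
    """Fraction of phishing instances whose score is at least tau."""
    return sum(s >= tau for s in phish_scores) / len(phish_scores)


# Synthetic scores for two hypothetical models with different calibration.
rng = random.Random(0)
model_scores = {
    "model_a": [rng.betavariate(5, 1) for _ in range(1000)],
    "model_b": [rng.betavariate(2, 1) for _ in range(1000)],
}

# One threshold per model, each chosen to reach the same phishing TPR.
matched = {name: threshold_for_tpr(s) for name, s in model_scores.items()}
matched_tpr = {name: tpr_at(model_scores[name], tau) for name, tau in matched.items()}
fixed_tpr = {name: tpr_at(s, 0.5) for name, s in model_scores.items()}
```
]]></code>
      <p>In this toy setting the two models reach different TPR values at the fixed threshold of 0.5 but the same TPR at their matched thresholds, which is the kind of calibration difference the matched row of <bold>Table 12</bold> is meant to control for.</p>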
      <p>These experiments reveal two conditions. In the cost-floor condition, MEC quantiles scale with the cheapest admissible transition and architecture invariance holds. In the path-removal condition, prohibiting dominant transitions induces infeasible mass without shifting the cost distribution among evadable instances. The strict schedule operates in the latter condition for RA-8, producing robustness gains through blocked feasibility.</p>
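      <p>The two conditions can be made concrete on a toy example. The following sketch is not the MEC procedure used in the experiments; it assumes a hypothetical three-feature linear scorer that flags a page when its score is negative, ternary feature values in {−1, 0, 1}, and a uniform-cost search over monotone transitions. The function names <monospace>min_evasion_cost</monospace> and <monospace>scaled</monospace>, the weights, and the instances are illustrative assumptions; only the surface and infrastructure transition costs follow the base schedule.</p>
      <code language="python"><![CDATA[
```python
import heapq
import math

# Base-schedule transition costs, keyed by (old_value, new_value).
BASE = {
    "surface": {(-1, 0): 1, (-1, 1): 2, (0, 1): 1},
    "infra": {(-1, 0): 4, (-1, 1): 8, (0, 1): 4},
}


def scaled(costs, group, factor):
    """Copy a cost schedule with one group's costs multiplied by factor."""
    out = {g: dict(c) for g, c in costs.items()}
    out[group] = {k: v * factor for k, v in costs[group].items()}
    return out


def min_evasion_cost(x, groups, weights, bias, costs):
    """Cheapest total cost of monotone edits that make the toy scorer miss x.

    Assumption: the scorer flags an instance when w.x + bias < 0.
    Returns math.inf when no admissible sequence of edits evades detection.
    """
    def detected(state):
        return sum(w * v for w, v in zip(weights, state)) + bias < 0

    start = tuple(x)
    best = {start: 0.0}
    frontier = [(0.0, start)]
    while frontier:
        cost, state = heapq.heappop(frontier)
        if cost > best.get(state, math.inf):
            continue  # stale queue entry
        if not detected(state):
            return cost
        for i, value in enumerate(state):
            for target in (0, 1):
                if target <= value:
                    continue  # monotone edits only
                step = costs[groups[i]][(value, target)]
                if math.isinf(step):
                    continue  # prohibited transition
                nxt = state[:i] + (target,) + state[i + 1:]
                new_cost = cost + step
                if new_cost < best.get(nxt, math.inf):
                    best[nxt] = new_cost
                    heapq.heappush(frontier, (new_cost, nxt))
    return math.inf


GROUPS = ("surface", "surface", "infra")

# Cost-floor condition: the optimum tracks the scaled surface cost.
floor_mecs = [
    min_evasion_cost((-1, -1, -1), GROUPS, (3, 1, 2), 2, scaled(BASE, "surface", lam))
    for lam in (1, 2, 3, 4)
]

# Path-removal condition: prohibiting infrastructure moves to 1 blocks the only path.
STRICT = scaled(BASE, "surface", 1)
STRICT["infra"][(-1, 1)] = math.inf
STRICT["infra"][(0, 1)] = math.inf
base_mec = min_evasion_cost((1, 1, -1), GROUPS, (1, 1, 5), -3, BASE)
strict_mec = min_evasion_cost((1, 1, -1), GROUPS, (1, 1, 5), -3, STRICT)
```
]]></code>
      <p>For the first toy instance the minimal cost grows linearly with the surface multiplier, while for the second instance the strict schedule does not raise the cost but makes evasion infeasible, mirroring the distinction between the cost-floor and path-removal conditions.</p>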
    </sec>
    <sec id="sec6">
      <title>6. Discussion</title>
      <p>Across all tested feature sets, cost schedules, and model families, robustness is governed by the cheapest admissible manipulation that remains available. The median MEC follows the effective cost floor across all configurations, rendering Proposition 3.1 empirically tight. When a low-cost transition suffices for a nontrivial fraction of correctly detected instances, architectural complexity does not move the median. This action-set-limited invariance means that linear models, bagging ensembles, and boosting methods converge to the same robustness ceiling.</p>
      <p>The implication is a shift in defensive emphasis from model selection to representation design and attacker economics. A feature may be highly predictive under i.i.d. evaluation yet operationally brittle if it is inexpensive to edit. The RA-8 configuration illustrates this: although it prioritizes infrastructure-leaning signals, retaining a single low-cost coordinate (SSLfinal_State) creates a bottleneck through which nearly all optimal evasions pass. Cost schedules improve robustness only when they eliminate dominant cheap paths, producing infeasible mass rather than uniformly higher evasion costs. Meaningful robustness gains require removing or economically disabling low-cost transitions and anchoring detection on signals whose manipulation costs exceed realistic attacker budgets, even at the expense of i.i.d. accuracy.</p>
      <p>The sanitization-only threat model constitutes a lower bound. Relaxing monotonicity, by allowing anti-feature injection (adding benign-looking HTML artifacts to boost legitimacy scores) or extractor-level manipulation (crafting raw pages to flip computed features without semantic change [<xref ref-type="bibr" rid="B13">13</xref>][<xref ref-type="bibr" rid="B17">17</xref>]), enlarges the feasible action set. The cost floor can only decrease or remain unchanged, since every monotone path remains available. Concentration may increase if newly available non-monotone transitions converge on a small set of vulnerable coordinates, or shift to different features if injected anti-features provide cheaper evasion than indicator removal. The infeasible mass observed under the strict schedule would likely shrink or vanish, as non-monotone paths can bypass blocked transitions. Formalizing these effects requires specifying non-monotone cost structures and is left to future work, but the qualitative conclusion is reinforced: the monotone analysis provides a conservative bound on attacker capability.</p>
      <p><bold>Limitations and external validity.</bold> The UCI Phishing Websites benchmark [<xref ref-type="bibr" rid="B19">19</xref>] is a standard reference point but is dated: it uses a fixed, hand-engineered vocabulary that omits modern signals, including certificate-transparency logs, visual similarity [<xref ref-type="bibr" rid="B9">9</xref>], JavaScript behavioral fingerprints [<xref ref-type="bibr" rid="B20">20</xref>], and infrastructure patterns in contemporary kit-based campaigns [<xref ref-type="bibr" rid="B10">10</xref>][<xref ref-type="bibr" rid="B11">11</xref>]. Quantitative transfer to modern settings requires re-validation on current datasets, mapping contemporary features to a cost schedule via the time-to-effect principle, and verifying whether low-cost transitions continue to dominate MEC.</p>
      <p>Several structural conclusions are nevertheless independent of the dataset choice. The surface-versus-infrastructure cost asymmetry is an economic regularity: presentation-layer signals are cheaper to manipulate than infrastructure-coupled signals, regardless of the specific feature dictionary [<xref ref-type="bibr" rid="B8">8</xref>][<xref ref-type="bibr" rid="B18">18</xref>]. Proposition 3.1 is a property of the action set and cost model, not the dataset; it applies whenever a nontrivial fraction of instances admit single-transition evasion at minimal cost. Concentration follows from heterogeneous costs interacting with feature influence, a generic property in discrete domains with uneven manipulation friction.</p>
      <p>Our MEC computation assumes unconstrained black-box label access. <bold>Table 9</bold> shows that reasonable query budgets reduce attacker efficiency without altering feasibility patterns, but production systems with aggressive rate-limiting can increase observed survival. The cost schedule represents dimensionless operational friction calibrated by the time-to-effect principle rather than direct monetary expenditure; translating to market-level budgets remains an open empirical problem.</p>
    </sec>
    <sec id="sec7">
      <title>7. Conclusion</title>
      <p>Near-perfect held-out accuracy does not imply deployment security when evasion is cheap. Across all model architectures, feature configurations, and cost schedules studied here, robustness is determined by the minimum manipulation cost available to the attacker, not by classifier complexity. Feature economics dominate adversarial robustness under cost-constrained post-deployment manipulation: the central obstacle to robustness is the continued availability of low-cost transitions, and architecture choice cannot compensate for this structural exposure. Effective defense requires either removing cheap-to-edit features from the detection vocabulary or raising their manipulation cost through verifiable infrastructure anchoring—even at some sacrifice in i.i.d. accuracy.</p>
    </sec>
    <sec id="sec8">
      <title>Acknowledgements</title>
      <p>This work was supported by the U.S. Department of Education under grant number P382G240006. The authors thank the anonymous reviewers for their helpful suggestions and careful reading of the manuscript.</p>
    </sec>
    <sec id="sec9">
      <title>Appendix: Feature-to-Cost-Group Mapping</title>
      <p><bold>Table A1</bold> maps all 30 UCI Phishing Websites features to their cost group assignment under the base schedule, along with the transition costs and a one-line time-to-effect rationale. The assignment is governed by the time-to-effect principle: surface features require at most one day to modify under campaign-operational conditions; semi-domain features require days to weeks; infrastructure features require weeks to months or are effectively infeasible within a typical campaign window. This table is provided to make the cost calibration fully reproducible.</p>
      <p><bold>Table A1</bold><bold>.</bold> Complete feature-to-cost-group mapping for the UCI Phishing Websites dataset (base schedule). Transition costs follow <bold>Table 1</bold>. The strict schedule sets infrastructure <inline-formula><mml:math><mml:mrow><mml:mo> − </mml:mo><mml:mn> 1 </mml:mn><mml:mo> → </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math><mml:mrow><mml:mn> 0 </mml:mn><mml:mo> → </mml:mo><mml:mn> 1 </mml:mn></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math><mml:mi> ∞ </mml:mi></mml:math></inline-formula> .</p>
      <table-wrap id="tbl13">
        <label>Table A1</label>
        <table>
          <tbody>
            <tr>
              <td>Feature</td>
              <td>Group</td>
              <td>
                Costs (
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mo>−</mml:mo>
                      <mml:mn>1</mml:mn>
                      <mml:mo>→</mml:mo>
                      <mml:mn>0</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
                ,
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mo>−</mml:mo>
                      <mml:mn>1</mml:mn>
                      <mml:mo>→</mml:mo>
                      <mml:mn>1</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
                ,
                <inline-formula>
                  <mml:math>
                    <mml:mrow>
                      <mml:mn>0</mml:mn>
                      <mml:mo>→</mml:mo>
                      <mml:mn>1</mml:mn>
                    </mml:mrow>
                  </mml:math>
                </inline-formula>
                )
              </td>
              <td>Time-to-effect rationale</td>
            </tr>
            <tr>
              <td>having_IP_Address</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Replace IP with registered domain in minutes</td>
            </tr>
            <tr>
              <td>URL_Length</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Shorten URL string in minutes</td>
            </tr>
            <tr>
              <td>Shortining_Service</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove or replace URL shortener within hours</td>
            </tr>
            <tr>
              <td>having_At_Sign</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Drop @ from URL string immediately</td>
            </tr>
            <tr>
              <td>double_slash_redirecting</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Correct redirect path within hours</td>
            </tr>
            <tr>
              <td>Prefix_Suffix</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove hyphen from domain string within hours</td>
            </tr>
            <tr>
              <td>having_Sub_Domain</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Adjust subdomain structure within hours</td>
            </tr>
            <tr>
              <td>SSLfinal_State</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Deploy free DV certificate (e.g., Let’s Encrypt) in under a day</td>
            </tr>
            <tr>
              <td>Favicon</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Replace favicon file on server within hours</td>
            </tr>
            <tr>
              <td>port</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Configure server to use standard port within hours</td>
            </tr>
            <tr>
              <td>HTTPS_token</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove “https” string from URL immediately</td>
            </tr>
            <tr>
              <td>Request_URL</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Adjust resource request paths within hours</td>
            </tr>
            <tr>
              <td>URL_of_Anchor</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Modify anchor href attributes in HTML within hours</td>
            </tr>
            <tr>
              <td>Links_in_tags</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Update embedded link tags in HTML within hours</td>
            </tr>
            <tr>
              <td>SFH</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Modify HTML form action to legitimate endpoint within hours</td>
            </tr>
            <tr>
              <td>Submitting_to_email</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove mailto: form action within hours</td>
            </tr>
            <tr>
              <td>Abnormal_URL</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Normalize URL structure relative to hostname within hours</td>
            </tr>
            <tr>
              <td>Redirect</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Adjust server-side redirect count within hours</td>
            </tr>
            <tr>
              <td>on_mouseover</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove or rewrite JavaScript mouseover handler within hours</td>
            </tr>
            <tr>
              <td>RightClick</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove JavaScript right-click disabler within hours</td>
            </tr>
            <tr>
              <td>popUpWidnow</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove pop-up window JavaScript within hours</td>
            </tr>
            <tr>
              <td>Iframe</td>
              <td>Surface</td>
              <td>1, 2, 1</td>
              <td>Remove iframe elements from HTML within hours</td>
            </tr>
            <tr>
              <td>Domain_registeration_length</td>
              <td>Semi-domain</td>
              <td>3, 6, 3</td>
              <td>Extend domain registration; requires payment and propagation over days</td>
            </tr>
            <tr>
              <td>Google_Index</td>
              <td>Semi-domain</td>
              <td>3, 6, 3</td>
              <td>Submit sitemap; indexing typically takes days to a week</td>
            </tr>
            <tr>
              <td>Links_pointing_to_page</td>
              <td>Semi-domain</td>
              <td>3, 6, 3</td>
              <td>Accumulate inbound links; requires days of coordination</td>
            </tr>
            <tr>
              <td>Statistical_report</td>
              <td>Semi-domain</td>
              <td>3, 6, 3</td>
              <td>Clear entry from phishing databases; dispute takes days</td>
            </tr>
            <tr>
              <td>age_of_domain</td>
              <td>Infrastructure</td>
              <td>4, 8, 4</td>
              <td>Domain age cannot be accelerated; accrues over months</td>
            </tr>
            <tr>
              <td>DNSRecord</td>
              <td>Infrastructure</td>
              <td>4, 8, 4</td>
              <td>DNS reputation requires weeks to establish</td>
            </tr>
            <tr>
              <td>web_traffic</td>
              <td>Infrastructure</td>
              <td>4, 8, 4</td>
              <td>Organic traffic accumulation requires weeks to months</td>
            </tr>
            <tr>
              <td>Page_Rank</td>
              <td>Infrastructure</td>
              <td>4, 8, 4</td>
              <td>PageRank accrues over months via link building</td>
            </tr>
          </tbody>
        </table>
      </table-wrap>
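      <p>For reuse, the mapping in <bold>Table A1</bold> can be encoded as a small lookup structure. The sketch below is one possible encoding rather than the code used in the study: the names <monospace>GROUP_COSTS</monospace>, <monospace>FEATURES_BY_GROUP</monospace>, and <monospace>transition_cost</monospace> are hypothetical, and treating non-monotone moves as having infinite cost is an assumption that reflects the sanitization-only threat model.</p>
      <code language="python"><![CDATA[
```python
import math

# Base-schedule transition costs per group, keyed by (old_value, new_value).
GROUP_COSTS = {
    "surface": {(-1, 0): 1, (-1, 1): 2, (0, 1): 1},
    "semi-domain": {(-1, 0): 3, (-1, 1): 6, (0, 1): 3},
    "infrastructure": {(-1, 0): 4, (-1, 1): 8, (0, 1): 4},
}

# Feature names are spelled as in the UCI Phishing Websites dataset.
FEATURES_BY_GROUP = {
    "surface": [
        "having_IP_Address", "URL_Length", "Shortining_Service", "having_At_Sign",
        "double_slash_redirecting", "Prefix_Suffix", "having_Sub_Domain",
        "SSLfinal_State", "Favicon", "port", "HTTPS_token", "Request_URL",
        "URL_of_Anchor", "Links_in_tags", "SFH", "Submitting_to_email",
        "Abnormal_URL", "Redirect", "on_mouseover", "RightClick",
        "popUpWidnow", "Iframe",
    ],
    "semi-domain": [
        "Domain_registeration_length", "Google_Index",
        "Links_pointing_to_page", "Statistical_report",
    ],
    "infrastructure": ["age_of_domain", "DNSRecord", "web_traffic", "Page_Rank"],
}

# Inverted index: feature name -> cost group.
FEATURE_GROUP = {f: g for g, fs in FEATURES_BY_GROUP.items() for f in fs}


def transition_cost(feature, old, new, strict=False):
    """Cost of moving one feature from old to new under the base or strict schedule."""
    if old == new:
        return 0
    group = FEATURE_GROUP[feature]
    if (old, new) not in GROUP_COSTS[group]:
        # Assumption: moves that lower a feature value are not admissible.
        return math.inf
    if strict and group == "infrastructure" and new == 1:
        # Strict schedule: infrastructure -1 -> 1 and 0 -> 1 are prohibited.
        return math.inf
    return GROUP_COSTS[group][(old, new)]
```
]]></code>
      <p>With this encoding, <monospace>transition_cost("SSLfinal_State", -1, 1)</monospace> gives the surface cost of 2, whereas <monospace>transition_cost("age_of_domain", -1, 1, strict=True)</monospace> is infinite.</p>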
    </sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      <ref id="B1">
        <label>1.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Basit, A., Zafar, M., Liu, X., Javed, A.R., Jalil, Z. and Kifayat, K. (2020) A Comprehensive Survey of AI-Enabled Phishing Attacks Detection Techniques. <italic>Telecommunication Systems</italic>, 76, 139-154. https://doi.org/10.1007/s11235-020-00733-2 <pub-id pub-id-type="doi">10.1007/s11235-020-00733-2</pub-id><pub-id pub-id-type="pmid">33110340</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11235-020-00733-2">https://doi.org/10.1007/s11235-020-00733-2</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Basit, A.</string-name>
              <string-name>Zafar, M.</string-name>
              <string-name>Liu, X.</string-name>
              <string-name>Javed, A.R.</string-name>
              <string-name>Jalil, Z.</string-name>
              <string-name>Kifayat, K.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>A Comprehensive Survey of AI-Enabled Phishing Attacks Detection Techniques</article-title>
            <source>Telecommunication Systems</source>
            <volume>76</volume>
            <pub-id pub-id-type="doi">10.1007/s11235-020-00733-2</pub-id>
            <pub-id pub-id-type="pmid">33110340</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B2">
        <label>2.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Mohammad, R.M., Thabtah, F. and McCluskey, L. (2014) Predicting Phishing Websites Based on Self-Structuring Neural Network. <italic>Neural Computing and Applications</italic>, 25, 443-458. https://doi.org/10.1007/s00521-013-1490-z <pub-id pub-id-type="doi">10.1007/s00521-013-1490-z</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00521-013-1490-z">https://doi.org/10.1007/s00521-013-1490-z</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Mohammad, R.M.</string-name>
              <string-name>Thabtah, F.</string-name>
              <string-name>McCluskey, L.</string-name>
            </person-group>
            <year>2014</year>
            <article-title>Predicting Phishing Websites Based on Self-Structuring Neural Network</article-title>
            <source>Neural Computing and Applications</source>
            <volume>25</volume>
            <pub-id pub-id-type="doi">10.1007/s00521-013-1490-z</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B3">
        <label>3.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Sahingoz, O.K., Buber, E., Demir, O. and Diri, B. (2019) Machine Learning Based Phishing Detection from URLs. <italic>Expert Systems with Applications</italic>, 117, 345-357. https://doi.org/10.1016/j.eswa.2018.09.029 <pub-id pub-id-type="doi">10.1016/j.eswa.2018.09.029</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.eswa.2018.09.029">https://doi.org/10.1016/j.eswa.2018.09.029</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Sahingoz, O.K.</string-name>
              <string-name>Buber, E.</string-name>
              <string-name>Demir, O.</string-name>
              <string-name>Diri, B.</string-name>
            </person-group>
            <year>2019</year>
            <article-title>Machine Learning Based Phishing Detection from URLs</article-title>
            <source>Expert Systems with Applications</source>
            <volume>117</volume>
            <pub-id pub-id-type="doi">10.1016/j.eswa.2018.09.029</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B4">
        <label>4.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Do, N.Q., Selamat, A., Krejcar, O., Herrera-Viedma, E. and Fujita, H. (2022) Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions. <italic>IEEE Access</italic>, 10, 36429-36463. https://doi.org/10.1109/access.2022.3151903 <pub-id pub-id-type="doi">10.1109/access.2022.3151903</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/access.2022.3151903">https://doi.org/10.1109/access.2022.3151903</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Do, N.Q.</string-name>
              <string-name>Selamat, A.</string-name>
              <string-name>Krejcar, O.</string-name>
              <string-name>Herrera-Viedma, E.</string-name>
              <string-name>Fujita, H.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions</article-title>
            <source>IEEE Access</source>
            <volume>10</volume>
            <pub-id pub-id-type="doi">10.1109/access.2022.3151903</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B5">
        <label>5.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Biggio, B. and Roli, F. (2018) Wild Patterns: Ten Years after the Rise of Adversarial Machine Learning. <italic>Proceedings of the</italic> 2018 <italic>ACM SIGSAC Conference on Computer and Communications Security</italic>, Toronto, 15-19 October 2018, 2154-2156. https://doi.org/10.1145/3243734.3264418 <pub-id pub-id-type="doi">10.1145/3243734.3264418</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/3243734.3264418">https://doi.org/10.1145/3243734.3264418</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Biggio, B.</string-name>
              <string-name>Roli, F.</string-name>
            </person-group>
            <year>2018</year>
            <article-title>Wild Patterns: Ten Years after the Rise of Adversarial Machine Learning</article-title>
            <source>Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security</source>
            <pub-id pub-id-type="doi">10.1145/3243734.3264418</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B6">
        <label>6.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Apruzzese, G., Anderson, H.S., Dambra, S., Freeman, D., Pierazzi, F. and Roundy, K. (2023) “Real Attackers Don’t Compute Gradients”: Bridging the Gap between Adversarial ML Research and Practice. 2023 <italic>IEEE Conference on Secure and Trustworthy Machine Learning</italic> (<italic>SaTML</italic>), Raleigh, 8-10 February 2023, 339-364. https://doi.org/10.1109/satml54575.2023.00031 <pub-id pub-id-type="doi">10.1109/satml54575.2023.00031</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/satml54575.2023.00031">https://doi.org/10.1109/satml54575.2023.00031</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Apruzzese, G.</string-name>
              <string-name>Anderson, H.S.</string-name>
              <string-name>Dambra, S.</string-name>
              <string-name>Freeman, D.</string-name>
              <string-name>Pierazzi, F.</string-name>
              <string-name>Roundy, K.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>“Real Attackers Don’t Compute Gradients”: Bridging the Gap between Adversarial ML Research and Practice</article-title>
            <source>2023 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)</source>
            <pub-id pub-id-type="doi">10.1109/satml54575.2023.00031</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B7">
        <label>7.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Biggio, B., Corona, I., Maiorca, D., Nelson, B., Šrndić, N., Laskov, P., <italic>et al</italic>. (2013) Evasion Attacks against Machine Learning at Test Time. In: <italic>Lecture Notes in Computer Science</italic>, Springer, 387-402. https://doi.org/10.1007/978-3-642-40994-3_25 <pub-id pub-id-type="doi">10.1007/978-3-642-40994-3_25</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-3-642-40994-3_25">https://doi.org/10.1007/978-3-642-40994-3_25</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Biggio, B.</string-name>
              <string-name>Corona, I.</string-name>
              <string-name>Maiorca, D.</string-name>
              <string-name>Nelson, B.</string-name>
              <string-name>Šrndić, N.</string-name>
              <string-name>Laskov, P.</string-name>
              <etal/>
            </person-group>
            <year>2013</year>
            <article-title>Evasion Attacks against Machine Learning at Test Time</article-title>
            <source>Lecture Notes in Computer Science</source>
            <fpage>387</fpage>
            <lpage>402</lpage>
            <pub-id pub-id-type="doi">10.1007/978-3-642-40994-3_25</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B8">
        <label>8.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Khonji, M., Iraqi, Y. and Jones, A. (2013) Phishing Detection: A Literature Survey. <italic>IEEE Communications Surveys &amp; Tutorials</italic>, 15, 2091-2121. https://doi.org/10.1109/surv.2013.032213.00009 <pub-id pub-id-type="doi">10.1109/surv.2013.032213.00009</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/surv.2013.032213.00009">https://doi.org/10.1109/surv.2013.032213.00009</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Khonji, M.</string-name>
              <string-name>Iraqi, Y.</string-name>
              <string-name>Jones, A.</string-name>
            </person-group>
            <year>2013</year>
            <article-title>Phishing Detection: A Literature Survey</article-title>
            <source>IEEE Communications Surveys &amp; Tutorials</source>
            <volume>15</volume>
            <pub-id pub-id-type="doi">10.1109/surv.2013.032213.00009</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B9">
        <label>9.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Lin, Y., Liu, R., Divakaran, D.M., Ng, J.Y., <italic>et al</italic>. (2021) Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages. <italic>Proceedings of the</italic> 30<italic>th USENIX Security Symposium</italic>, Vancouver, 11-13 August 2021, 3793-3810.</mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Lin, Y.</string-name>
              <string-name>Liu, R.</string-name>
              <string-name>Divakaran, D.M.</string-name>
              <string-name>Ng, J.Y.</string-name>
              <etal/>
            </person-group>
            <year>2021</year>
            <article-title>Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages</article-title>
            <source>Proceedings of the 30th USENIX Security Symposium</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B10">
        <label>10.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Oest, A., Safaei, Y., Doupé, A., Ahn, G.J., <italic>et al</italic>. (2020) Sunrise to Sunset: Analyzing the End-to-End Life Cycle and Effectiveness of Phishing Attacks at Scale. 2020 29<italic>th USENIX Security Symposium</italic>, Boston, 12-14 August 2020, 361-377.</mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Oest, A.</string-name>
              <string-name>Safaei, Y.</string-name>
              <string-name>Doupé, A.</string-name>
              <string-name>Ahn, G.J.</string-name>
              <etal/>
            </person-group>
            <year>2020</year>
            <article-title>Sunrise to Sunset: Analyzing the End-to-End Life Cycle and Effectiveness of Phishing Attacks at Scale</article-title>
            <source>2020 29th USENIX Security Symposium</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B11">
        <label>11.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Bijmans, P.H., Booij, T.M. and van Eeten, M. (2021) Catching Phishers by Their Bait: Investigating the Dutch Phishing Landscape through Phishing Kit Analysis. <italic>Proceedings of the</italic> 30<italic>th USENIX Security Symposium</italic>, Vancouver, 11-13 August 2021, 3757-3774.</mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Bijmans, P.H.</string-name>
              <string-name>Booij, T.M.</string-name>
              <string-name>van Eeten, M.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Catching Phishers by Their Bait: Investigating the Dutch Phishing Landscape through Phishing Kit Analysis</article-title>
            <source>Proceedings of the 30th USENIX Security Symposium</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B12">
        <label>12.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Ruan, W.J., Yi, X.P. and Huang, X.W. (2021) Adversarial Robustness of Deep Learning: Theory, Algorithms, and Applications. <italic>Proceedings of the</italic> 30<italic>th ACM International Conference on Information &amp; Knowledge Management</italic>, Queensland, 1-5 November 2021, 4866-4869.</mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Ruan, W.J.</string-name>
              <string-name>Yi, X.P.</string-name>
              <string-name>Huang, X.W.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Adversarial Robustness of Deep Learning: Theory, Algorithms, and Applications</article-title>
            <source>Proceedings of the 30th ACM International Conference on Information &amp; Knowledge Management</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B13">
        <label>13.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Pierazzi, F., Pendlebury, F., Cortellazzi, J. and Cavallaro, L. (2020) Intriguing Properties of Adversarial ML Attacks in the Problem Space. 2020 <italic>IEEE Symposium on Security and Privacy</italic> (<italic>SP</italic>), San Francisco, 18-21 May 2020, 1332-1349. https://doi.org/10.1109/sp40000.2020.00073 <pub-id pub-id-type="doi">10.1109/sp40000.2020.00073</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/sp40000.2020.00073">https://doi.org/10.1109/sp40000.2020.00073</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Pierazzi, F.</string-name>
              <string-name>Pendlebury, F.</string-name>
              <string-name>Cortellazzi, J.</string-name>
              <string-name>Cavallaro, L.</string-name>
            </person-group>
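            <year>2020</year>
            <article-title>Intriguing Properties of Adversarial ML Attacks in the Problem Space</article-title>
            <source>2020 IEEE Symposium on Security and Privacy (SP)</source>
            <pub-id pub-id-type="doi">10.1109/sp40000.2020.00073</pub-id>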
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B14">
        <label>14.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Carlini, N. and Wagner, D. (2017) Towards Evaluating the Robustness of Neural Networks. 2017 <italic>IEEE Symposium on Security and Privacy</italic> (<italic>SP</italic>), San Jose, 22-26 May 2017, 39-57. https://doi.org/10.1109/sp.2017.49 <pub-id pub-id-type="doi">10.1109/sp.2017.49</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/sp.2017.49">https://doi.org/10.1109/sp.2017.49</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Carlini, N.</string-name>
              <string-name>Wagner, D.</string-name>
            </person-group>
            <year>2017</year>
            <article-title>Towards Evaluating the Robustness of Neural Networks</article-title>
            <source>2017 IEEE Symposium on Security and Privacy (SP)</source>
            <pub-id pub-id-type="doi">10.1109/sp.2017.49</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B15">
        <label>15.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Goodfellow, I.J., Shlens, J. and Szegedy, C. (2015) Explaining and Harnessing Adversarial Examples. 2015 3<italic>rd International Conference on Learning Representations</italic>, San Diego, 7-9 May 2015, 11 p.</mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Goodfellow, I.J.</string-name>
              <string-name>Shlens, J.</string-name>
              <string-name>Szegedy, C.</string-name>
            </person-group>
            <year>2015</year>
            <article-title>Explaining and Harnessing Adversarial Examples</article-title>
            <source>2015 3rd International Conference on Learning Representations</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B16">
        <label>16.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Corona, I., Biggio, B., Contini, M., Piras, L., Corda, R., Mereu, M., <italic>et al</italic>. (2017) DeltaPhish: Detecting Phishing Webpages in Compromised Websites. In: <italic>Lecture Notes in Computer Science</italic>, Springer, 370-388. https://doi.org/10.1007/978-3-319-66402-6_22 <pub-id pub-id-type="doi">10.1007/978-3-319-66402-6_22</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-3-319-66402-6_22">https://doi.org/10.1007/978-3-319-66402-6_22</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Corona, I.</string-name>
              <string-name>Biggio, B.</string-name>
              <string-name>Contini, M.</string-name>
              <string-name>Piras, L.</string-name>
              <string-name>Corda, R.</string-name>
              <string-name>Mereu, M.</string-name>
              <etal/>
            </person-group>
            <year>2017</year>
            <article-title>DeltaPhish: Detecting Phishing Webpages in Compromised Websites</article-title>
            <source>Lecture Notes in Computer Science</source>
            <fpage>370</fpage>
            <lpage>388</lpage>
            <pub-id pub-id-type="doi">10.1007/978-3-319-66402-6_22</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B17">
        <label>17.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Xu, W., Qi, Y. and Evans, D. (2016) Automatically Evading Classifiers: A Case Study on PDF Malware Classifiers. <italic>Proceedings</italic> 2016 <italic>Network and Distributed System Security Symposium</italic>, San Diego, 21-24 February 2016, 1-15. https://doi.org/10.14722/ndss.2016.23115 <pub-id pub-id-type="doi">10.14722/ndss.2016.23115</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.14722/ndss.2016.23115">https://doi.org/10.14722/ndss.2016.23115</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Xu, W.</string-name>
              <string-name>Qi, Y.</string-name>
              <string-name>Evans, D.</string-name>
            </person-group>
            <year>2016</year>
            <article-title>Automatically Evading Classifiers: A Case Study on PDF Malware Classifiers</article-title>
            <source>Proceedings 2016 Network and Distributed System Security Symposium</source>
            <pub-id pub-id-type="doi">10.14722/ndss.2016.23115</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B18">
        <label>18.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Das, A., Baki, S., El Aassal, A., Verma, R. and Dunbar, A. (2020) SoK: A Comprehensive Reexamination of Phishing Research from the Security Perspective. <italic>IEEE Communications Surveys &amp; Tutorials</italic>, 22, 671-708. https://doi.org/10.1109/comst.2019.2957750 <pub-id pub-id-type="doi">10.1109/comst.2019.2957750</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/comst.2019.2957750">https://doi.org/10.1109/comst.2019.2957750</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Das, A.</string-name>
              <string-name>Baki, S.</string-name>
              <string-name>El Aassal, A.</string-name>
              <string-name>Verma, R.</string-name>
              <string-name>Dunbar, A.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>SoK: A Comprehensive Reexamination of Phishing Research from the Security Perspective</article-title>
            <source>IEEE Communications Surveys &amp; Tutorials</source>
            <volume>22</volume>
            <pub-id pub-id-type="doi">10.1109/comst.2019.2957750</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B19">
        <label>19.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Mohammad, R.M., Thabtah, F. and McCluskey, L. (2015) Phishing Websites Data Set. UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/phishing+websites</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Mohammad, R.M.</string-name>
              <string-name>Thabtah, F.</string-name>
              <string-name>McCluskey, L.</string-name>
            </person-group>
            <year>2015</year>
            <article-title>Phishing Websites Data Set</article-title>
            <source>UCI Machine Learning Repository</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B20">
        <label>20.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Rao, R.S., Vaishnavi, T. and Pais, A.R. (2020) CatchPhish: Detection of Phishing Websites by Inspecting URLs. <italic>Journal of Ambient Intelligence and Humanized Computing</italic>, 11, 813-825. https://doi.org/10.1007/s12652-019-01311-4 <pub-id pub-id-type="doi">10.1007/s12652-019-01311-4</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s12652-019-01311-4">https://doi.org/10.1007/s12652-019-01311-4</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Rao, R.S.</string-name>
              <string-name>Vaishnavi, T.</string-name>
              <string-name>Pais, A.R.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>CatchPhish: Detection of Phishing Websites by Inspecting URLs</article-title>
            <source>Journal of Ambient Intelligence and Humanized Computing</source>
            <volume>11</volume>
            <pub-id pub-id-type="doi">10.1007/s12652-019-01311-4</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
    </ref-list>
  </back>
</article>