<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JAMP</journal-id><journal-title-group><journal-title>Journal of Applied Mathematics and Physics</journal-title></journal-title-group><issn pub-type="epub">2327-4352</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jamp.2024.124069</article-id><article-id pub-id-type="publisher-id">JAMP-132553</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Solving Neumann Boundary Problem with Kernel-Regularized Learning Approach
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xuexue</surname><given-names>Ran</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Baohuai</surname><given-names>Sheng</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Mathematics, Shaoxing University, Shaoxing, China</addr-line></aff><pub-date pub-type="epub"><day>11</day><month>04</month><year>2024</year></pub-date><volume>12</volume><issue>04</issue><fpage>1101</fpage><lpage>1125</lpage><history><date date-type="received"><day>7,</day>	<month>March</month>	<year>2024</year></date><date date-type="rev-recd"><day>16,</day>	<month>April</month>	<year>2024</year>	</date><date date-type="accepted"><day>19,</day>	<month>April</month>	<year>2024</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  We provide a kernel-regularized method to give theory solutions for Neumann boundary value problem on the unit ball. We define the reproducing kernel Hilbert space with the spherical harmonics associated with an inner product defined on both the unit ball and the unit sphere, construct the kernel-regularized learning algorithm from the view of semi-supervised learning and bound the upper bounds for the learning rates. The theory analysis shows that the learning algorithm has better uniform convergence according to the number of samples. The research can be regarded as an application of kernel-regularized semi-supervised learning.
 
</p></abstract><kwd-group><kwd>Neumann Boundary Value</kwd><kwd> Kernel-Regularized Approach</kwd><kwd> Reproducing Kernel Hilbert Space</kwd><kwd> The Unit Ball</kwd><kwd> The Unit Sphere</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>It is known that approximation theory and skills have been used to give the analytic solution for PDE with boundary value problems and form the method of fundamental solutions (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref1">1</xref>] - [<xref ref-type="bibr" rid="scirp.132553-ref6">6</xref>] ; Appendices 1-3). Recently, the kernel-based collocation method for solving several PDEs with boundary problems has been developed (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref11">11</xref>] ) from the view of minimal norm interpolation of reproducing kernel Hilbert spaces (RKHSs), the existent theorem and the representation theorem for the numerical solutions are shown qualitatively. It is suggested by [<xref ref-type="bibr" rid="scirp.132553-ref12">12</xref>] that kernel-regularized gradient learning may be used to give numerical solution for PDE. Indeed, some kernel-regularized learning algorithms have been used to study the PDE with Dirichlet boundary value problem quantitatively (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref16">16</xref>] ). For a given domain D ⊂ R d , the pairwise distinct collocation points chosen in the kernel-based collocation method are:</p><p>X D = { x 1 , ⋯ , x N } ⊂ D ,</p><p>X ∂ D = { x N + 1 , ⋯ , x N + M } ⊂ ∂ D ,</p><p>and the sample values from the PDE (where P and B are given differential operators):</p><p>{ P u = f * ,               in     D ⊂ R d , B u = g ,                   on     ∂ D (1)</p><p>at the given collocation, points are (see [<xref ref-type="bibr" rid="scirp.132553-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref11">11</xref>] ):</p><p>y j = f * ( x j ) + η x j ,         j = 1,2, ⋯ , N ,</p><p>y k = g ( x N + k ) ,         k = 1 , 2 , ⋯ , M ,</p><p>where the random variable η → : = ( η x 1 , ⋯ , η x N ) Τ ~ N ( 0 → , Ψ → ∂ D ) .</p><p>There are two typical PDE problems (see Chapter 1 of [<xref ref-type="bibr" rid="scirp.132553-ref17">17</xref>] ). When P = ∇ = ∑ j = 1 d ∂ 2 ∂ x j 2 is the Laplace operator and B = I is the unit operator, we have the Dirichlet problem:</p><p>{ Δ u = f * ,               in     D ⊂ R d , u = g ,                       on     ∂ D (2)</p><p>When P = I is the unit operator and B = ∂ ∂ n → is the directional derivative along the outward normal vector n → , i.e. B u = ∂ u ∂ n → = ∇ u ⋅ n → , problem (1) become the Neumann boundary problem:</p><p>{ u = f * ,               in     D ⊂ R d , ∂ u ∂ n → = g ,             on     ∂ D . (3)</p><p>The observations { ( x j , y j ) } j = 1 N can be regarded as the supervised labeled observations in the setting of semi-supervised learning and y k ( k = 1,2, ⋯ , M ) maybe regarded as the unlabeled ones. This similarity encourages us to construct kernel-regularized learning algorithms to give numerical solution for problem (3) referring to the kernel semi-supervised learning frameworks (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref18">18</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref19">19</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref20">20</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref21">21</xref>] ). Along this line, we constructed in [<xref ref-type="bibr" rid="scirp.132553-ref15">15</xref>] a kind of kernel-regularized learning algorithm for solving problem (2) and showed the convergence rate. In the present paper, we shall construct a kind of kernel-regularized regression algorithm to solve problem (2) in case that D is the unit ball and ∂ D is the unit sphere. To this aim, we restate problem (3) in the setting of Sobolev spaces.</p><p>Let Ω ⊂ R d be a given bounded closed domain with a smoothness surface (i.e. the outward normal derivative is continuous) and ρ Ω be a Borel measure on Ω. Let C ( s ) ( Ω ) denote the set of functions such that ∂ x α f ( x ) ∈ C ( Ω ) and | α | ≤ s , where for α = ( α 1 , ⋯ , α d ) ∈ Z + d we define | α | = ∑ i = 1 d   α i and</p><p>∂ α f ( x ) : = ∂ x α f ( x ) = ∂ 1 α 1 ⋯ ∂ d α d f ( x ) = ∂ | α | f ( x ) ∂ ( x 1 ) α 1 ⋯ ( x d ) α d .</p><p>For a K ∈ C ( s ) ( Ω &#215; Ω ) , we define:</p><p>∂ x α ∂ y β K ( x , y ) = ∂ | α | + | β | K ( x , y ) ∂ ( x 1 ) α 1 ⋯ ( x d ) α d ∂ ( y 1 ) β 1 ⋯ ( y d ) β d , x = ( x 1 , ⋯ , x d ) , y = ( y 1 , ⋯ , y d ) .</p><p>Denoted by W 1 ( ρ Ω ) , the set of all functions whose 1-order partial derivatives are all in L 2 ( ρ Ω ) , i.e.</p><p>W 1 ( ρ Ω ) = { f : ‖ f ‖ W 1 ( ρ Ω ) = ( ∑ | α | ≤ 1 ‖ D α f ‖ 2, ρ Ω 2 ) 1 / 2 &lt; + ∞ } ,</p><p>and L 2 ( ρ Ω ) = { f ( x ) : ‖ f ‖ 2, ρ Ω = ( ∫ Ω | f ( x ) | 2 d ρ Ω ) 1 2 &lt; + ∞ } . Defined by ∂ ∂ n → , the outward normal derivative operator, i.e. ∂ f ( x ) ∂ n → : = ∂ x f ( x ) ∂ n → = ∇ f ( x ) ⋅ n → , where ∇ f ( x ) = ( ∂ f ∂ x 1 , ⋯ , ∂ f ∂ x d ) and n → is the outward normal vector at x = { x 1 , ⋯ , x d } ∈ ∂ Ω .</p><p>To borrow the setting of learning theory (see [<xref ref-type="bibr" rid="scirp.132553-ref22">22</xref>] ), we rewrite the problem (3). Let f * be an unknown function and g be a given function. Then, the collocation points z = { x 1 , ⋯ , x m } ⊂ i n t Ω and ν = { x m + 1 , ⋯ , x m + u } ⊂ ∂ Ω for the Neumann boundary problem:</p><p>{ f ( x ) = f * ( x ) ,       x ∈ Ω , ∂ x f ( x ) ∂ n → = g ( x ) ,       x ∈ ∂ Ω (4)</p><p>are scattered points with values y x i = f * ( x i ) + η x i , i = 1,2, ⋯ , m , and y x i = g ( x i ) , i = m + 1, m + 2, ⋯ , m + u , where for a given x i = ( x i 1 , ⋯ , x i d ) ∈ Ω , ξ x i is a random variable subject to a condition distribution ρ ( y | x i ) satisfying | ρ ( y | x i ) | ≤ B , B is a given constant number, E ρ ( ⋅ | x ) ( η x ) = ∫ [ − B , B ]   η x ( y ) d ρ ( y | x ) = 0 and σ = ( ∑ i = 1 m   σ x i 2 ) 1 2 &lt; + ∞ and σ x 2 = E ρ ( ⋅ | x ) ( η x 2 ) . The correspondence of (4) is:</p><p>{ f ( x i ) = y x i ,       i = 1,2, ⋯ , m , ∂ x f ( x ) ∂ n → | x = x i = g ( x i ) ,       i = m + 1, ⋯ , m + u (5)</p><p>We shall give an investigation on the convergence analysis of problem (5) when Ω is the unit ball B d = { x ∈ R d : ‖ x ‖ ≤ 1 } and ∂ Ω is the unit sphere S d − 1 = { x ∈ R d : ‖ x ‖ = 1 } . The paper is organized as follows. In Section 2.1, we shall provide some notions on the kernel-regularized regression learning, which contain the concept of reproducing kernel Hilbert spaces, the concept of kernel-regularized regression learning model and the kernel-regularized semi-supervised regression learning model. In Section 2.2, we shall provide some notions and results on spherical analysis. In particular, we shall define an RKHS with spherical harmonics, which have the reproducing property for the outward normal vector operator. The Neumann boundary value problem with the RKHS as the hypothesis space is defined. Based on these notions, we define in Section 2.3 a kind of kernel-regularized learning algorithm for solving the Neumann boundary value problem, show the representation theorem and give the error decomposition, with which we give the learning rates (i.e. the main Theorem 2.1) in Section 2.4. In Section 3, we shall give some lemmas, which will be used to show Theorem 2.1 in Section 4. Section 5 is the appendices containing some knowledge about the convex function, a kind of RKHS H K n → ( ρ Ω ) defined in a Sobolev space H n → ( ρ Ω ) associating a general domain Ω and its boundary ∂ Ω , and a probability inequality defined on a general RKHS.</p><p>Throughout the paper, we denote by A = O ( B ) the fact that there is a constant C independent of A and B such that A ≤ C B . We say a ~ B if both A = O ( B ) and B = O ( A ) .</p></sec><sec id="s2"><title>2. Kernel-Regularized Regression and Error Analysis</title><p>We first provide some notions and results about the kernel-regularized regression learning problem.</p><sec id="s2_1"><title>2.1. Notions and Results on Kernel-Regularized Regression</title><p>Follow all the definitions and notions in Section 1. Let K x ( y ) = K ( x , y ) : Ω &#215; Ω → R be a Mercer kernel (i.e. it is continuous, symmetry and positive semi-definite, i.e. for any given integer l ≥ 1 and any given set { x 1 , x 2 , ⋯ , x l } ⊂ Ω , the matrix ( K ( x i , x j ) ) i , j = 1,2, ⋯ , l is positive definite) satisfying ( ∫ Ω &#215; Ω | K ( x , y ) | 2 d ρ Ω ( x ) d ρ Ω ( y ) ) 1 2 &lt; + ∞ . The reproducing kernel Hilbert space ( H K , ‖   ⋅   ‖ K ) associated with K ( x , y ) is a Hilbert space consisting of all the real functions defined on Ω such that:</p><p>f ( x ) = 〈 f , K x 〉 K ,         ∀ x ∈ Ω ,   ∀ f ∈ H K .</p><p>Define an operator L K ( f , x ) as:</p><p>L K ( f , x ) = ∫ Ω   f ( y ) K x ( y ) d ρ Ω ,         x ∈ Ω   .</p><p>Then, L : L 2 ( ρ Ω ) → L 2 ( ρ Ω ) . Denoted by λ k the k-th eigenvalue associated with eigenfunction φ k . Then, we have by the Mercer theorem that:</p><p>K ( x , y ) = ∑ l = 0 + ∞   λ l φ l ( x ) φ l ( y ) ,             x , y ∈ Ω ,</p><p>where we assume the convergence on the right side is absolute (for every x , y ∈ Ω ) and uniform on x , y ∈ Ω . If { φ k } k = 0 + ∞ forms an orthonormal system in L 2 ( ρ Ω ) , then:</p><p>H K = L K 1 2 ( L 2 ( ρ Ω ) ) = { f ( x ) = ∑ l = 0 + ∞   a l ( f ) φ l ( x ) : ‖ f ‖ K = ( ∑ l = 0 + ∞ | a l ( f ) | 2 λ l ) 1 2 &lt; + ∞ } ,</p><p>where L K = L K 1 2 ∘ L K 1 2 .</p><p>Let z = { ( x k , y k ) } k = 1 m be a set of observations about an unknown function f * . Then, to obtain an good approximation of f * , one usually borrow the kernel-regularized regression learning model:</p><p>f z = arg min f ∈ H K ( E z ( f ) + λ ‖ f ‖ K 2 ) , (6)</p><p>where E z ( f ) = 1 m ∑ k = 1 m ( y k − f ( x k ) ) 2 is the empirical variance. In learning theory, the convergence analysis is sum up to bound the convergence rate for the error ( [<xref ref-type="bibr" rid="scirp.132553-ref23">23</xref>] or [<xref ref-type="bibr" rid="scirp.132553-ref24">24</xref>] ):</p><p>‖ f z − f * ‖ L 2 ( ρ Ω ) . (7)</p><p>When the observation set z has the form z = { ( x k , y k ) } k = 1 m ∪ { x m + l } l = 1 u , i.e. the nodes { x m + l } l = 1 u have no labeled observation values { y m + l } l = 1 u , we call this case the semi-supervised learning. In practical applications, most of the observations belong to semi-supervised samples since data is precious. Many mathematicians have paid their attentions to this field (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref18">18</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref19">19</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref21">21</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref25">25</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref26">26</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref27">27</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref28">28</xref>] ). The main ideas of dealing with this problem are add a term to make the use of unlabeled data, i.e. we need to modify (8) as the form of:</p><p>f z , λ , γ = arg min f ∈ H K ( E z ( f ) + γ   Ω u , m ( f ) + λ ‖ f ‖ K 2 ) ,           γ &gt; 0. (8)</p><p>For example, in [<xref ref-type="bibr" rid="scirp.132553-ref19">19</xref>] , one choose γ   Ω u , m ( f ) = γ ( u + m ) 2 ∑ i , j = 1 l + u ( f ( x i ) − f ( x j ) ) 2 W i , j , where W i , j are edge weights in the data adjacency graph. Also, in [<xref ref-type="bibr" rid="scirp.132553-ref21">21</xref>] , one chooses:</p><p>γ   Ω u , m ( f ) = γ ( m + u ) ( m + u − 1 ) ∑ i , j = 1, i ≠ j m + u   w i j ( σ ) &#215; ( f ( x i ) − f ( x j ) ) 2 , w i j ( σ ) = exp { − d K ( x i , x j ) σ } ,</p><p>where σ &gt; 0 and</p><p>d K ( x , y ) = ‖ K x − K y ‖ K = K ( x , x ) + K ( y , y ) − 2 K ( x , y ) .</p><p>These choices encourage us to choose suitable γ   Ω u , m ( f ) to give good approximation solution for problem (5).</p></sec><sec id="s2_2"><title>2.2. Some Results on Spherical Analysis</title><p>In this subsection, we shall define a kind of RKHS with spherical harmonics, with which define a kernel-regularized regression learning algorithm for solving problem (5) when Ω is the unit ball B d and show the learning rates.</p><p>Let B d = { x ∈ R d : ‖ x ‖ ≤ 1 } denote the unit ball in d-dimensional Euclidean space R d with the usual inner product 〈 x , y 〉 , and ‖ x ‖ = 〈 x , x 〉 is the usual Euclidean norm. For weight W μ ( x ) = ( 1 − ‖ x ‖ 2 ) μ , μ &gt; − 1 , we denote by L p , μ ( B d ) ≡ L p ( B d , W μ ) , 1 ≤ p &lt; + ∞ , the space of measurable functions defined on B d with:</p><p>‖ f ‖ L p , μ ( B d ) = ( ∫ B d | f ( x ) | p W μ ( x ) d x ) 1 2 &lt; + ∞ ,         1 ≤ p &lt; + ∞</p><p>and for p = + ∞ , we assume that L ∞ , μ denotes the space C ( B d ) of continuous functions on B d with the uniform norm.</p><p>We denote by Π n d the space of all polynomials in d variables of degree at most n, and by ν n d ( W μ ) the space of all polynomials of degree n which are orthogonal to polynomials of low degree in L 2, μ ( B d ) . The ν n d ( W μ ) is mutually orthogonal in L 2, μ ( B d ) and (see [<xref ref-type="bibr" rid="scirp.132553-ref29">29</xref>] ):</p><p>L 2, μ ( B d ) = ⊕ n = 0 ∞     ν n d ( W μ ) ,         Π n d = ⊕ k = 0 n     ν k d ( W μ ) .</p><p>Let d σ denote the Lebesgue measure on S d − 1 = { x : ‖ x ‖ = 1 } and denote the area of S d − 1 by σ d , σ d = ∫ S d − 1   d σ = 2 π d / 2 / Γ ( d / 2 ) . Let H n d denote the space of homogeneous harmonic polynomials of degree n, which are homogeneous polynomials of degree n satisfying equation Δ p = ∑ k = 1 d ∂ 2 p ∂ ( x k ) 2 = 0 . Also, we denote by P n d the set of homogeneous polynomials of degree n. It is well known that:</p><p>a n d : = dim H n d = ( n + d − 1 n − 1 ) − ( n + d − 3 n − 2 ) .</p><p>Let W 1 ( B d ) μ denote the set of functions whose 1-th derivatives are all in L 2, μ ( B d ) , i.e.</p><p>W 1 ( B d ) μ = { f : ‖ f ‖ W 2 ( B d ) μ = ( ∑ | α | ≤ 1 ‖ ∂ α f ‖ L 2, μ ( B d ) 2 ) 1 / 2 &lt; + ∞ } .</p><p>In this case, S d − 1 = { x : ‖ x ‖ 2 = ∑ i = 1 d ( x i ) 2 = 1 } and</p><p>∂ x f ∂ n → ( x ) = ∑ i = 1 d   x i ∂ f ∂ x i ( x ) ,       x = ( x 1 ,   ⋯ ,   x d ) ∈ S d − 1 . (9)</p><p>Define a subclass of W 1 ( B d ) μ as:</p><p>H μ n → = { f ∈ W 1 ( B d ) μ : ‖ f ‖ H μ n → = ( ∫ B d | f ( x ) | 2 W μ ( x ) d x + ∫ S d − 1 | ∂ ξ f ∂ n → ( ξ ) | 2 d σ ( ξ ) ) 1 2 &lt; + ∞ } .</p><p>An inner product defined on H μ n → is:</p><p>〈 f , g 〉 H μ n → = ∫ B d   f ( x ) g ( x ) W μ ( x ) d x + ∫ S d − 1 ∂ f ∂ n → ( ξ ) ∂ g ∂ n → ( ξ ) d σ ( ξ ) .</p><p>Denoted by ν n d ( W μ , S ) the space of orthogonal polynomials with respect to 〈 ⋅ , ⋅ 〉 H μ n → . Then, by Theorem 3 of [<xref ref-type="bibr" rid="scirp.132553-ref30">30</xref>] , we know ν n d ( W μ , S ) contains a mutually orthonormal basis { Q n , k ≡ Q n , k d : k = 1,2, ⋯ , a n d } with respect to 〈 ⋅ , ⋅ 〉 H μ n → . Then, there holds the expansion:</p><p>f ( x ) ~ ∑ k = 0 ∞   ∑ l = 1 a k d   a k , l ( f ) Q k , l ( x ) ,       x ∈ B d ,   f ∈ H μ n → ,</p><p>where a k , l ( f ) = 〈 f , Q k , l 〉 μ S and by the Bessel inequality, we have:</p><p>( ∑ k = 0 ∞   ∑ l = 1 d k d | a k , l ( f ) | 2 ) 1 2 ≤ ‖ f ‖ H μ n → &lt; + ∞ .</p><p>Let K μ ( x , y ) : B d &#215; B d → R be a Mercer kernel with the form:</p><p>K x μ ( y ) = K μ ( x , y ) : = ∑ k = 0 ∞     λ k P k ( x , y ) ,         x ∈ B d ,   y ∈ B d , (10)</p><p>where P k ( x , y ) = ∑ l = 1 a k d   Q k , l ( x ) Q k , l ( y ) and ∑ k = 0 ∞   λ k c k &lt; + ∞ with λ k &gt; 0 being defined as sup x ∈ B d P k ( x , x ) = c k   ( k = 0 , 1 , 2 , ⋯ ) .</p><p>Define</p><p>L K μ ( f ) ( x ) = L K μ ( f , x ) : = 〈 f , K x ( ⋅ ) 〉 H μ n → = ∑ k = 0 ∞   λ k ∑ l = 1 a k d   a k , l ( f ) Q k , l ( x ) ,       x ∈ B d</p><p>and H K μ n → = L K μ 1 2 ( H μ n → ) . Then,</p><p>H K μ n → = { ∑ k = 0 ∞   ∑ l = 1 a k d   a k , l ( f ) Q k , l ( x ) : ‖ f ‖ K μ = ( ∑ k = 0 ∞   ∑ l = 1 a k d | a k , l ( f ) | 2 λ k ) 1 2 &lt; + ∞ } .</p><p>For f ( x ) = ∑ k = 0 ∞   ∑ l = 1 a k d   a k , l ( f ) Q k , l ( x ) ∈ H K μ n → and g ( x ) = ∑ k = 0 ∞   ∑ l = 1 a k d   a k , l ( g ) Q k , l ( x ) ∈ H K μ n → , we define:</p><p>〈 f , g 〉 K μ = ∑ k = 0 ∞   ∑ l = 1 a k d a k , l ( f ) a k , l ( g ) λ k .</p><p>Then, we shall show in Proposition 2.1 that H K μ n → is an RKHS associated with kernel (10).</p><p>To give quantitatively description for the kernel K μ , we give two assumptions.</p><p>Assumption A. Assume K μ ∈ C ( 1 ) ( B d &#215; B d ) .</p><p>Assumption B. Assume sup x ∈ S d − 1 | ∂ x P k ( x , x ) ∂ n → | = c ′ k   ( k = 0,1,2, ⋯ ) and</p><p>∑ k = 0 ∞   λ k c ′ k &lt; + ∞ . (11)</p><p>Since { Q n , k ≡ Q n , k d : k = 1,2, ⋯ , a n d } are algebraic polynomials, c k and c ′ k must exist. The real numbers λ k satisfy (11) are also existent, for example, we can take λ k = e − ( c k + c ′ k ) ( k = 0,1,2, ⋯ ) .</p><p>If (11) holds, then by Theorem 2.4 in [<xref ref-type="bibr" rid="scirp.132553-ref31">31</xref>] , or Theorem 4.2 in [<xref ref-type="bibr" rid="scirp.132553-ref32">32</xref>] or Proposition 6.2 in [<xref ref-type="bibr" rid="scirp.132553-ref33">33</xref>] that:</p><p>∂ x α ( ∂ y β K x μ ( y ) ) = K μ ( x , y ) : = ∑ k = 0 ∞   λ k ∂ x α ( ∂ y β P k ( x , y ) ) ,       x ∈ S d − 1 ,   y ∈ S d − 1 , (12)</p><p>and the convergence in the right-side of (12) is absolute and uniform on S d − 1 &#215; S d − 1 .</p><p>Proposition 2.1. Assume above Assumptions A and B hold. Then,</p><p>1) There holds the reproducing property:</p><p>f ( x ) = 〈 f , K x μ ( ⋅ ) 〉 K μ ,       f ∈ H K μ n → ,   x ∈ B d . (13)</p><p>2) There holds the reproducing property for the outward normal vector operator, i.e.</p><p>∂ x f ( x ) ∂ n → = 〈 f , ∂ x K x μ ( ⋅ ) ∂ n → 〉 K μ ,       f ∈ H K μ n → ,   x ∈ S d − 1 . (14)</p><p>3) Define</p><p>k = sup x ∈ B d ‖ K x μ ( ⋅ ) ‖ K μ + sup x ∈ S d − 1 ‖ ∂ x ∂ n → K x μ ( ⋅ ) ‖ K μ .</p><p>Then, for all x ∈ B d and y ∈ S d − 1 , we have:</p><p>| f ( x ) | ≤ k ‖ f ‖ K μ ,       | ∂ x f ( y ) ∂ n → | ≤ k ‖ f ‖ K μ . (15)</p><p>Proof. See the proofs in Section 4.</p><p>Let { x i } i = 1 m + l be observations drawn i.i.d. according to ρ B d , y i = f * ( x i ) + η x i , i = 1 , 2 , ⋯ , m and for a given x i ∈ B d ξ x i is a random variable subject to a condition distribution ρ B d ( y | x i ) satisfying | ρ ( y | x i ) | ≤ B (B is a given constant number), E ρ ( ⋅ | x ) ( η x ) = ∫ [ − B , B ]   η x ( y ) d ρ B d ( y | x ) = 0 , σ = ( ∫ B d   σ x 2 d ρ B d ) 1 2 &lt; + ∞ and σ x 2 = E ρ B d ( ⋅ | x ) ( η x 2 ) . Then, z = { ( x i , y i ) } i = 1 m can be regarded as observations drawn i.i.d. according to ρ ( x , y ) = ρ B d ( x ) ρ B d ( y | x ) and { x i } i = m + 1 m + l be samples drawn i.i.d. according to ρ S d − 1 . The correspondence of problem (5) then is:</p><p>{ f ( x i ) = y x i ,       i = 1 , 2 , ⋯ , m , ∂ f ( x i ) ∂ n → = ∑ k = 1 d   x i k ∂ f ( x i ) ∂ x k = g ( x i ) ,       i = m + 1 , ⋯ , m + l (16)</p><p>We shall give an investigation on the numerical solutions of problem (16) with kernel-regularized approaches. A kernel learning algorithm with H K μ n → being the hypothesis space will be defined in Section 2.2. The representation theorem for it is provided and an error decomposition for its error analysis is given, from which a learning rate for Algorithm (16) is shown.</p></sec><sec id="s2_3"><title>2.3. Kernel-Regularized Regression</title><p>With above notions in hand, we now give following kernel-regularized learning algorithms for giving solutions for problem (16):</p><p>f z , λ = arg min f ∈ H K μ n → E z ( f ) + 1 l ∑ i = m m + l ( ∂ x f ( x i ) ∂ n → − g ( x i ) ) 2 + λ ‖ f ‖ K μ 2 , (17)</p><p>where l ~ m , i.e. there exist c 1 &gt; 0 , c 2 &gt; 0 such that c 1 ≤ l m ≤ c 2 and</p><p>E z ( f ) = 1 m ∑ i = 1 m ( f ( x i ) − y x i ) 2 .</p><p>Corresponding to (17), we define a model for the observations without the disturbances ξ x i by:</p><p>f X &#175; , λ = arg min f ∈ H K μ n → E X &#175; ( f ) + 1 l ∑ i = m m + l ( ∂ x f ( x i ) ∂ n → − g ( x i ) ) 2 + λ ‖ f ‖ K μ 2 , (18)</p><p>where</p><p>E X &#175; ( f ) = 1 m ∑ i = 1 m ( f ( x i ) − f * ( x i ) ) 2 .</p><p>The integral model for (18) is defined as:</p><p>f λ = arg min f ∈ H K μ n → E ( f ) + ∫ S d − 1 ( ∂ x f ( x ) ∂ n → − g ( x ) ) 2 d ρ S d − 1 + λ ‖ f ‖ K μ 2 , (19)</p><p>where</p><p>E ( f ) = ∫ B d ( f ( x ) − f * ( x ) ) 2 d ρ B d .</p><p>Proposition 2.2. (Representation theorem). Assume above Assumptions A and B hold. Then,</p><p>1) Algorithm (17) has unique solution f z , λ and there are coefficients { a i } i = 1 m + l depending upon λ , m , l , f z , λ , z , f * and g such that:</p><p>f z , λ ( ⋅ ) = ∑ i = 1 m   a i K x i μ ( ⋅ ) + ∑ k = 1 l   a m + k ∂ x K x m + k μ ( ⋅ ) ∂ n → . (20)</p><p>2) Algorithm (18) has unique solution f X &#175; , λ and there are coefficients { b i } i = 1 m + l depending upon λ , m , l , f X &#175; , λ , X &#175; , f * and g such that:</p><p>f X &#175; , λ ( ⋅ ) = ∑ i = 1 m   b i K x i μ ( ⋅ ) + ∑ k = 1 l   b m + k ∂ x K x m + k μ ( ⋅ ) ∂ n → . (21)</p><p>3) Algorithm (19) has unique solution f λ and there is a function G λ , f * ( x ) depending upon λ , f * and a function p λ , g ( x ) depending upon λ and g such that:</p><p>f λ ( ⋅ ) = ∫ B d   G λ , f * ( x ) K x μ ( ⋅ ) d ρ B d + ∫ S d − 1   p λ , g ( x ) ∂ x K x μ ( ⋅ ) ∂ n → d ρ S d − 1 . (22)</p><p>Proof. See the proof in Section 4.</p><p>(20) shows that Algorithm (17) can be replaced by some coefficient regularized models and is a new topic, such kind of research can be found from literature [<xref ref-type="bibr" rid="scirp.132553-ref34">34</xref>] [<xref ref-type="bibr" rid="scirp.132553-ref35">35</xref>] .</p><p>We give the following error decomposition:</p><p>‖ f z , λ − f * ‖ L 2 ( ρ B d ) + ‖ ∂ f z , λ ∂ n → − g ‖ L 2 ( ρ S d − 1 ) ≤ ‖ f z , λ − f X &#175; , λ ‖ L 2 ( ρ B d ) + ‖ f X &#175; , λ − f * ‖ L 2 ( ρ B d )         + ‖ ∂ f z , λ ∂ n → − ∂ f X &#175; , λ ∂ n → ‖ L 2 ( ρ S d − 1 ) + ‖ ∂ f X &#175; , λ ∂ n → − g ‖ L 2 ( ρ S d − 1 ) ≤ ‖ f z , λ − f X &#175; , λ ‖ L 2 ( ρ B d ) + ‖ f X &#175; , λ − f λ ‖ L 2 ( ρ B d ) + ‖ f λ − f * ‖ L 2 ( ρ B d )       + ‖ ∂ f z , λ ∂ n → − ∂ f X &#175; , λ ∂ n → ‖ L 2 ( ρ S d − 1 ) + ‖ ∂ f X &#175; , λ ∂ n → − ∂ f λ ∂ n → ‖ L 2 ( ρ S d − 1 ) + ‖ ∂ f λ ∂ n → − g ‖ L 2 ( ρ S d − 1 )</p><p>≤ ( ‖ f z , λ − f X &#175; , λ ‖ L 2 ( ρ B d ) + ‖ ∂ f z , λ ∂ n → − ∂ f X &#175; , λ ∂ n → ‖ L 2 ( ρ S d − 1 ) )       + ( ‖ f X &#175; , λ − f λ ‖ L 2 ( ρ B d ) + ‖ ∂ f X &#175; , λ ∂ n → − ∂ f λ ∂ n → ‖ L 2 ( ρ S d − 1 ) ) + 2 K ( f * , g , λ ) ≤ k ( ‖ f z , λ − f X &#175; , λ ‖ K μ + ‖ f X &#175; , λ − f λ ‖ K μ ) + 2 K ( f * , g , λ ) , (23)</p><p>where in the last derivation, we have used (15) and</p><p>K ( f * , g , λ ) = inf f ∈ H K μ n → ( ‖ f − f * ‖ L 2 ( ρ B d ) 2 + ∫ S d − 1 ( ∂ f ( x ) ∂ n → − g ( x ) ) 2 d ρ S d − 1 + λ ‖ f ‖ K μ 2 ) ,</p><p>which controls the approximation errors. Then, to bound the error:</p><p>E ( f z , λ , f * , g ) = ‖ f z , λ − f * ‖ L 2 ( ρ B d ) + ‖ ∂ f z , λ ∂ n → − g ‖ L 2 ( ρ S d − 1 ) ,</p><p>we need only to bound upper bounds for the sample errors ‖ f z , λ − f X &#175; , λ ‖ K μ and ‖ f X &#175; , λ − f ρ , λ ‖ K μ respectively.</p></sec><sec id="s2_4"><title>2.4. Learning Rates</title><p>Theorem 2.1. Let the above Assumptions A and B hold and let f z , λ be the solution of (17) and let f * ∈ C ( B d ) and g ∈ L 2 ( ρ S d − 1 ) . Then, for any δ ∈ ( 0,1 ) , with confidence 1 − δ , holds:</p><p>‖ f z , λ − f * ‖ L 2 ( ρ B d ) + ‖ ∂ f z , λ ∂ n → − g ‖ L 2 ( ρ S d − 1 ) = O ( σ λ m δ + ( K ( f * , g , λ ) λ 3 2 m + ‖ f * ‖ C ( B d ) λ m ) log 4 δ         + K ( f * , g , λ ) λ l δ + K ( f * , g , λ ) ) . (24)</p><p>If g = ∂ f * ( x ) ∂ n → for x ∈ S d − 1 , then</p><p>E ( f z , λ , f * , g ) = ‖ f z , λ − f * ‖ H n → ( ρ B d )</p><p>and in this case,</p><p>K ( f * , g , λ ) = K ( f * , λ ) = inf f ∈ H K μ n → ( ‖ f − f * ‖ H n → ( ρ B d ) 2 + λ ‖ f ‖ K μ 2 ) ,         λ &gt; 0 ,</p><p>where the norm ‖   ⋅   ‖ H n → ( ρ B d ) is defined in Section 2.2. The decay for K ( f * , λ ) has recently been discussed in [<xref ref-type="bibr" rid="scirp.132553-ref36">36</xref>] .</p><p>We then have the following Corollary 2.1.</p><p>Corollary 2.1. Let Assumptions A and B hold and let f z , λ be the solution of (17) and f * ∈ C ( B d ) . If g ( x ) = ∂ f * ( x ) ∂ n → for x ∈ S d − 1 . Then, for any δ ∈ ( 0,1 ) , with confidence 1 − δ , holds:</p><p>‖ f z , λ − f * ‖ H n → ( ρ B d ) = O ( σ λ m δ + ( K ( f * , λ ) λ 3 2 m + ‖ f * ‖ C ( B d ) λ m ) log 4 δ + K ( f * , g , λ ) λ l δ + K ( f * , λ ) ) . (25)</p><p>By (25), we know if λ = λ ( m ) ↓ 0 +   ( m → + ∞ ) is chosen in such a way that l i m λ → 0 + K ( f * , λ ) = 0 , λ 3 2 m → + ∞ when m → + ∞ , then, with confidence 1 − δ , holds (since m ~ l ):</p><p>l i m m → + ∞ ‖ f z , λ − f * ‖ H n → ( ρ B d ) = 0. (26)</p></sec><sec id="s2_5"><title>2.5. Further Discussions</title><p>We now give some explanation on the results and the assumptions.</p><p>1) There are three reasons encouraging us to choose D = B d and ∂ D = S d − 1 to show the kernel-regularized regression model for solving the Neumann boundary problem (5).</p><p>i) When D = B d and ∂ D = S d − 1 , we can easily give explicit representation for outward normal derivatives (see (9)). Therefore, we can extend the method used in this paper to the domains whose outward normal derivatives may be computed easily.</p><p>ii) By the statement in Section 2.1, we know a tool for us to construct a reproducing kernel Hilbert space is the orthonormal basis. We notice that function orthogonal basis theory has been developed not only on the unit ball B d (see [<xref ref-type="bibr" rid="scirp.132553-ref37">37</xref>] ) and the unit sphere S d − 1 (see [<xref ref-type="bibr" rid="scirp.132553-ref29">29</xref>] ), but also on the some Sobolev space associated with both B d and S d − 1 (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref30">30</xref>] ). These facts also encourage us to choose D = B d and ∂ D = S d − 1 for problem (5).</p><p>iii) In learning theory, the learning rate estimate for the kernel-regularized learning algorithm are sum up to bound the error bounds, which belong to the scope of approximation theory. One can make use of the rich spherical approximation theory skills to bound the learning rates (see [<xref ref-type="bibr" rid="scirp.132553-ref24">24</xref>] ).</p><p>2) In the present paper, we give two Assumptions A and B. They are reasonable.</p><p>Since p k ( x , y ) is a bivariate polynomial on B d &#215; B d for a given k, ∂ x p k ( x , x ) ∂ n → is still a polynomial whose sup can be attained on S d − 1 . The Assumption B is reasonable. By the same way, we know d k = sup x , y ∈ B d ∂ α p k ( x , y ) can be attained. If we choose λ l = e − k β ( β &gt; 0 ) , then, we can have K μ ∈ C ( 1 ) ( B d &#215; B d ) . So, the Assumption A is reasonable as well.</p><p>3) The convergence in the present paper are uniform convergence (see (26)), which admits the reliability for the proposed method.</p></sec></sec><sec id="s3"><title>3. Lemmas</title><p>To give the feature description of the optimal solutions of Algorithms (17)-(19), we need the concept of G&#226;teaux derivative.</p><p>Let ( H ,   ‖   ⋅   ‖ H ) be a Hilbert space, F ( f ) : H → R ∪ { ∓ ∞ } be a real function. We say F is G&#226;teaux differentiable at f ∈ H if there is a ξ ∈ H such that for any g ∈ H there holds:</p><p>l i m t → 0 F ( f + t g ) − F ( f ) t = 〈 g , ξ 〉 H</p><p>and write ∇ f F ( f ) = ξ as the G&#226;teaux derivative of F ( f ) at f.</p><p>To prove Theorem 2.1, we need some lemmas.</p><p>Lemma 3.1. There hold the following equations:</p><p>1) f z , λ satisfies equation:</p><p>1 m ∑ i = 1 m ( f z , λ ( x i ) − y x i ) K x i μ ( ⋅ ) + 1 l ∑ k = 1 l ( ∂ f z , λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n →   + λ f z , λ ( ⋅ ) = 0. (27)</p><p>2) f X &#175; , λ satisfies equation:</p><p>1 m ∑ i = 1 m ( f X &#175; , λ ( x i ) − f * ) ( x i ) K x i μ ( ⋅ ) + 1 l ∑ k = 1 l ( ∂ f X &#175; , λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n →   + λ f X &#175; , λ ( ⋅ ) = 0. (28)</p><p>3) f λ satisfies equation:</p><p>∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d + ∫ S d − 1 ( ∂ f λ ( x ) ∂ n → − g ( x ) ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1   + λ f λ ( ⋅ ) = 0. (29)</p><p>Proof. Proof of 1). Define Ω z ( f ) as:</p><p>Ω z ( f ) = E z ( f ) + 1 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x m + k ) ) 2 + λ ‖ f ‖ K μ 2 ,       f ∈ H K μ n → .</p><p>Then,</p><p>l i m t → 0 + Ω z ( f + t h ) − Ω z ( f ) t = l i m t → 0 + E z ( f + t h ) − E z ( f ) t + 2 λ 〈 h , f 〉 K μ ,       + 1 l ∑ k = 1 l l i m t → 0 + ( ∂ f ( x k + m ) ∂ n → + t ∂ h ( x k + m ) ∂ n → − g ( x k + m ) ) 2 − ( ∂ f ( x k + m ) ∂ n → − g ( x k + m ) ) 2 t ,</p><p>where</p><p>l i m t → 0 + E z ( f + t h ) − E z ( f ) t = 1 m ∑ i = 1 m l i m t → 0 + ( f ( x i ) + t h ( x i ) − y x i ) 2 − ( f ( x i ) − y x i ) 2 t = 2 m ∑ i = 1 m ( f ( x i ) − y x i ) h ( x i ) = 2 m ∑ i = 1 m ( f ( x i ) − y x i ) 〈 h , K x i μ ( ⋅ ) 〉 K μ = 〈 h , 2 m ∑ i = 1 m ( f ( x i ) − y x i ) K x i μ ( ⋅ ) 〉 K μ</p><p>and</p><p>1 l ∑ k = 1 l l i m t → 0 + ( ∂ f ( x k + m ) ∂ n → + t ∂ h ( x k + m ) ∂ n → − g ( x k + m ) ) 2 − ( ∂ f ( x k + m ) ∂ n → − g ( x k + m ) ) 2 t = 〈 h , 2 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x k + m ) ) ∂ K x k + m μ ( ⋅ ) ∂ n → 〉 K μ . (30)</p><p>It follows</p><p>l i m t → 0 + Ω z ( f + t h ) − Ω z ( f ) t = 〈 h , 2 m ∑ i = 1 m ( f ( x i ) − y x i ) K x i μ ( ⋅ ) + 2 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x k + m ) ) ∂ K x k + m μ ( ⋅ ) ∂ n → + 2 λ f 〉 K μ .</p><p>By the definition of G&#226;teaux derivative, we have:</p><p>∇ f Ω z ( f ) = 2 m ∑ i = 1 m ( f ( x i ) − y x i ) K x i μ ( ⋅ )   + 2 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x k + m ) ) ∂ K x k + m μ ( ⋅ ) ∂ n → + 2 λ f ( ⋅ ) .</p><p>By Fermat’s rule (see 1) in Proposition A1) and the definition of f z , λ , we have ∇ f Ω z ( f ) | f = f z , λ = 0 , i.e. (27) holds.</p><p>Lemma 3.2. There hold the inequality:</p><p>‖ f z , λ − f X &#175; , λ ‖ K μ ≤ 2 λ m ‖ ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) ‖ K μ (31)</p><p>and the inequality:</p><p>‖ f X &#175; , λ − f ρ , λ ‖ K μ ≤ 2 λ ( ‖ ∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d − 1 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) ‖ K μ         + ‖ ∫ S d − 1 ( ∂ f ( x ) ∂ n → − g ( x ) ) K x μ ( ⋅ ) d ρ S d − 1 − 1 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x m + k ) ) K x m + k μ ( ⋅ ) ‖ K μ ) . (32)</p><p>Proof. The definition of f z , λ and the inequality (43) give:</p><p>0 ≥ Ω z ( f z , λ ) − Ω z ( f X &#175; , λ ) ≥ 2 m ∑ i = 1 m ( f X &#175; , λ ( x i ) − y x i ) ( f z , λ ( x i ) − f X &#175; , λ ( x i ) )   + 2 l ∑ k = 1 l ( ∂ f X &#175; , λ ( x m + k ) ∂ n → − g ( x m + k ) ) ( ∂ f z , λ ( x m + k ) ∂ n → − ∂ f X &#175; , λ ( x m + k ) n → )   + λ ( ‖ f z , λ ‖ K μ 2 − ‖ f X &#175; , λ ‖ K μ 2 )</p><p>≥ 〈 f z , λ − f X &#175; , λ , 2 m ∑ i = 1 m ( f X &#175; , λ ( x i ) − y x i ) K x i μ ( ⋅ )   + 2 l ∑ k = 1 l ( ∂ f X &#175; , λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ   + 〈 f z , λ − f X &#175; , λ , 2 λ f X &#175; , λ 〉 K μ + λ ‖ f z , λ − f X &#175; , λ ‖ K μ 2 ,</p><p>where we have used (44). Since (28) and y x i = f * ( x i ) + η x i , we have:</p><p>0 ≥ Ω z ( f z , λ ) − Ω z ( f X &#175; , λ ) ≥ 〈 f z , λ − f X &#175; , λ , 2 m ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) 〉 K μ + λ ‖ f z , λ − f X &#175; , λ ‖ K μ 2 .</p><p>It follows:</p><p>λ ‖ f z , λ − f X &#175; , λ ‖ K μ 2 ≤ 〈 f X &#175; , λ − f z , λ , 2 m ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) 〉 K μ ≤ ‖ 2 m ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) ‖ K μ &#215; ‖ f X &#175; , λ − f z , λ ‖ K μ .</p><p>(31) thus holds.</p><p>Proof of (32). Define Ω X &#175; ( f ) as:</p><p>Ω X &#175; ( f ) = E X &#175; ( f ) + 1 l ∑ k = 1 l ( ∂ f ( x k + m ) ∂ n → − g ( x m + k ) ) 2 + λ ‖ f ‖ K μ 2 ,       f ∈ H K μ n → .</p><p>Then, by the definition of f X &#175; , λ and inequalities (43) and (44), we have:</p><p>0 ≥ Ω X &#175; ( f X &#175; , λ ) − Ω X &#175; ( f λ ) ≥ 〈 f X &#175; , λ − f λ , 2 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) 〉 K μ   + 〈 f X &#175; , λ − f λ ,   2 l ∑ k = 1 l ( ∂ f λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ   + 〈 f X &#175; , λ − f λ , 2 λ f λ 〉 K μ + λ ‖ f X &#175; , λ − f λ ‖ K μ 2 .</p><p>By (29), we have:</p><p>0 ≥ 〈 f X &#175; , λ − f λ , ∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d               − 1 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) + ∫ S d − 1 ( ∂ f λ ( x ) ∂ n → − g ( x ) ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1           − 1 l ∑ k = 1 l ( ∂ f λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ + λ ‖ f X &#175; , λ − f λ ‖ K μ 2 .</p><p>It follows:</p><p>λ ‖ f X &#175; , λ − f λ ‖ K μ 2 ≤ 〈 f X &#175; , λ − f λ , 1 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) − ∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d         − ∫ S d − 1 ( ∂ f λ ( x ) ∂ n → − g ( x ) ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1         + 1 l ∑ k = 1 l ( ∂ f λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ</p><p>≤ ( ‖ 1 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) − ∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d ‖ K μ         &#215; ‖ f X &#175; , λ − f λ ‖ K μ + ‖ ∫ S d − 1 ( ∂ f λ ( x ) ∂ n → − g ( x ) ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1         − 1 l ∑ k = 1 l ( ∂ f λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → ‖ K μ ) .</p><p>Above inequality gives (32).</p><p>Lemma 3.3. There hold following inequalities.</p><p>1) For any δ ∈ ( 0,1 ) , with confidence 1 − δ , holds:</p><p>‖ f z , λ − f X &#175; , λ ‖ K μ ≤ 2 k σ λ m δ . (33)</p><p>2) For any δ ∈ ( 0,1 ) , with confidence 1 − δ , holds:</p><p>‖ f X &#175; , λ − f λ ‖ K μ = ( 2 k 2 K ( f * , g , λ ) λ m λ + 2 k ‖ f * ‖ C ( B d ) λ m ) log 2 δ + k K ( f * , g , λ ) l δ . (34)</p><p>Proof. Proof of (33). The definition of ‖   ⋅   ‖ K μ ( B d ) and (14) give:</p><p>‖ 1 m ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) ‖ K μ 2 = 〈 1 m ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) , 1 m ∑ j = 1 m     η x j ( y ) K x j μ ( ⋅ ) 〉 K μ = 1 m 2 ∑ i , j = 1 m     η x i ( y ) η x j ( y ) K μ ( x i , x j ) .</p><p>By Markov inequality, we have:</p><p>P ( ‖ f z , λ − f X &#175; , λ ‖ K μ &gt; ε ) ≤ E ( ‖ f z , λ − f X &#175; , λ ‖ K μ 2 ) ε 2 . (35)</p><p>Then, by (31), we have:</p><p>‖ f z , λ − f X &#175; , λ ‖ K μ 2 ≤ 4 λ 2 m 2 ‖ ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) ‖ K μ 2 ≤ 4 λ 2 m 2 〈 ∑ i = 1 m     η x i ( y ) K x i μ ( ⋅ ) ,   ∑ j = 1 m     η x j ( y ) K x j μ ( ⋅ ) 〉 K μ = 4 λ 2 m 2 ∑ i , j = 1 m     η x i ( y ) η x j ( y ) K μ ( x i , x j ) .</p><p>Since E ρ ( ⋅ | x ) ( η x ) = 0 , we have:</p><p>E ( ‖ f z , λ − f X &#175; , λ ‖ K μ 2 ) ≤ 4 k 2 λ 2 m ∫ B d     η x 2 d ρ B d = 4 k 2 σ 2 λ 2 m .</p><p>By (35), we have:</p><p>P ( ‖ f z , λ − f X &#175; , λ ‖ K μ ≤ ε ) ≥ 1 − 4 k 2 σ 2 λ 2 m ε 2 .</p><p>Taking δ = 4 k 2 σ 2 λ 2 m ε 2 . We have ε = 2 k σ λ m δ and (33) thus holds.</p><p>We now show (34). Take</p><p>A = ‖ ∫ B d ( f λ ( x ) − f * ( x ) ) K x μ ( ⋅ ) d ρ B d − 1 m ∑ i = 1 m ( f λ ( x i ) − f * ( x i ) ) K x i μ ( ⋅ ) ‖ K μ</p><p>and</p><p>B = ‖ ∫ S d − 1 ( ∂ f λ ( x ) ∂ n → − g ( x ) ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1     − 1 l ∑ k = 1 l ( ∂ f λ ( x m + k ) ∂ n → − g ( x m + k ) ) ∂ K x m + k μ ( ⋅ ) ∂ n → ‖ K μ .</p><p>Then,</p><p>‖ f X &#175; , λ − f ρ , λ ‖ K μ ≤ 2 λ ( A + B ) , (36)</p><p>where</p><p>A ≤ ‖ ∫ B d     f λ ( x ) K x μ ( ⋅ ) d ρ B d − 1 m ∑ i = 1 m     f λ ( x i ) K x i μ ( ⋅ ) ‖ K μ     + ‖ ∫ B d     f * ( x ) K x μ ( ⋅ ) d ρ B d − 1 m ∑ i = 1 m     f * ( x i ) K x i μ ( ⋅ ) ‖ K μ .</p><p>Since</p><p>‖ f λ ( x ) K x μ ( ⋅ ) ‖ K μ = | f λ ( x ) | K x μ ( x ) ≤ k | f λ ( x ) | ≤ k 2 ‖ f ‖ K μ ≤ k 2 K ( f * , g , λ ) λ</p><p>and ‖ f * ( x ) K x μ ( ⋅ ) ‖ K μ ≤ k | f * ( x ) | ≤ k ‖ f * ‖ C ( B d ) , we have by (47) that, with confidence 1 − δ , holds:</p><p>A ≤ ( 2 k 2 K ( f * , g , λ ) λ m λ + 2 k ‖ f * ‖ C ( B d ) λ m ) log 2 δ . (37)</p><p>On the other hand, take ξ ( x ) = ( ∂ f λ ( x ) ∂ n → − g ( x ) ) . Then,</p><p>B = ‖ ∫ S d − 1     ξ ( x ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1 − 1 l ∑ k = 1 l     ξ ( x m + k ) ∂ K x m + k μ ( ⋅ ) ∂ n → ‖ K μ .</p><p>By the definition of ‖   ⋅   ‖ K μ , we have:</p><p>B 2 = 〈 ∫ S d − 1     ξ ( x ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1 − 1 l ∑ k = 1 l     ξ ( x m + k ) ∂ K x m + k μ ( ⋅ ) ∂ n → ,       ∫ S d − 1     ξ ( u ) ∂ K u μ ( ⋅ ) ∂ n → d ρ S d − 1 − 1 l ∑ i = 1 l     ξ ( x m + i ) ∂ K x m + i μ ( ⋅ ) ∂ n → 〉 K μ = ∫ S d − 1     ∫ S d − 1     ξ ( x ) ξ ( u ) 〈 ∂ K x μ ( ⋅ ) ∂ n → , ∂ K u μ ( ⋅ ) ∂ n → 〉 K μ d ρ S d − 1 ( x ) d ρ S d − 1 ( u )   − 2 ∫ S d − 1     ξ ( x ) ( 1 l ∑ i = 1 l     ξ ( x m + i ) 〈 ∂ K x μ ( ⋅ ) ∂ n → ,   ∂ K x m + i μ ( ⋅ ) ∂ n → 〉 K μ ) d ρ S d − 1 ( x )   + 1 l 2 ∑ k = 1 l     ξ 2 ( x m + k ) 〈 ∂ K x m + k μ ( ⋅ ) ∂ n → , ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ   + 1 l 2 ∑ k , i = 1 , k ≠ i l     ξ ( x m + i ) ξ ( x m + j ) 〈 ∂ K x m + k μ ( ⋅ ) ∂ n → , ∂ K x m + i μ ( ⋅ ) ∂ n → 〉 K μ .</p><p>Since (14), we have:</p><p>〈 ∂ K x μ ( ⋅ ) ∂ n → , ∂ K u μ ( ⋅ ) ∂ n → 〉 K μ = ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) ,</p><p>〈 ∂ K x μ ( ⋅ ) ∂ n → ,   ∂ K x m + i μ ( ⋅ ) ∂ n → 〉 K μ = ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) | u = x m + i</p><p>and</p><p>〈 ∂ K x m + k μ ( ⋅ ) ∂ n → , ∂ K x m + k μ ( ⋅ ) ∂ n → 〉 K μ = ∂ u ∂ n → ∂ u ∂ n → K u μ ( u ) | u = x m + k ,</p><p>〈 ∂ K x m + k μ ( ⋅ ) ∂ n → , ∂ K x m + i μ ( ⋅ ) ∂ n → 〉 K μ = ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) | u = x m + i , x = x m + k .</p><p>It follows that:</p><p>B 2 = ∫ S d − 1   ∫ S d − 1   ξ ( x ) ξ ( u ) ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) d ρ S d − 1 ( x ) d ρ S d − 1 ( u )     − 2 ∫ S d − 1   ξ ( x ) ( 1 l ∑ i = 1 l   ξ ( x m + i ) ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) | u = x m + i ) d ρ S d − 1 ( x )     + 1 l 2 ∑ k = 1 l   ξ 2 ( x m + k ) ∂ u ∂ n → ∂ u ∂ n → K u μ ( u ) | u = x m + k     + 1 l 2 ∑ k , i = 1 , k = i l   ξ ( x m + i ) ξ ( x m + j ) ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) | u = x m + i , x = x m + k .</p><p>Since ( x m + 1 , x m + 2 , ⋯ , x m + l ) are i.i.d. according to ρ ρ S d − 1 , we have:</p><p>E ( B 2 ) = 1 l ( ∫ S d − 1     ξ 2 ( x ) ∂ x ∂ n → ∂ x ∂ n → K x μ ( x ) d ρ S d − 1 ( x )     − ∫ S d − 1     ∫ S d − 1     ξ ( x ) ξ ( u ) ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) d ρ S d − 1 ( x ) d ρ S d − 1 ( u ) ) ≤ 1 l ( ∫ S d − 1     ξ 2 ( x ) ∂ x ∂ n → ∂ x ∂ n → K x μ ( x ) d ρ S d − 1 ( x ) ) ,</p><p>where we have used the fact that:</p><p>∫ S d − 1     ∫ S d − 1     ξ ( x ) ξ ( u ) ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) d ρ S d − 1 ( x ) d ρ S d − 1 ( u ) ≥ 0</p><p>since ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) is a positive definition function about u and x. According to Assumption B, we have:</p><p>| ∂ u ∂ n → ∂ x ∂ n → K x μ ( u ) | ≤ k 2 ,</p><p>It follows that:</p><p>E ( B 2 ) ≤ k 2 l ( ∫ S d − 1     ξ 2 ( x ) d ρ S d − 1 ( x ) ) .</p><p>By Markov inequality, we have:</p><p>P ( B &gt; ε ) ≤ E ( B 2 ) ε 2 ≤ k 2 l ε 2 ( ∫ S d − 1     ξ 2 ( x ) d ρ S d − 1 ( x ) ) ≤ k 2 l ε 2 K ( f * , g , λ ) .</p><p>Take δ = k 2 l ε 2 K ( f * , g , λ ) . Then, ε = k K ( f * , g , λ ) l δ . It follows that with confidence 1 − δ :</p><p>B ≤ k K ( f * , g , λ ) l δ . (38)</p><p>Collecting (38), (37) and (36), we arrive at (34).</p></sec><sec id="s4"><title>4. Proofs</title><p>Proof of Proposition 2.1. Proof of 1). For any f ( x ) = ∑ k = 0 ∞   ∑ l = 1 a k d   a k , l ( f ) Q k , l ( x ) ∈ H K μ n → . We rewrite K μ ( x , y ) as:</p><p>K μ ( x , y ) = K y μ ( x ) = ∑ k = 0 ∞   ∑ l = 1 a k d ( λ k , l Q k , l ( y ) ) Q k , l ( x ) .</p><p>Then,</p><p>〈 f , K y μ 〉 K μ = ∑ k = 0 ∞   ∑ l = 1 a k d ( λ k , l Q k , l ( y ) ) a k , l ( f ) λ k , l = f ( y ) ,</p><p>which yields (13).</p><p>Proof of 2). Since (9) and (46), we have:</p><p>∂ f ( x ) ∂ n → = 〈 f , ∑ i = 1 d   x i ∂ K ( x , ⋅ ) ∂ x i ( x ) 〉 K μ = 〈 f , ∂ ∂ n → K ( x , ⋅ ) 〉 K μ ,       x = ( x 1 ,   ⋯ ,   x d ) ∈ S d − 1 .</p><p>(14) thus holds.</p><p>Proof of 3). By (14), we have:</p><p>| ∂ f ( x ) ∂ n → | = | 〈 f ,   ∂ K x μ ( ⋅ ) ∂ n → 〉 K μ | ≤ ‖ f ‖ K μ ‖ ∂ K x μ ( ⋅ ) ∂ n → ‖ K μ ≤ k ‖ f ‖ K μ .</p><p>By the same method, we have by (13) that:</p><p>| f ( x ) | = | 〈 f , K x μ ( ⋅ ) 〉 K μ | ≤ ‖ f ‖ K μ ‖ K x μ ( ⋅ ) ‖ K μ ≤ k ‖ f ‖ K μ .</p><p>(15) thus holds.</p><p>Proof of Proposition 2.2. By (27), we have:</p><p>f z , λ ( ⋅ ) = 1 λ m ∑ i = 1 m ( y x i − f z , λ ( x i ) ) K x i μ ( ⋅ )     + 1 λ l ∑ k = 1 l ( g ( x m + k ) − ∂ f z , λ ( x m + k ) ∂ n → ) ∂ K x m + k μ ( ⋅ ) ∂ n → . (39)</p><p>Taking a i = 1 λ m ( y x i − f z , λ ( x i ) ) and a m + k = 1 λ l ( g ( x m + k ) − ∂ f z , λ ( x m + k ) ∂ n → ) into (40), we have (20).</p><p>By (29), we have:</p><p>f X &#175; , λ ( ⋅ ) = 1 λ m ∑ i = 1 m ( f * ( x i ) − f X &#175; , λ ( x i ) ) K x i μ ( ⋅ )   + 1 λ l ∑ k = 1 l ( g ( x m + k ) − ∂ f X &#175; , λ ( x m + k ) n → ) ∂ K x m + k μ ( ⋅ ) ∂ n → . (40)</p><p>Taking b i = 1 λ m ( f * ( x i ) − f X &#175; , λ ( x i ) ) and b m + k = 1 λ l ( g ( x m + k ) − ∂ f X &#175; , λ ( x m + k ) n → ) into (40), we have (21).</p><p>By (29), we have:</p><p>f λ ( ⋅ ) = 1 λ ∫ B d ( f * ( x ) − f λ ( x ) ) K x μ ( ⋅ ) d ρ B d                       + 1 λ ∫ S d − 1 ( g ( x ) − ∂ f λ ( x ) n → ) ∂ K x μ ( ⋅ ) ∂ n → d ρ S d − 1 . (41)</p><p>Taking G λ , f * ( x ) = 1 λ ( f * ( x ) − f λ ( x ) ) and p λ , g ( x ) = 1 λ ( g ( x ) − ∂ f λ ( x ) ∂ n → ) into (41), we have (22).</p><p>Proof of Theorem 2.1. Collecting (33), (34) and (23), we have:</p><p>‖ f z , λ − f * ‖ L 2 ( ρ B d ) + ‖ ∂ f z , λ ∂ n → − g ‖ L 2 ( ρ S d − 1 ) ≤ 2 k 2 σ λ m δ + k ( K ( f * , g , λ ) λ 3 2 m + ‖ f * ‖ C ( B d ) λ m ) log 4 δ         + k K ( f * , g , λ ) l δ + 2 K ( f * , g , λ ) . (42)</p><p>(42) yields (24).</p></sec><sec id="s5"><title>Founding</title><p>Supported partially by the NSF (Project No. 61877039), the NSFC/RGC Joint Research Scheme (Project No. 12061160462 and N_CityU102/20) of China.</p></sec><sec id="s6"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s7"><title>Cite this paper</title><p>Ran, X.X. and Sheng, B.H. (2024) Solving Neumann Boundary Problem with Kernel-Regularized Learning Approach. Journal of Applied Mathematics and Physics, 12, 1101-1125. https://doi.org/10.4236/jamp.2024.124069</p></sec><sec id="s8"><title>Appendices</title>Appendix 1. G&#226;teaux Derivative and the Convex Function<p>Following Proposition A1 can be found from the Proposition 17.4, Proposition 17.10 and Proposition 17.12 of [<xref ref-type="bibr" rid="scirp.132553-ref38">38</xref>] .</p><p>Proposition A1. Let ( H , ‖   ⋅   ‖ H ) be a Hilbert space and F ( f ) : H → R ∪ { ∓ ∞ } be a function defined on H . Then,</p><p>1) If F ( f ) is a convex function, then, F ( f ) attains minimal value at f 0 if and only if ∇ f F ( f 0 ) = 0 .</p><p>2) If F ( f ) : H → R ∪ { ∓ ∞ } is a G&#226;teaux differentiable function, then F ( f ) is a convex on H if and only if for any f , g ∈ H , there holds:</p><p>F ( g ) − F ( f ) ≥ 〈 g − f ,   ∇ f F ( f ) 〉 H .</p><p>In particular, we have:</p><p>x 2 − y 2 ≥ 2 y ( x − y ) ,       ∀ x , y ∈ R . (43)</p><p>3) For function F ( f ) = ‖ f ‖ H 2 , there holds ∇ f F ( f ) = 2 f and there holds equality:</p><p>‖ f ‖ H 2 − ‖ g ‖ H 2 = 〈 f − g ,   2 g 〉 H + ‖ f − g ‖ H 2 , (44)</p><p>i.e.</p><p>‖ f ‖ H 2 − ‖ g ‖ H 2 = 〈 f − g ,   ∇ g F ( g ) 〉 H + ‖ f − g ‖ H 2 .</p>Appendix 2. Derivatives Reproducing Property<p>Let Ω ⊂ R d be a compact subset which is the closure of its nonempty interior Ω 0 . Let K ( x , y ) be a Mercer kernel on Ω &#215; Ω having the expansion (see e.g. [<xref ref-type="bibr" rid="scirp.132553-ref39">39</xref>] ):</p><p>K ( x , y ) = ∑ k = 0 + ∞     λ k φ k ( x ) φ k ( y ) ,       x , y ∈ Ω , (45)</p><p>where the convergence is absolute (for each x , y ∈ Ω ) and uniform on Ω &#215; Ω . Then, we have a proposition.</p><p>Proposition A2. Let K ( x , y ) be a Mercer kernel of form (45) and K ∈ C ( 1 ) ( Ω &#215; Ω ) . If H K is a reproducing kernel Hilbert space such that:</p><p>f ( x ) = 〈 f , K ( ⋅ , x ) 〉 H K ,       f ∈ H K ,   x ∈ Ω   .</p><p>Then,</p><p>∂ x α f ( x ) = 〈 f , ∂ x α K ( ⋅ , x ) 〉 H K n → ( ρ Ω ) ,       | α | ≤ 1 ,   f ∈ H K ,   x ∈ Ω , (46)</p><p>where | α | = ∑ i = 1 d     α i ≤ 1 .</p><p>Proof. It can be found from Theorem 1 in [<xref ref-type="bibr" rid="scirp.132553-ref12">12</xref>] , or see (v) in Theorem 4.7 in [<xref ref-type="bibr" rid="scirp.132553-ref40">40</xref>] .</p>Appendix 3. A Probability Inequality<p>Proposition A4. [<xref ref-type="bibr" rid="scirp.132553-ref41">41</xref>] Let ξ be a random variable taking values in a real separable Hilbert space H on a probability space ( Ω , F , P ) . Assume that there are a positive constant L such that ‖ ξ ‖ H ≤ L . Then, for all n ≥ 1 and 0 &lt; η &lt; 1 , it holds, with confidence 1 − η , that:</p><p>‖ 1 n ∑ i = 1 n     ξ ( Ω i ) − E ( ξ ) ‖ H ≤ 4 L n log 2 η . (47)</p></sec></body><back><ref-list><title>References</title><ref id="scirp.132553-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Atkinson, K., Hansen, O. and Chien, D. (2011) A Spectral Method for Elliptic Equations: The Neumann Problem.&lt;i&gt; Advances in Computational Mathematics&lt;/i&gt;, 34, 295-317. &lt;br&gt;https://doi.org/10.1007/s10444-010-9154-3</mixed-citation></ref><ref id="scirp.132553-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Atkinson, K., Chien, D. and Hansen, O. (2010) A Spectral Method for Elliptic Equation: The Dirichlet Problem.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Advances in Computational Mathematics&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;33, 169-189. &lt;br&gt;https://doi.org/10.1007/s10444-009-9125-8</mixed-citation></ref><ref id="scirp.132553-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Atkinson, K. and Hansen, O. (2010) A Spectral Method for the Eigenvalue Problem for Elliptic Equations. &lt;i&gt;Electronic Transactions on Numerical Analysis&lt;/i&gt;, 37, 386-412.</mixed-citation></ref><ref id="scirp.132553-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Li, X. (2009) Approximation of Potential Integral by Radial Bases for Solutions of Helmholtz Equation. &lt;i&gt;Advances in Computational Mathematics&lt;/i&gt;, 30, 201-230. &lt;br&gt;https://doi.org/10.1007/s10444-008-9065-8</mixed-citation></ref><ref id="scirp.132553-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Li, X. (2008) Rate of Convergence of the Method of Fundamental Solutions and Hyperinterpolation for Modified Helmholtz Equations on the Unit Ball.&lt;i&gt; Advances &lt;/i&gt;&lt;i&gt;in Computational Mathematics&lt;/i&gt;, 29, 393-413. &lt;br&gt;https://doi.org/10.1007/s10444-007-9056-1</mixed-citation></ref><ref id="scirp.132553-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Li, X. (2008) Convergence of the Method of Fundamental Solutions for Poisson&amp;#8217;s Equation on the Unit Sphere. &lt;i&gt;Advances in Computational Mathematics&lt;/i&gt;, 28, 269-282. &lt;br&gt;https://doi.org/10.1007/s10444-006-9022-3</mixed-citation></ref><ref id="scirp.132553-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Cialenco, I., Fasshauer, G.E. and Ye, Q. (2012) Approximation of Stochastic Partial Differential Equations by a Kernel-Based Collocation Method. &lt;i&gt;International Journal of Computer Mathematics&lt;/i&gt;, 89, 2543-2561. &lt;br&gt;https://doi.org/10.1080/00207160.2012.688111</mixed-citation></ref><ref id="scirp.132553-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Ding, L.L., Liu, Z.Y. and Xu, Q.Y. (2021) Multilevel RBF Collocation Method for the Fourth-Order Thin Plate Problem. &lt;i&gt;International Journal of Wavelets&lt;/i&gt;,&lt;i&gt; Multiresolu&lt;/i&gt;&lt;i&gt;tion and Information Processing&lt;/i&gt;, 19, Article ID: 2050079. &lt;br&gt;https://doi.org/10.1142/S0219691320500794</mixed-citation></ref><ref id="scirp.132553-ref9"><label>9</label><mixed-citation publication-type="book" xlink:type="simple">Fasshauer, G.E. and Ye, Q. (2013) Kernel-Based Collocation Methods versus Galerkin Finite Element Methods for Approximating Elliptic Stochastic Partial Differential Equation. In: Griebel, M. and Schweitzer, M., Eds., &lt;i&gt;Meshfree Methods for Pa&lt;/i&gt;&lt;i&gt;r&lt;/i&gt;&lt;i&gt;tial Differential Equations VI&lt;/i&gt;, Springer, Berlin, 155-170. &lt;br&gt;https://doi.org/10.1007/978-3-642-32979-1_10</mixed-citation></ref><ref id="scirp.132553-ref10"><label>10</label><mixed-citation publication-type="book" xlink:type="simple">Fasshauer, G.E. and Ye, Q. (2012) A Kernel-Based Collocation Method for Elliptic Partial Differential Equations with Random Coefficients. In: Dick, J., Kuo, F., Peters, G. and Sloan, I., Eds., &lt;i&gt;Monte Carlo and Quasi-Monte Carlo Methods&lt;/i&gt; 2012, Springer, Berlin, 331-347. &lt;br&gt;https://doi.org/10.1007/978-3-642-41095-6_14</mixed-citation></ref><ref id="scirp.132553-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Ye, Q. (2014) Approximation of Nonlinear Stochastic Partial Differential Equations by a Kernel-Based Collocation Method. &lt;i&gt;International Journal of Applied Nonlinear Science&lt;/i&gt;, 1, 156-172. &lt;br&gt;https://doi.org/10.1504/IJANS.2014.061018</mixed-citation></ref><ref id="scirp.132553-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Zhou, D.X. (2008) Derivative Reproducing Properties for Kernel Methods in Learning Theory. &lt;i&gt;Journal of Computational and Applied Mathematics&lt;/i&gt;, 220, 456-463. &lt;br&gt;https://doi.org/10.1016/j.cam.2007.08.023</mixed-citation></ref><ref id="scirp.132553-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Bao, K.J., Qian, X., Liu, Z.Y. and Song, S.B. (2022) An Operator Learning Approach &lt;i&gt;via&lt;/i&gt; Function&amp;#8212;Valued Reproducing Kernel Hilbert Space for Diferential Equations. arXiv: 2202.09488.</mixed-citation></ref><ref id="scirp.132553-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Mo, Y. and Qian, T. (2014) Support Vector Machine Adapted Tikhonov Regularization Method to Solve Dirichlet Problem. &lt;i&gt;Applied Mathematics and Computation&lt;/i&gt;, 245, 509-519. &lt;br&gt;https://doi.org/10.1016/j.amc.2014.07.089</mixed-citation></ref><ref id="scirp.132553-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H., Zhou, D.P. and Wang, S.H. (2022) The Kernel Regularized Learning Algorithm for Solving Laplace Equationn with Dirichlet Boundary. &lt;i&gt;International &lt;/i&gt;&lt;i&gt;Journal of Wavelets&lt;/i&gt;,&lt;i&gt; Multiresolution and Information Processing&lt;/i&gt;, 20, Article ID: 2250031. &lt;br&gt;https://doi.org/10.1142/S021969132250031X</mixed-citation></ref><ref id="scirp.132553-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Stepaniants, G. (2023) Learning Partial Differential Equations in Reproducing Kernel Hilbert Spaces. &lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;, 24, 1-72</mixed-citation></ref><ref id="scirp.132553-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Harosko, D.D. and Triebel, H. (2008) Distributions, Sobolev Spaces, Elliptic Equations. European Mathematical Society, Helsinki. &lt;br&gt;https://doi.org/10.4171/042</mixed-citation></ref><ref id="scirp.132553-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Belkin, M. and Niyogi, P. (2004) Semi-Supervised Learning on Riemannian Manifolds. &lt;i&gt;Machine Learning&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;56, 209-239. &lt;br&gt;https://doi.org/10.1023/B:MACH.0000033120.25363.1e</mixed-citation></ref><ref id="scirp.132553-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Belkin, M., Niyogi, P. and Sindhwani, V. (2006) Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples. &lt;i&gt;Journal of &lt;/i&gt;&lt;i&gt;M&lt;/i&gt;&lt;i&gt;a&lt;/i&gt;&lt;i&gt;chine Learning Research&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;7, 2399-2434.</mixed-citation></ref><ref id="scirp.132553-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Niyogi, P. (2013) Manifold Regularization and Semi-Supervised Learning: Some Theoretical Analysis,&lt;i&gt; &lt;/i&gt;&lt;i&gt;Journal of Machine Learning Research&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;14, 1229-1250.</mixed-citation></ref><ref id="scirp.132553-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H. and Zhu, H.C. (2018) The Convergence Rate of Semi-Supervised Regression with Quadratic Loss.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Applied Mathematics and Computation&lt;/i&gt;, 321, 11-24. &lt;br&gt;https://doi.org/10.1016/j.amc.2017.10.033</mixed-citation></ref><ref id="scirp.132553-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Smale, S. and Zhou, D.X. (2004) Shannon Sampling and Function Reconstruction from Point Values. &lt;i&gt;Bulletin of the American Mathematical Society&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;41, 279-305. &lt;br&gt;https://doi.org/10.1090/S0273-0979-04-01025-0</mixed-citation></ref><ref id="scirp.132553-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Cucker, F. and Smale, S. (2002) On the Mathematical Foundations of Learning Theory. &lt;i&gt;Bulletin of the American Mathematical Society&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;39, 1-49. &lt;br&gt;https://doi.org/10.1090/S0273-0979-01-00923-5</mixed-citation></ref><ref id="scirp.132553-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Cucker, F. and Zhou, D.X. (2007) Learning Theory: An Approximation Theory Viewpoint. Cambridge University Press, Cambridge. &lt;br&gt;https://doi.org/10.1017/CBO9780511618796</mixed-citation></ref><ref id="scirp.132553-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Chen, H., Zhou, Y.C., Tang, Y.Y., Li, L.Q. and Pan, Z.B. (2013) Convergence Rate of the Semi-Supervised Greedy Algorithm.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Neural Networks&lt;/i&gt;, 44, 44-50. &lt;br&gt;https://doi.org/10.1016/j.neunet.2013.03.001</mixed-citation></ref><ref id="scirp.132553-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H. and Xiang, D.H. (2017) The Performance of Semi-Supervised Laplacian Regularized Regression with Least Square Loss. &lt;i&gt;International Journal of Wavelets&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;&lt;i&gt;Multiresolution and Information Processing&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;15, Article ID: 1750016. &lt;br&gt;https://doi.org/10.1142/S0219691317500163</mixed-citation></ref><ref id="scirp.132553-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H., Xiang, D.H. and Ye, P.X. (2015) Convergence Rate of Semi-Supervised Gradient Learning. &lt;i&gt;International Journal of Wavelets&lt;/i&gt;,&lt;i&gt; Multiresolution and Infor&lt;/i&gt;&lt;i&gt;m&lt;/i&gt;&lt;i&gt;a&lt;/i&gt;&lt;i&gt;tion Processing&lt;/i&gt;, 13, Article ID: 1550021. &lt;br&gt;https://doi.org/10.1142/S0219691315500216</mixed-citation></ref><ref id="scirp.132553-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H. and Zhang, H.Z. (2020) Performance Analysis of the LapRSSLG Algorithmin Learning Theory. &lt;i&gt;Analysis and Applications&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;18, 79-108. &lt;br&gt;https://doi.org/10.1142/S0219530519410033</mixed-citation></ref><ref id="scirp.132553-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Wang, K.Y. and Li, L.Q. (2000) Harmonic Analysis and Approximation on the Unit Sphere. Science Press, Beijing.</mixed-citation></ref><ref id="scirp.132553-ref30"><label>30</label><mixed-citation publication-type="other" xlink:type="simple">Delgado, A.M., Fern&amp;#225;ndez, L., Lubinsky, D., P&amp;#233;rez, T.E. and Pi&amp;#241;ar, M.A. (2016) Sobolev Orthogonal Polynomials on the Unit Ball &lt;i&gt;via&lt;/i&gt; Outward Normal Derivatives. &lt;i&gt;Jou&lt;/i&gt;&lt;i&gt;r&lt;/i&gt;&lt;i&gt;nal &lt;/i&gt;&lt;i&gt;of Mathematical Analysis and Applications&lt;/i&gt;, 440, 716-740. &lt;br&gt;https://doi.org/10.1016/j.jmaa.2016.03.041</mixed-citation></ref><ref id="scirp.132553-ref31"><label>31</label><mixed-citation publication-type="other" xlink:type="simple">Jordao, T. and Menegatto, V.A. (2012) Reproducing Properties of Differentiable Mercer-Like Kernels on the Sphere. &lt;i&gt;Numerical Functional Analysis and Optimization&lt;/i&gt;, 33, 1221-1243. &lt;br&gt;https://doi.org/10.1080/01630563.2012.660590</mixed-citation></ref><ref id="scirp.132553-ref32"><label>32</label><mixed-citation publication-type="other" xlink:type="simple">Castro, M.H., Menegatto, V.A. and Oliveira, C.P. (2013) Laplace-Beltrami Differentiability of Positive Definite Kernels on the Sphere. &lt;i&gt;Acta Mathematica Sinica&lt;/i&gt;,&lt;i&gt; Eng&lt;/i&gt;&lt;i&gt;lish Series&lt;/i&gt;, 29, 93-104. &lt;br&gt;https://doi.org/10.1007/s10114-012-1067-2</mixed-citation></ref><ref id="scirp.132553-ref33"><label>33</label><mixed-citation publication-type="other" xlink:type="simple">Ferreira, J.C. and Menegatto, V.A. (2013) Positive Definiteness, Reproducing Kernel Hilbert Spaces and Beyond.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Annals of Functional Analysis&lt;/i&gt;, 4, 64-88. &lt;br&gt;https://doi.org/10.15352/afa/1399899838</mixed-citation></ref><ref id="scirp.132553-ref34"><label>34</label><mixed-citation publication-type="other" xlink:type="simple">Sun, H.W. and Wu, Q. (2011) Least Square Regression with Independent Kernels and Coefficient Regularization. &lt;i&gt;Applied and Computational Harmonic Analysis&lt;/i&gt;, 30, 96-109. &lt;br&gt;https://doi.org/10.1016/j.acha.2010.04.001</mixed-citation></ref><ref id="scirp.132553-ref35"><label>35</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, J., Wang, J.L. and Sheng, B.H. (2011) Learning from Regularized Regression Algorithms with P-Order Markov Chain Sampling. &lt;i&gt;Applied Mathematics&lt;/i&gt;: &lt;i&gt;A Journal of Chinese Universities&lt;/i&gt;,&lt;i&gt; &lt;/i&gt;226, 295-306. &lt;br&gt;https://doi.org/10.1007/s11766-011-2701-y</mixed-citation></ref><ref id="scirp.132553-ref36"><label>36</label><mixed-citation publication-type="other" xlink:type="simple">Sheng, B.H. and Wang, J.L. (2024) Moduli of Smoothness, &lt;i&gt;K&lt;/i&gt;-Functionals and Jackson-Type Inequalities Associated with Kernel Function Approximation in Learning Theory.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Analysis and Applications&lt;/i&gt;. &lt;br&gt;https://doi.org/10.1142/S021953052450009X</mixed-citation></ref><ref id="scirp.132553-ref37"><label>37</label><mixed-citation publication-type="other" xlink:type="simple">Dai, F. and Xu, Y. (2013) Approximation Theory and Harmonic Analysis on Spheres and Balls. Springer-Verlag, New York. &lt;br&gt;https://doi.org/10.1007/978-1-4614-6660-4</mixed-citation></ref><ref id="scirp.132553-ref38"><label>38</label><mixed-citation publication-type="other" xlink:type="simple">Bauschke, H.H. and Combettes, P.L. (2010) Convex Analysis and Monotone Operator Theory in Hilbert Spaces.&lt;i&gt; &lt;/i&gt;Springer-Verlag, New York. &lt;br&gt;https://doi.org/10.1007/978-1-4419-9467-7</mixed-citation></ref><ref id="scirp.132553-ref39"><label>39</label><mixed-citation publication-type="other" xlink:type="simple">Aronszajn, N. (1950) Theory of Reproducing Kernels. &lt;i&gt;Transactions of the American Mathematical Society&lt;/i&gt;, 68, 337-404. &lt;br&gt;https://doi.org/10.1090/S0002-9947-1950-0051437-7</mixed-citation></ref><ref id="scirp.132553-ref40"><label>40</label><mixed-citation publication-type="other" xlink:type="simple">Ferreira, J.C. and Menegatto, V.A. (2012) Reproducing Properties of Differentiable Mercer-Like Kernels.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Mathematische Nachrichten&lt;/i&gt;, 285, 959-973. &lt;br&gt;https://doi.org/10.1002/mana.201100072</mixed-citation></ref><ref id="scirp.132553-ref41"><label>41</label><mixed-citation publication-type="other" xlink:type="simple">Smale, S. and Zhou, D.X. (2007) Learning Theory Estimates &lt;i&gt;via&lt;/i&gt; Integral Operators and Their Applications. &lt;i&gt;Constructive Approximation&lt;/i&gt;, 26, 153-172. &lt;br&gt;https://doi.org/10.1007/s00365-006-0659-y</mixed-citation></ref></ref-list></back></article>