<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AJOR</journal-id><journal-title-group><journal-title>American Journal of Operations Research</journal-title></journal-title-group><issn pub-type="epub">2160-8830</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ajor.2012.21011</article-id><article-id pub-id-type="publisher-id">AJOR-17836</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Optimizing Forest Sampling by Using Lagrange Multipliers
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Kitikidou</surname><given-names>Kyriaki</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Department of Forestry and Management of the Environment and Natural Resources, Democritus University of Thrace, Orestiada, Greece</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>kkitikid@fmenr.duth.gr</email></corresp></author-notes><pub-date pub-type="epub"><day>14</day><month>03</month><year>2012</year></pub-date><volume>02</volume><issue>01</issue><fpage>94</fpage><lpage>99</lpage><history><date date-type="received"><day>22</day><month>07</month><year>2011</year></date><date date-type="rev-recd"><day>20</day><month>08</month><year>2011</year></date><date date-type="accepted"><day>10</day><month>09</month><year>2011</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  In two-phase sampling, or double sampling, we draw a relatively large sample of size n from a population of size N. From this sample we then draw a small sub-sample of size m, which usually costs more per sample unit than the first one. In double sampling with regression estimators, the first-phase sample of size n is used to estimate the mean of an auxiliary variable X, which should be strongly related to the main variable Y (which is estimated from the sub-sample of size m). Sampling can be optimized either by minimizing the cost C for a fixed variance of the estimator of Y, or by minimizing that variance for a fixed C. In this paper we optimize sampling using Lagrange multipliers under both formulations: minimum variance at a predetermined cost, and minimum cost at a predetermined variance.
 
</p></abstract><kwd-group><kwd>Forest Inventories</kwd><kwd>Lagrange Multipliers</kwd><kwd>Optimization</kwd><kwd>Sampling</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>All decision-making requires information. In forestry, this information is acquired by means of forest inventories, systems for measuring the extent, quantity and condition of forests [<xref ref-type="bibr" rid="scirp.17836-ref1">1</xref>]. More specifically, the purpose of forest inventories is to estimate means and totals for measures of forest characteristics over a defined area. Such characteristics include the volume of the growing stock, the area of a certain type of forest and, more recently, measures of forest biodiversity, e.g. the volume of dead wood or vegetation.</p><p>The main method used in forest inventories in the 19<sup>th</sup> century was complete enumeration, but it was soon noted that costs could be reduced by using representative samples [<xref ref-type="bibr" rid="scirp.17836-ref2">2</xref>]. Sampling-based methods were used in forestry a century before the mathematical foundations of sampling techniques were described [3-9]. In this paper we attempt to optimize sampling using Lagrange multipliers, either by minimizing the variance of the forest variable of interest at a fixed cost, or by minimizing the cost at a fixed variance of that variable.</p></sec><sec id="s2"><title>2. The Method of Lagrange Multipliers</title><p>The method of Lagrange multipliers is used to find maxima or minima of a function of possibly several variables, subject to one or more constraints [<xref ref-type="bibr" rid="scirp.17836-ref10">10</xref>]. 
This method, which is due to Joseph Louis de Lagrange (1736-1813), is used to optimize a real-valued function <img src="11-1040036\8f995cb4-d9d0-4bdc-ab18-8dd39969317d.jpg" />, where x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\f01fa0bc-a9b5-4060-a1ce-2446b227c9a1.jpg" />, x<sub>n</sub> are subject to m (&lt;n) equality constraints of the form</p><disp-formula id="scirp.17836-formula22257"><label>(1)</label><graphic position="anchor" xlink:href="11-1040036\8d47931c-f460-4296-9220-304664622d27.jpg"  xlink:type="simple"/></disp-formula><p>where g<sub>1</sub>, g<sub>2</sub>, <img src="11-1040036\40cae00a-9e12-4863-a34f-2a89237e9c46.jpg" />, g<sub>m</sub> are differentiable functions.</p><p>The stationary points of this constrained optimization problem are determined by first considering the function</p><disp-formula id="scirp.17836-formula22258"><label>(2)</label><graphic position="anchor" xlink:href="11-1040036\320bc463-ee70-48e2-ab4e-be878ef843af.jpg"  xlink:type="simple"/></disp-formula><p>where <img src="11-1040036\4b93e4b8-bd3e-4828-a198-60f85b182f8e.jpg" /> and λ<sub>1</sub>, λ<sub>2</sub>, <img src="11-1040036\693cadeb-512a-4631-b1a8-a8beaef19bdf.jpg" />, λ<sub>m</sub> are scalars called Lagrange multipliers. By differentiating (2) with respect to x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\50a0e40b-d99d-4931-aa2d-ba07a0f6047a.jpg" />, x<sub>n</sub> and equating the partial derivatives to zero we obtain</p><disp-formula id="scirp.17836-formula22259"><label>(3)</label><graphic position="anchor" xlink:href="11-1040036\841bdb51-e3c8-42dd-aa09-260ec9649ce2.jpg"  xlink:type="simple"/></disp-formula><p>Equations (1) and (3) consist of m + n equations in m + n unknowns, namely, x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\b2904159-46da-42f7-bc18-d3daf450fde5.jpg" />, x<sub>n</sub>; λ<sub>1</sub>, λ<sub>2</sub>, <img src="11-1040036\f1c79276-c42d-4e24-a046-63ff710c0b33.jpg" />, λ<sub>m</sub>. 
The solutions for x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\ec7ee8f7-47da-41b1-82a9-a1d7e4ca429b.jpg" />, x<sub>n</sub> determine the locations of the stationary points. The following argument explains why this is the case.</p><p>Suppose that in Equation (1) we can solve for m of the x<sub>i</sub>’s, for example, x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\fcae647a-c157-462c-8b6e-8230f9612f87.jpg" />, x<sub>m</sub>, in terms of the remaining n – m variables. By the Implicit Function Theorem (see Appendix 1), this is possible whenever</p><disp-formula id="scirp.17836-formula22260"><label>(4)</label><graphic position="anchor" xlink:href="11-1040036\4f10d356-c407-4a8e-9e83-6330632d5bb7.jpg"  xlink:type="simple"/></disp-formula><p>In this case, we can write</p><disp-formula id="scirp.17836-formula22261"><label>(5)</label><graphic position="anchor" xlink:href="11-1040036\9c8e104b-217d-4bec-bd6c-dfbbdf047d59.jpg"  xlink:type="simple"/></disp-formula><p>Thus f(x) is a function of only n – m variables, namely, x<sub>m+1</sub>, x<sub>m+2</sub>, <img src="11-1040036\6f3747f3-0323-4517-94dc-695b85cf4de6.jpg" />, x<sub>n</sub>. 
If the partial derivatives of f with respect to these variables exist and if f has a local optimum, then these partial derivatives must necessarily vanish, that is,</p><disp-formula id="scirp.17836-formula22262"><label>(6)</label><graphic position="anchor" xlink:href="11-1040036\5a30feb6-80ee-4c97-9451-b5bb4cfa297d.jpg"  xlink:type="simple"/></disp-formula><p>Now, if Equations (5) are used to substitute h<sub>1</sub>, h<sub>2</sub>, <img src="11-1040036\2795d107-52b6-48d6-899f-27f29d79312a.jpg" />, h<sub>m</sub> for x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\dbd2e3c1-7af5-4c7e-be06-91a62a607444.jpg" />, x<sub>m</sub>, respectively, in Equation (1), then we obtain the identities</p><p><img src="11-1040036\00383503-f6ee-4b58-927e-34da70fa9c49.jpg" />.</p><p>By differentiating these identities with respect to x<sub>m+1</sub>, x<sub>m+2</sub>, <img src="11-1040036\4de3d94a-d4a6-49a8-9952-dd2367da4118.jpg" />, x<sub>n</sub> we obtain</p><disp-formula id="scirp.17836-formula22263"><label>(7)</label><graphic position="anchor" xlink:href="11-1040036\19f00c3c-224a-4414-a424-73741ae0646d.jpg"  xlink:type="simple"/></disp-formula><p>Let us now define the vectors</p><p><img src="11-1040036\f1975f9d-bb37-4435-98de-86da64c45107.jpg" /></p><p><img src="11-1040036\ddea2c8e-31b2-4926-8610-3e53337fe93c.jpg" /></p><p><img src="11-1040036\49a08ae2-6f47-451e-89c9-a310db94112e.jpg" /></p><p><img src="11-1040036\4d67ac00-b1c2-4cb3-b22b-6c37492feb6c.jpg" /></p><p><img src="11-1040036\832f1440-4c59-4c9e-87d8-2b8413e9364b.jpg" />.</p><p>Equations (6) and (7) can then be written as</p><disp-formula id="scirp.17836-formula22264"><label>(8)</label><graphic position="anchor" xlink:href="11-1040036\8598963b-5b1d-4ba1-820a-38b6c13aacc9.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17836-formula22265"><label>(9)</label><graphic position="anchor" xlink:href="11-1040036\d347efeb-2460-4a86-9740-20c850f2540c.jpg"  
xlink:type="simple"/></disp-formula><p>where <img src="11-1040036\3c257b62-73a5-4b41-ba1c-efeca8ac513a.jpg" />, which is a nonsingular m &#215; m matrix if condition (4) holds.</p><p>From Equation (8) we have</p><p><img src="11-1040036\c1274792-71fc-4368-b20f-e6f12fecbf16.jpg" /></p><p>By making the proper substitution in Equation (9) we obtain</p><disp-formula id="scirp.17836-formula22266"><label>(10)</label><graphic position="anchor" xlink:href="11-1040036\4a7a585d-c644-42d9-86cd-90a30b9936da.jpg"  xlink:type="simple"/></disp-formula><p>where</p><disp-formula id="scirp.17836-formula22267"><label>(11)</label><graphic position="anchor" xlink:href="11-1040036\bbd3f9c7-1621-400b-976f-8a878009c483.jpg"  xlink:type="simple"/></disp-formula><p>Equations (10) can then be expressed as</p><disp-formula id="scirp.17836-formula22268"><label>(12)</label><graphic position="anchor" xlink:href="11-1040036\d90dd50d-6589-4953-ba7f-6d871297b27a.jpg"  xlink:type="simple"/></disp-formula><p>From Equation (11) we also have</p><disp-formula id="scirp.17836-formula22269"><label>(13)</label><graphic position="anchor" xlink:href="11-1040036\46e21723-547c-4e2d-b214-10238514c715.jpg"  xlink:type="simple"/></disp-formula><p>Equations (12) and (13) can now be combined into a single vector equation of the form</p><p><img src="11-1040036\3c594d8a-3629-4c0b-aff1-85128aeaee62.jpg" /> which is the same as Equation (3). We conclude that at a stationary point of f, the values of x<sub>1</sub>, x<sub>2</sub>, <img src="11-1040036\a64e96c1-fe5e-4806-9d76-69d4953528aa.jpg" />, x<sub>n</sub> and the corresponding values of λ<sub>1</sub>, λ<sub>2</sub>, <img src="11-1040036\4a221ba2-5a6c-494e-b06d-ffd4f9f1eeab.jpg" />, λ<sub>m</sub> must satisfy Equations (1) and (3).</p></sec><sec id="s3"><title>3. Lagrange Multipliers in Sampling Optimization</title><p>In two-phase sampling, or double sampling, we draw a relatively large sample of size n from a population of size N. 
From this sample we then draw a small sub-sample of size m, which usually costs more per sample unit than the first one. In double sampling with regression estimators, the first-phase sample of size n is used to estimate the mean of an auxiliary variable X, <img src="11-1040036\67d00051-9a48-40ff-98fa-076db7e43c59.jpg" />, which should be strongly related to the main variable Y.</p><p>In the sub-sample of size m both the auxiliary variable X and the main variable Y are measured, in order to estimate their means <img src="11-1040036\523c581d-c3d3-43b9-b964-fcd5cc425d44.jpg" /> and <img src="11-1040036\c2f15f1d-9d47-47b2-b20a-ce334ef0ef50.jpg" />, respectively. The regression estimator <img src="11-1040036\d280f6b9-ae3e-4423-ae6e-649b036a891a.jpg" /> and its estimated variance <img src="11-1040036\03309b99-6061-480c-b023-402dc74ca169.jpg" /> are [7,11-13]:</p><p><img src="11-1040036\6b2d5a32-2325-4c1a-a3f7-62ad97a97f91.jpg" /></p><p><img src="11-1040036\befd43c9-7b97-474d-a9a0-72eb1960cfe3.jpg" /></p><p>where <img src="11-1040036\383b2f08-10d1-48b2-807d-dec709cd40a6.jpg" /> is the variance of Y in the sub-sample of size m, and r is the estimated correlation coefficient between X and Y.</p><p>An approximate cost function could be</p><p><img src="11-1040036\2af47f60-b230-4ca3-a6a1-8d34741375ec.jpg" /> where C: total sampling cost;</p><p>C<sub>1</sub>: sampling cost of the first phase;</p><p>C<sub>2</sub>: sampling cost of the second phase.</p><p>Sampling can be optimized by minimizing the cost C for a fixed <img src="11-1040036\735f2fe2-19b2-4498-a949-da001f74e133.jpg" />, applying the following procedure:</p><p>We assume an approximately normal distribution of <img src="11-1040036\e6998b52-9542-4cde-a4dd-4db8855015ea.jpg" />, so that the 95% confidence interval for <img src="11-1040036\40386deb-da93-4188-a3e4-02bd9329b65c.jpg" /> would be:</p><p><img src="11-1040036\17dcd892-e731-4cb7-879a-c21742059975.jpg" /> where <img 
src="11-1040036\b0bc3a4f-16ce-4d26-9544-16a7165aad1c.jpg" />.</p><p>Now we must choose n and m so that the half-width of the confidence interval does not exceed a value D fixed a priori, where D may also be expressed as a fraction (E) of <img src="11-1040036\333f3c09-748a-48eb-85dc-a7b2daba7d9d.jpg" />:</p><p><img src="11-1040036\2008ea5d-81e8-4be1-bbce-62187bd47af5.jpg" /></p><p>To this end we construct the Lagrange function:</p><p><img src="11-1040036\5487d35a-c002-4fc0-b44e-7da4fdca93e0.jpg" /></p><p><img src="11-1040036\f3ddf744-6198-4378-be03-f9368eaed70b.jpg" /></p><p>Recall that</p><p><img src="11-1040036\fb318cd3-45c1-4bca-b3ec-711afb94f6c1.jpg" /> so</p><disp-formula id="scirp.17836-formula22270"><label>(14)</label><graphic position="anchor" xlink:href="11-1040036\50e2fab9-7a30-48e8-a371-eb9ce916f867.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17836-formula22271"><label>(15)</label><graphic position="anchor" xlink:href="11-1040036\b68a167d-1535-4f80-87eb-45b6cb72c108.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17836-formula22272"><label>(16)</label><graphic position="anchor" xlink:href="11-1040036\c21f3a79-7ace-4390-b0da-3c09d130c532.jpg"  xlink:type="simple"/></disp-formula><p>Solving the system of Equations (14), (15) and (16) we find n, m and λ. The reverse problem, viz. minimizing <img src="11-1040036\0c1f1dcc-f321-4b70-b252-7b0e80f52c91.jpg" /> for fixed C, is solved in a similar way.</p><p>To illustrate how Lagrange multipliers work, consider the following example. Assume that the total cost C of a firm producing quantities x and y of two products is given by the equation <img src="11-1040036\ac6edd57-3e40-4da3-aaf3-8d25414e30e4.jpg" />. Total production is constrained to 20 units, that is, <img src="11-1040036\bd4a4d27-f474-4ab3-b721-0803e5c20499.jpg" />. 
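As a hedged numerical aside, the sketch below works through a problem of this form. The cost function used, C = 3x<sup>2</sup> – xy + 6y<sup>2</sup>, is an assumption: the original formula is available only as an image, and this form was chosen because it reproduces the values reported for this example (x = 13, y = 7, λ = –71, total cost 710). The sketch also assumes that the Lagrange function is written as C + λ(x + y – 20); the function names are hypothetical.
<preformat>
```python
# Hypothetical check of the two-product example. The cost function below is an
# ASSUMPTION (the original formula is only available as an image); it was
# chosen because it reproduces the values reported in the text.


def cost(x, y):
    """Assumed total cost of producing x and y units of the two products."""
    return 3 * x**2 - x * y + 6 * y**2


def constrained_minimum(total):
    """Minimize cost(x, y) subject to x + y = total with a Lagrange multiplier.

    With L = cost(x, y) + lam * (x + y - total), the stationarity conditions are
        6x -   y + lam = 0
        -x + 12y + lam = 0
         x +   y       = total
    Subtracting the first two gives 7x = 13y, hence x = 13 * total / 20.
    """
    x = 13 * total / 20
    y = total - x
    lam = y - 6 * x  # from the first stationarity condition
    return x, y, lam


x, y, lam = constrained_minimum(20)
print("x =", x, "y =", y, "lambda =", lam, "C =", cost(x, y))

# Re-solving with 19 units shows how closely |lambda| predicts the saving.
x19, y19, _ = constrained_minimum(19)
print("saving for 19 units:", cost(x, y) - cost(x19, y19))
```
</preformat>
Under the assumed cost function, the exact saving when production falls from 20 to 19 units is about 69.2 rather than 71, which illustrates that λ is a marginal, first-order rate of change. 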
For this problem we have:</p><p><img src="11-1040036\9ba611e6-3e05-4cbd-af06-fee957adc1ee.jpg" /></p><disp-formula id="scirp.17836-formula22273"><label>(a)</label><graphic position="anchor" xlink:href="11-1040036\1f2eedcf-0ada-4a70-a968-64cc02b8f560.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17836-formula22274"><label>(b)</label><graphic position="anchor" xlink:href="11-1040036\e7584927-bb6c-4cc0-b332-db2334b5ecd5.jpg"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.17836-formula22275"><label>(c)</label><graphic position="anchor" xlink:href="11-1040036\d9853b08-75d1-409d-9496-e9b0dbf0f9cd.jpg"  xlink:type="simple"/></disp-formula><p>Solving the system of Equations (a), (b) and (c), we find that x = 13, y = 7 and λ = –71. Consequently, <img src="11-1040036\f7094a52-1068-4051-8f31-cca4a6d4f68f.jpg" />.</p><p>The economic meaning of λ is as follows: its magnitude is the approximate reduction in the minimum total cost if production were 19 instead of 20 units. In other words, if 19 total production units were required, the total cost would fall by about 71 monetary units (710 – 71 = 639). In general, λ represents the marginal effect on the cost function of changing the production constraint by one unit.</p></sec><sec id="s4"><title>4. Other Uses of Lagrange Multipliers in Forest Inventories</title><p>If there are not enough sample plots to give sufficiently good inventory results using only forest measurements, we may try to make use of auxiliary variables correlated with forest variables. The most obvious way is to use ratio or regression estimators (see Appendix 2). The calibration estimator of Deville and S&#228;rndal [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>] is an extension of the regression estimator for obtaining population totals using auxiliary information. Both regression and calibration estimators can be employed if auxiliary variables are known for the inventory sample plots and their population totals are also known, e.g. 
variables obtained from remote sensing or from GIS. The appeal of calibration estimators for forest inventories comes from the fact that they lead to estimators which are weighted sums of the sample plot variables, where the weight can be interpreted as the area of forest in the population that is similar to the sample plot.</p><p>The basic features of the calibration estimator of Deville and S&#228;rndal [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>] in terms of estimating means can be described as follows. Consider a finite population U consisting of N units. Let j denote a general unit, thus <img src="11-1040036\52497ecb-4028-448e-a832-d5c8e6328f41.jpg" />. In a forest inventory the population is a region where units are pixels or potential sample plots. The units in a forest inventory will be referred to here as pixels, and it will be assumed that an inventory sample plot gives values to the forest variables for an associated pixel. Each unit j is associated with a variable y<sub>j</sub> and a vector of auxiliary variables x<sub>j</sub>. The population mean of x, <img src="11-1040036\f7a818e6-c761-4193-88d7-0b11597d2bbc.jpg" />, is assumed to be known. The y variables in a forest inventory are forest variables and the x variables can be spectral variables from remote sensing or geographical or climatic variables obtained from GIS databases.</p><p>Assume that a probability sample S is drawn, and y<sub>j</sub> and x<sub>j</sub> are observed for each j in S, the objective being to estimate the mean of y, <img src="11-1040036\494a9a1c-6f62-446a-b0bb-0c9508ecb6f8.jpg" />. 
Let π<sub>j</sub> be the inclusion probability and d<sub>j</sub> the basic sampling design weight <img src="11-1040036\1e44f47b-d462-4b36-8978-49bf254ac92e.jpg" />, which can be used to compute the unbiased Horvitz-Thompson estimator</p><p><img src="11-1040036\beb6eca3-6ea9-47b0-93ad-1ff2fd950670.jpg" />.</p><p>A calibration estimator</p><disp-formula id="scirp.17836-formula22276"><label>(17)</label><graphic position="anchor" xlink:href="11-1040036\773902cb-ef0c-4455-b245-55d338c93206.jpg"  xlink:type="simple"/></disp-formula><p>is obtained by minimizing the sum of distances,</p><p><img src="11-1040036\059d70d8-4db9-47ff-ae4e-4612362fa0f8.jpg" />, between the prior weights d<sub>j</sub> and posterior weights w<sub>j</sub> for a positive distance function G, subject to the calibration equation</p><disp-formula id="scirp.17836-formula22277"><label>(18)</label><graphic position="anchor" xlink:href="11-1040036\a8ec38a0-3664-476e-8616-de9bce53b505.jpg"  xlink:type="simple"/></disp-formula><p>If the distance between d<sub>j</sub> and w<sub>j</sub> is defined as</p><p><img src="11-1040036\14c71fb5-a635-4b12-8dbc-178807f09f6e.jpg" /> the calibration estimator will be the same as the regression estimator</p><disp-formula id="scirp.17836-formula22278"><label>(19)</label><graphic position="anchor" xlink:href="11-1040036\a74c1bd4-31b7-4f29-95bc-3e5488950885.jpg"  xlink:type="simple"/></disp-formula><p>where <img src="11-1040036\92775e44-dcfa-43cb-9307-91c085150bc3.jpg" /> and <img src="11-1040036\777bd386-4be7-486e-99b7-3e1da8c48e96.jpg" /> (a weighted regression coefficient vector) are</p><disp-formula id="scirp.17836-formula22279"><label>(20)</label><graphic position="anchor" xlink:href="11-1040036\247f1b16-52b9-4959-ae70-77bcfa8c2a99.jpg"  xlink:type="simple"/></disp-formula><p>and</p><disp-formula id="scirp.17836-formula22280"><label>
(21)</label><graphic position="anchor" xlink:href="11-1040036\61ed61f6-addc-45b2-8c47-f85e9556dd12.jpg"  xlink:type="simple"/></disp-formula><p>If the model contains an intercept, the corresponding variable x will be one for all observations, and the calibration Equation (18) will then guarantee that the weights w<sub>j</sub> add up to one. This means that when estimating totals, the weights Nw<sub>j</sub> will add up to the known total number of pixels in the population. Thus Nw<sub>j</sub> can be interpreted as the total area, in pixel units, for plots of forest similar to plot j. Standard least squares theory implies that the regression estimator (19) can be expressed in the form</p><disp-formula id="scirp.17836-formula22281"><label>(22)</label><graphic position="anchor" xlink:href="11-1040036\5d7a1e92-9ec6-4ec4-ad14-11cde9f145b1.jpg"  xlink:type="simple"/></disp-formula><p>It is assumed that the intercept is always among the parameters. Estimator (21) is defined if the moment matrix <img src="11-1040036\8e1dc070-37c0-46d1-a375-9a55031b6391.jpg" /> is non-singular.</p><p>Some of the weights w<sub>j</sub> in (17) implied by Equations (20)-(22) may be negative. Nonnegative weights are guaranteed if the distance function is infinite for negative w<sub>j</sub>. Deville and S&#228;rndal [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>] presented four distance functions producing positive weights.</p><p>Minimizing the sum <img src="11-1040036\1fbb0514-1c85-4353-af58-bd76938af49c.jpg" /> subject to (18) is a non-linear constrained minimization problem. Using Lagrange multipliers, the problem can be reformulated as a non-linear system of equations which can be solved iteratively using Newton’s method [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>]. 
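For a quadratic distance the Lagrange conditions are linear and no iteration is needed. The following minimal sketch is illustrative only and is not taken from the cited source: it assumes the distance (w<sub>j</sub> – d<sub>j</sub>)<sup>2</sup>/(2d<sub>j</sub>), a single auxiliary variable plus an intercept, equal prior weights d<sub>j</sub> = 1/n, and made-up plot data; the function name calibrate_linear is hypothetical.
<preformat>
```python
# Hypothetical illustration: calibration weights for the quadratic distance
# sum_j (w_j - d_j)**2 / (2 * d_j), with auxiliary vector (1, x_j).
# Setting the derivative of the Lagrange function to zero gives
#     w_j = d_j * (1 + lam0 + lam1 * x_j),
# and the calibration equations sum_j w_j = 1 and sum_j w_j * x_j = x_bar_pop
# determine the two Lagrange multipliers lam0 and lam1.


def calibrate_linear(x, d, x_bar_pop):
    """Return calibrated weights for one auxiliary variable plus intercept."""
    # Moments of the design-weighted sample.
    s0 = sum(d)
    s1 = sum(dj * xj for dj, xj in zip(d, x))
    s2 = sum(dj * xj * xj for dj, xj in zip(d, x))
    # Calibration equations, linear in (lam0, lam1):
    #     s0 * lam0 + s1 * lam1 = 1 - s0
    #     s1 * lam0 + s2 * lam1 = x_bar_pop - s1
    det = s0 * s2 - s1 * s1  # zero only if all x_j are equal
    b0 = 1.0 - s0
    b1 = x_bar_pop - s1
    lam0 = (b0 * s2 - b1 * s1) / det  # Cramer's rule
    lam1 = (s0 * b1 - s1 * b0) / det
    return [dj * (1.0 + lam0 + lam1 * xj) for dj, xj in zip(d, x)]


# Made-up data: five sample plots with equal prior weights 1/n.
x_sample = [10.0, 12.0, 15.0, 19.0, 24.0]
y_sample = [110.0, 118.0, 160.0, 190.0, 250.0]
d_prior = [0.2] * 5
w = calibrate_linear(x_sample, d_prior, x_bar_pop=18.0)
y_cal = sum(wj * yj for wj, yj in zip(w, y_sample))
print("weights:", [round(wj, 4) for wj in w])
print("calibrated mean of y:", round(y_cal, 2))
```
</preformat>
Under these assumptions the calibrated mean coincides with the simple linear regression estimator described in Appendix 2. 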
If the initial values of the Lagrange multipliers are set to zero, the first step will produce the w<sub>j</sub>’s of the regression estimator (19).</p><p>Since the calibration estimator is asymptotically equivalent to the regression estimator, Deville and S&#228;rndal [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>] suggest that the variance of the calibration estimator should be computed in the same way as the variance of the regression estimator using regression residuals. There is no design-unbiased estimator of the variance in systematic sampling [<xref ref-type="bibr" rid="scirp.17836-ref7">7</xref>].</p><p>The emphasis on the area interpretation of the weights rests on the same argument as that used by Moeur and Stage [<xref ref-type="bibr" rid="scirp.17836-ref14">14</xref>] for the most similar neighbour method (MSN), where unknown plot variables are taken from a plot which is as similar as possible with respect to the known plot variables. In both methods each sample plot represents a percentage of the total area, and all the forest variables are logically related to each other. The difference is that in the calibration estimator we obtain an estimate of the area of the sample plot for the whole population, whereas in the MSN method each pixel is associated with a sample plot. Since there is no straightforward way of showing that the MSN method produces optimal results in any sense at the population level, it may be safer to use the calibration estimator for computing population-level estimates for forest variables. The problem with the calibration estimator is that it does not provide a map. If a map is needed, then the weights provided by the calibration estimator need to be distributed over pixels using separate post-processing.</p><p>Lappi [<xref ref-type="bibr" rid="scirp.17836-ref15">15</xref>] proposed a small-area modification of the calibration estimator which can be used when several subpopulation totals are required simultaneously. 
He used satellite data as auxiliary information for computing inventory results for counties. Sample plots in the surrounding inclusion zone are also used for a given subpopulation so that the prior weight decreases as distance increases. The error variance is computed using a spatial variogram model. Block kriging [<xref ref-type="bibr" rid="scirp.17836-ref16">16</xref>] provides an optimal estimator for subpopulation totals under such a model, but kriging can produce negative weights for sample plots, and the weights are different for each y variable. Thus it is not possible to give areal interpretations to sample plot weights in kriging.</p></sec><sec id="s5"><title>REFERENCES</title></sec><sec id="s6"><title>Appendices</title>Appendix 1. Implicit Function Theorem<p>Let <img src="11-1040036\46ad1be1-80da-4f52-8ce3-08c08ae349db.jpg" />, where D is an open subset of R<sup>m+n</sup>, and g has continuous first-order partial derivatives in D. If there is a point <img src="11-1040036\cb442dae-6508-407e-94b5-8c61370ea509.jpg" />, where <img src="11-1040036\f8bc9055-5a6f-4ccf-b1a9-0c3113776e5d.jpg" />, with <img src="11-1040036\7b073f42-1edc-4099-9aa8-066ea01ac75a.jpg" />, <img src="11-1040036\daf07369-3fa9-481f-ab5f-91dfd862f5b2.jpg" /> such that <img src="11-1040036\57116bc7-5fd4-4c82-a9ff-5f2bc0cc1dcd.jpg" />, and if at z<sub>0</sub>,</p><p><img src="11-1040036\0e966d18-8865-4aad-b4ef-c4295402308c.jpg" /> where g<sub>i</sub> is the ith element of <img src="11-1040036\438b8b31-1fd2-42ef-b965-67c2921e96fc.jpg" />, then there is a neighborhood <img src="11-1040036\d59e73c7-04bf-4db9-b176-61dd4371f18a.jpg" /> of y<sub>0</sub> in which the equation <img src="11-1040036\ec7be591-f274-4c8d-b71c-7ecb81a2ee2b.jpg" /> can be solved uniquely for x as a continuously differentiable function of y.</p>Appendix 2. Ratio and Regression Estimators<p>In a stratified inventory, information on some auxiliary variables is used both to plan the sampling design (e.g. 
allocation) and for estimation, or only for estimation (post-stratification). Stratification is not the only way to use auxiliary information, however, as it can also be used at the design stage, e.g. in sampling proportional to size (see Appendix 3). It can also be used at the estimation stage in ratio or regression estimators, so that the standard error of the estimators can be reduced using information on a variable x which is known for each sampling unit in the population. The estimation is based on the relationship between the variables x and y. In ratio estimation, a model that goes through the origin is applied. If this model does not apply, the regression estimator is more suitable. The ratio estimator for the mean is</p><p><img src="11-1040036\6cbfcd6f-0572-4f0d-b2a4-d012aeb323f4.jpg" /> where <img src="11-1040036\7a010f76-9ba1-47bc-b2fb-419f810e0b23.jpg" /> is the mean of a variable x in the population and <img src="11-1040036\7826837a-28bd-4a1f-a9d5-c04cc7bb30ce.jpg" /> is its mean in the sample. Ratio estimators are usually biased, and thus the root mean square error (RMSE) should be used instead of the standard error. The relative bias nevertheless decreases as a function of sample size, so that in large samples (more than 30 units) the accuracy of the mean estimator can be approximated as [<xref ref-type="bibr" rid="scirp.17836-ref1">1</xref>]:</p><p><img src="11-1040036\54e38ef7-3578-4515-9ef0-8e92b0f6193d.jpg" />.</p><p>The ratio estimator is more efficient the larger the correlation between x and y relative to the ratio of the coefficients of variation CV. 
It is worthwhile using the ratio estimator if</p><p><img src="11-1040036\985c0c9b-ca0b-453f-b357-890ce48cda3c.jpg" />.</p><p>The (simple linear) regression estimator for the mean value is</p><p><img src="11-1040036\1408acb0-6800-4189-a70f-1beaed36e5ab.jpg" />where <img src="11-1040036\5e85b3ba-0a76-41b5-a855-3a76b44f6d11.jpg" /> is the OLS (Ordinary Least Squares) coefficient of x for the model, which predicts the population mean of y based on the sample means. In a sampling context, the constant of the model is not usually presented, but the formula for the constant, <img src="11-1040036\c0f5e775-2288-4b00-a928-48191d5994a3.jpg" />, is embedded in the equation. The model is more efficient the larger the correlation between x and y. The variance of the regression estimator can be estimated as</p><p><img src="11-1040036\c5ba46f6-4550-45f0-ba72-e813ff1262d2.jpg" />.</p>Appendix 3. Sampling with Probability Proportional to Size<p>The basic properties of sampling with arbitrary probabilities can also be utilized in sampling with probability proportional to size (PPS), such as sampling with a relascope. It is then assumed that unit i is selected with the probability kx<sub>i</sub>, where k is a constant and x is a covariate (diameter of a tree in relascope sampling). PPS sampling is more efficient the larger the correlation between x and y. For perfect correlation the variance in the estimator would be zero [<xref ref-type="bibr" rid="scirp.17836-ref7">7</xref>]. PPS sampling might even be less efficient than SRS (Simple Random Sampling), however, if the correlation were negative. 
This could be the case when multiple variables of interest are considered simultaneously, for example, when correlation with one variable (say volume) might give efficient estimates but the estimates for other variables (say health and quality) might not be so good.</p><p>In practice, PPS sampling can be performed by ordering the units, calculating the sum of their sizes (say <img src="11-1040036\a4d9f01f-5f36-4758-a727-25ea422618f5.jpg" />), and calculating <img src="11-1040036\8901a4c8-ae31-4f7a-89e2-e5a33c012b7c.jpg" />. The probability of a unit i being selected is then <img src="11-1040036\40695dd1-3a0d-4632-a884-47882255ee52.jpg" /> and a cumulative probability can be calculated for the ordered units. A random number r is then picked and each unit with a cumulative probability equal to (or just above) r, r + 1, r + 2, <img src="11-1040036\118da5ac-b20f-443e-822a-f5db3d114b21.jpg" />, r + n – 1 is selected for the sample. Every unit of size greater than <img src="11-1040036\e548ba2e-b881-4b50-99e5-54b46f9e64ea.jpg" /> is then selected with certainty.</p></sec></body><back><ref-list><title>References</title><ref id="scirp.17836-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">J. Penman, M. Gytarsky, T. Hiraishi, T. Krug, D. Kruger, R. Pipatti, L. Buendia, K. Miwa, T. Ngara, K. Tanabe and F. Wagner, “Good Practice Guidance for Land Use, Land-Use Change and Forestry,” Intergovernmental Panel on Climate Change: IPCC National Greenhouse Gas Inventories Program, Institute for Global Environmental Strategies (IGES) for the IPCC, Kanagawa, Japan, 2003.</mixed-citation></ref><ref id="scirp.17836-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">F. Loetsch, F. Z&#246;hrer and K. Haller, “Forest Inventory,” BLV Verlagsgesellschaft, 1973.</mixed-citation></ref><ref id="scirp.17836-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">I. 
Doig, “When the Douglas-Firs Were Counted: The Beginning of the Forest Survey,” Journal of Forest History, Vol. 20, 1976, pp. 20-27.</mixed-citation></ref><ref id="scirp.17836-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">W. Frayer and G. Furnival, “Forest Survey Sampling Designs—A History,” Journal of Forestry, Vol. 97, No. 12, 1999, pp. 4-10.</mixed-citation></ref><ref id="scirp.17836-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">T. Gregoire, “Roots of Forest Inventory in North America,” A Paper Presented at the A1 Inventory Working Group Session at the SAF National Convention Held at Richmond, VA, 25-28 October 1992.</mixed-citation></ref><ref id="scirp.17836-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">T. Honer and F. Hegyi, “Forest Inventory—Growth and Yield in Canada: Past, Present and Future,” The Forestry Chronicle, Vol. 66, 1990, pp. 112-117.</mixed-citation></ref><ref id="scirp.17836-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">H. Schreuder, T. Gregoire and G. Wood, “Sampling Methods for Multiresource Forest Inventory,” John Wiley and Sons, New York, 1993.</mixed-citation></ref><ref id="scirp.17836-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">R. Seppälä, “Forest Inventories and the Development of Sampling Methods,” Silva Fennica, Vol. 3, 1985, pp. 218-219.</mixed-citation></ref><ref id="scirp.17836-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">D. Van Hooser, N. Cost and H. Lund, “The History of Forest Survey Program in the United States,” In: G. Preto and B. 
Koch, Eds., Forest Resource Inventory and Monitoring and Remote Sensing Technology, Proceedings of the IUFRO Centennial Meeting in Berlin, 31 August-4 September 1992, Japan Society of Forest Planning Press, Tokyo University of Agriculture and Technology, Saiwaicho, Fuchu, Tokyo, Japan, 1992, pp. 19-27.</mixed-citation></ref><ref id="scirp.17836-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">S. Rao, “Optimization Theory and Application,” Wiley Eastern Ltd., New Delhi, 1984.</mixed-citation></ref><ref id="scirp.17836-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">W. Cochran, “Sampling Techniques—3rd Edition,” Wiley, New York, 1977.</mixed-citation></ref><ref id="scirp.17836-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">P. De Vries, “Sampling for Forest Inventory,” Springer, Berlin, 1986. doi:10.1007/978-3-642-71581-5</mixed-citation></ref><ref id="scirp.17836-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">K. Matis, “Sampling of Natural Resources,” Pigasos Publications, Thessaloniki, Greece, 2004.</mixed-citation></ref><ref id="scirp.17836-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">J. Deville and C. Särndal, “Calibration Estimators in Survey Sampling,” Journal of the American Statistical Association, Vol. 87, No. 418, 1992, pp. 376-382. 
doi:10.2307/2290268</mixed-citation></ref><ref id="scirp.17836-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">J. Lappi, “Forest Inventory of Small Areas Combining the Calibration Estimator and a Spatial Model,” Canadian Journal of Forest Research, Vol. 31, No. 9, 2001, pp. 1551-1560. doi:10.1139/x01-078</mixed-citation></ref><ref id="scirp.17836-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">N. Cressie, “Kriging Nonstationary Data,” Journal of the American Statistical Association, Vol. 81, No. 395, 1986, pp. 625-634. doi:10.2307/2288990</mixed-citation></ref><ref id="scirp.17836-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">M. Moeur and A. Stage, “Most Similar Neighbor: An Improved Sampling Inference Procedure for Natural Resource Planning,” Forest Science, Vol. 41, No. 2, 1995, pp. 337-359.</mixed-citation></ref></ref-list></back></article>