Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Improving Estimations of Spatial Distribution of Soil Respiration Using the Bayesian Maximum Entropy Algorithm and Soil Temperature as Auxiliary Data

  • Junguo Hu,

    Affiliations Information Engineering College of Zhejiang A & F University, Linan, PR China, Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology Research, Linan, PR China

  • Jian Zhou,

    Affiliation Information Engineering College of Zhejiang A & F University, Linan, PR China

  • Guomo Zhou ,

    zhougm@zafu.edu.cn

    Affiliation Information Engineering College of Zhejiang A & F University, Linan, PR China

  • Yiqi Luo,

    Affiliation Department of Microbiology and Plant Biology, University of Oklahoma, Norman, Oklahoma, United States of America

  • Xiaojun Xu,

    Affiliation Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology Research, Linan, PR China

  • Pingheng Li,

    Affiliation Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology Research, Linan, PR China

  • Junyi Liang

    Affiliation Department of Microbiology and Plant Biology, University of Oklahoma, Norman, Oklahoma, United States of America

Abstract

Soil respiration inherently shows strong spatial variability. It is difficult to obtain an accurate characterization of soil respiration with an insufficient number of monitoring points. However, it is expensive and cumbersome to deploy many sensors. To solve this problem, we proposed employing the Bayesian Maximum Entropy (BME) algorithm, using soil temperature as auxiliary information, to study the spatial distribution of soil respiration. The BME algorithm used the soft data (auxiliary information) effectively to improve the estimation accuracy of the spatiotemporal distribution of soil respiration. Based on the functional relationship between soil temperature and soil respiration, the BME algorithm satisfactorily integrated soil temperature data into said spatial distribution. As a means of comparison, we also applied the Ordinary Kriging (OK) and Co-Kriging (Co-OK) methods. The results indicated that the root mean squared errors (RMSEs) and absolute values of bias for both Day 1 and Day 2 were the lowest for the BME method, thus demonstrating its higher estimation accuracy. Further, we compared the performance of the BME algorithm coupled with auxiliary information, namely soil temperature data, and the OK method without auxiliary information in the same study area for 9, 21, and 37 sampled points. The results showed that the RMSEs for the BME algorithm (0.972 and 1.193) were less than those for the OK method (1.146 and 1.539) when the number of sampled points was 9 and 37, respectively. This indicates that the former method using auxiliary information could reduce the required number of sampling points for studying spatial distribution of soil respiration. Thus, the BME algorithm, coupled with soil temperature data, can not only improve the accuracy of soil respiration spatial interpolation but can also reduce the number of sampling points.

Introduction

Soil respiration represents one of the most important fluxes in the terrestrial carbon (C) cycle [13]. Therefore, accurately estimating the amount of soil carbon dioxide (CO2) efflux is of great importance to understand the terrestrial C cycle and the mechanisms involving climate change and its effects. Influenced by numerous natural factors, soil CO2 efflux tends to show intense spatial heterogeneity [4, 5]. Practically, studies on soil CO2 efflux generally adopt the scattered point sampling method because of the measurement limits associated with soil respiration[5, 6]. However, due to the extreme spatial and temporal variabilities in soil respiration, it is crucial to have denser spatial data points to undertake spatial interpolation [7].

There are various spatial interpolation methods, most of which have already been applied in many fields [8]. Generally, spatial interpolation methods can be classified into non-geostatistical methods, geostatistical methods (e.g., Kriging), and mixed methods. As an unbiased estimate method, the Kriging method is the most mature and popular method in the field of environmental science[911]. Due to expanding application scopes and varying application requirements in different fields, mixed interpolation methods have developed over time, both in theory and application [12]. In addition to a number of commonly used methods, some machine-learning methods have also been adopted toward spatial interpolation, and fairly good results were derived. Popular machine-learning methods include the neural network method and random forest method[13, 14]. Li et al. [15] explored the utilization of many machine-learning methods in spatial interpolation. Hu et al. [16] employed the neural network interpolation algorithm to study illumination distributions. Furthermore, many other algorithms are already being used to study interpolation in the field of environmental science. For instance, researchers have expanded data use to spatial and temporal aspects by adopting Bayesian prior information [17, 18]. Most of the current interpolation methods have already been applied in the study of spatial distribution of soil respiration. Teixeira et al. [19] compared the results of the application of the Kriging and sequential Gaussian fitting methods to soil respiration interpolation, and they found that latter achieved better results. Stoyan et al. [20] studied the spatial variation of soil respiration using the Kriging method. Jordan et al. [21] also used the Kriging method to study the small-scale spatial heterogeneity of soil respiration for a growing forest. However, soil respiration entails a complex a complex interrelationship of physical, biological, and chemical reactions, and thus, it is hard to fully analyze its spatial heterogeneity by merely interpolating data from a few sampled points. Consequently, it is crucial to improve the interpolation accuracy of soil respiration and compensating for the above-mentioned deficiency by including additional information, such as impact factors that are easily accessible, for example, auxiliary information on soil respiration, during sampling. Teixeira et al. [6] compared the interpolation results obtained from the OK and Co-Kriging (Co-OK) methods, using soil bulk density as the second feature, and they showed that the inclusion of this feature greatly improved the effect of interpolation. Huang et al. [22] studied the influence of vegetation and soil properties on the estimation of soil respiration space with the aid of remote sensing techniques. Jurasinski et al. [23] took root microbes as auxiliary information and investigated the spatial distribution of soil respiration by adopting the Co-OK method.

The Bayesian Maximum Entropy (BME) algorithm is a combination of the Bayesian statistical theory and the information theory of Shannon. It is used to handle spatiotemporal variables that can be integrated into more empirical knowledge and soft data (auxiliary information), and consequently, it can aid in the collation of environmental information in the field of geostatistics. Compared to the kriging algorithm, the BME algorithm is more theoretical and systematic. Gao et al. [24] coupled the temperature data obtained by remote sensing as soft data with the BME algorithm to study the regional spatial distribution of soil moisture. They compared the results of the BME algorithm to those of kriging and proved that the former could improve the regional interpolation effect. Akita et al. [25] developed a moving-window BME method to improve the estimation accuracy of regional air pollutant distribution.

Studies in the field of soil respiration have indicated that soil surface temperature significantly affects soil respiration, representing an exponential relationship [2628]. Soil temperature can be more easily obtained by advanced technologies (such as wireless sensor networks) than soil respiration. Thus, in this paper, we take advantage of the ability of the BME algorithm to use soft data and employ soil temperature as the soft data. In doing so, we confirm the following three hypotheses: (1) The BME method is more accurate at estimating the spatial distribution of soil CO2 efflux than OK and Co-OK methods, (2) data on soil temperature, used as auxiliary information, provide improved estimates of the spatial distribution of soil CO2 efflux on small scale, (3) and this auxiliary information can help reduce the number of sampling points while studying the spatial distribution of soil CO2 efflux.

Materials and Methods

Study area

The experimental site was located in the city of Lin’an in the northwest of Jincheng County, Zhejiang Province, China (119°43’15.24”–119°43’26.97”E and 30°15’21.60”–30°15’33.27”N). The entire study area is open grassland bounded by a lake in the east; sparse woods in the south, west, and northwest; and open grassland in the northeast. The average altitude is 50 m, and the highest altitude is 170 m. The area has an average annual frost-free period of 237 days and receives an average annual rainfall of 1613.9 mm over a total of about 158 days. The average annual temperature is 16.4°C, with 1847.3 h of annual sunshine. Roughly speaking, this area is warm and humid, featuring a subtropical monsoon climate, and it has sufficient illumination, abundant rainfall, and four distinct seasons.

The area belongs to Zhejiang A & F University, and we can do research freely. Other researchers also can easily get permission to do research in the area. Warning signs had been set during the testing process to insure there was no any danger. The research didn’t cause irreversible damage to the soil and there isn’t any protected species in the study area.

Data sources

The experimental area covers an area of 35 m × 35 m, which was divided into grids of 5 × 5m. Seven Lr100GE-6400 [29], developed by GreenOrbs Laboratory, were used to measure the CO2 efflux within each grid. The Lr100GE-6400 is an open-box CO2 efflux measuring instrument. One day before measuring the CO2 efflux, a PVC soil collar was pressed into the soil to a depth of about 5 cm at the center of the surface of each soil core. In order to ensure simultaneous readings, we took 1 minute to warm up the 7 instruments, and 3 minutes to conduct the measurements, with all measurements being completed within the 90 minutes between 13:00 to 14:30 on September 30 (Day 1) and October 7 (Day 2), 2014. We chose the stable data as the experimental data from all the observed data. On Day 1, we conducted measurements from east to west (horizontal direction), while on Day 2, we did so from south to north (vertical direction). The summary statistics of the CO2 efflux data have been provided in Table 1. Higher density temperature data (35 × 35) were measured manually by 30 corrected soil TP-101 thermometers, logged when they achieved equilibrium at 1 to 2 min inserted into the soil at a depth of about 5 cm-10cm. All rounds of sampling were completed in 90 min. The summary statistics of the soil temperature data appear in Table 2.

Data preprocessing

The application of soft data (soil temperature) is important for the BME algorithm, to integrate uncertain information into the estimation. Suitable and high quality soft data can improve the performance of the algorithm. Soft data can be integrated into expert knowledge, experimental conclusions, and so on, with the common probability-type of soft data [30, 31]. Probability soft data can be approximately normally distributed or Student t-distributed to express the measurement error or physical interpretation [32, 33]. Interval soft data denote physical meanings with upper and lower bounds. After reviewing other similar studies, we assumed that soil respiration and soil temperature share a functional relationship of the Arrhenius type [26, 27], as shown in Eq 1. We used the measured data to calculate the fitting parameters in Eq 1. The soft data are given by the fitting results at regular intervals of the Student’s t-distribution, and the formula used to calculate the prediction interval is given by Eq 2 [34]: (1) (2)

Here, is the estimated soil CO2 efflux relative to the soil temperature Ti, with parameters a and b. RSinterval is the prediction interval corresponding to each estimated soil CO2 efflux is the critical value of the Student’s t-distribution with (n– 2) degrees of freedom and a confidence level of 95%. STP is the standard deviation (SD) of the soil CO2 efflux estimation error. is the average soil temperature. STT is the sum of the square of the deviations. The numerical values of parameters in Eqs 1 and 2 are shown in Table 3. Fig 1 shows the relationship between soil CO2 efflux and soil temperature, the probability distribution, and prediction interval of the soft data.

thumbnail
Fig 1. Temperature–soil CO2 Efflux Scatter Plots (TemRs), Fitting Relationship, and Probability Distribution.

Plots (a) and (b) denote the aforementioned relationships for Day 1 and Day 2, respectively. The red line (reg in the legend) shows the relationship between temperature and soil CO2 efflux. The green lines (Plup and Pldown in the legend) indicate the prediction intervals at a confidence level of 95%. The blue line (pd in the legend) refers to the probability distribution of the soft data (estimated CO2 efflux corresponding to soil temperature).

https://doi.org/10.1371/journal.pone.0146589.g001

thumbnail
Table 3. Parameters of the Arrhenius Type Formula and Summary Statistics of the Soft Data.

https://doi.org/10.1371/journal.pone.0146589.t003

Comparison between methods

This paper compared the results of the BME method and two Kriging methods (OK and Co-OK). The spatial estimates from the three methods were evaluated by several validation methods.

Kriging.

Kriging is an unbiased linear estimation method used to characterize a physical attribute's spatial variation and generate attribute estimates at un-sampled locations. OK, the simplest and most widely used kind of Kriging, calculates the weights (relative contributions) of attribute samples surrounding each estimation point by means of the geostatistical variogram, and the unknown attribute values are then estimated as the linear combination of the weighted samples, subject to the condition that the sum of the weights is equal to 1, see Eq 3: (3)

Here, Z*(V0) represents the value of the estimated point V0, Vi represents the value of the i-th point among n points around V0, and λi denote the weight coefficients.

Co-OK follows the same principle as OK. However, the former considers more than one variable, and in addition to considering the spatial relationship of the main variable itself, it considers the relationships between the main variable and all other variable types to enable better predictions. Adding more information about relevant variables while estimating the main variable can improve the estimated effects. As we consider the spatial variability of soil C flux over time, adding the closely related variable of soil temperature can compensate for the insufficiency of CO2 efflux sampling and improve the accuracy of the estimation, as shown in Eq 4: (4)

Here, is the estimate of the main variable Z1 at point V0. λ1i is the weight of the main variable Z1, and λ2j is the weight of the auxiliary variable Z2 (the second characteristic).

The Kriging method studies the spatial relationships from point to point, which are usually used to express spatial variability with an experimental variogram. Variograms generally include self-variograms and cross-variograms, and they are used to determine the spatial autocorrelations of the variable’s properties, as shown as Eq 5 [35]: (5) where is the experimental semivariance at a separation distance h, Z(Xi) is the property value of the variable at the i-th point, and N(h) is the number of pairs of points separated by the distance h.

In this paper, we added soil temperature properties as the auxiliary information. Cross-variograms can show the relationship between two variables, as seen in Eq 6 [36]: (6) is the experience cross-variogram at separation distance h, Z (Xi) is the main property value at the i-th point, Y(Xi) is the secondary property value at the i-th point, and N(h) is the number of pairs of points separated by distance h.

Based on the coefficient of determination (R2) and squared residuals, we chose the Gaussian and Spherical models as the optimal variogram model, as depicted by Eq 7 and Eq 8: (7) (8) where γ (h) is the semivariance, C0 indicates the nugget, C represents the structural variability, C0 + C represents the sill variance, and a denotes the correlation length range in geostatistics.

Bayesian Maximum Entropy.

BME is a spatiotemporal analysis and mapping method that combines information theory with Bayesian statistics [30]. Compared with classical geostatistics (kriging), BME can consider general-prior and site-specific knowledge using a certain error and uncertainty in soft (uncertain) data, in addition to hard (exact) data, to improve the accuracy of spatiotemporal analysis. The meaning of soft data is very flexible; it may denote sampled data, historical data, rough measurement data, expert knowledge, and/or model fitting data. The manner in which the BME method uses comprehensive information and employs the probability method to express uncertainty to the extent possible allows it to present more realistic results in the spatiotemporal analysis of nature attributes.

The BME process can be divided into three stages: prior, meta-prior, and posterior (Fig 2). The prior stage mainly uses general knowledge G, to calculate the prior joint probability density function fG(χmap) using the Shannon information measure. χmap is composed using hard data χhard, soft data χsoft, and unknown values at estimation point χk (the point to be estimated). The process involved in obtaining the expected information is shown as Eq 9, where gα(χmap) contains the general statistical information about χmap, such as the mean and covariance. The Shannon information measure is used to maximize the entropy under the relevant constraints of gα(χmap). Eq 10 shows the function of the Lagrange multipliers method (LMM) for maximizing the expected information by introducing the Lagrange multiplier μα and the expectation of gα(χmap).

(9)(10)

At the meta-prior stage, specific knowledge will be added into the calculation, including the measured hard data χhard and various forms of soft data χsoft. The soft data set fS(χsoft) denoted probability in this paper. At the posterior stage, using Bayes’ theorem, the posterior probability density function fK(xk|χdata) of the estimation points is calculated, taking into account the conditions of the specific knowledge, shown as Eqs 11 and 12: (11) (12)

After computing the posterior probability density function fK(xk|χdata), we can obtain the attribute values at the estimation points by way of the maximum posterior probability or maximum expectation, shown as Eqs 13 and 14: (13) (14) where denotes the estimated values of xk. We choose Eq 9 as the final estimation method in this paper.

Methods of validation.

In order to evaluate the performance of the three methods (OK, Co-OK, and BME), about 10% of the collected data (which were selected to avoid continuous sampling points or points located on the outside, namely, a total of five points) were employed as data for cross-validation. The three statistical indicators of the root mean squared error (RMSE), correlation coefficient (CR), and average deviation (mean bias) were used to quantify the accuracy of the estimation results, as shown in Eqs 15, 16 and 17: (15) (16) (17)

We applied GS+ 9.0 spatial analysis software to calculate the semivariance and for the OK and Co-OK methods[37]. The BMElib library (BMEGUI3.0 software) was used for the BME method[31], and Matlab 8.4 was employed for basic processing and mapping.

Results

Comparison of results of the OK, Co-OK, and BME methods

the mean values of soil respiration in the study area was 3.476 and 5.81 μmg/m2s with the varition range of 2.698 and 3.019 μmg/m2s respectively, and the values of the coefficient of variation were observed 17.332 and 8.805 respectively, which proved that it was significant to consider the spatial variability of soil respiration in the study area. As shown in the Table 4, the parameters of the variogram models for the auto-variograms of soil respiration and the cross-variograms of the soil respiration and temperature in the study area during the abservation periods. There was some different in the range values of soil respiration in the observing days but the similar range of the cross-varigram of soil respiration and temperature, shown in Fig 3,and the soil respiration presented strong spatial dependence with the C0/(C0 + C) ratio<0.25.

thumbnail
Fig 3. Auto- and cross-variograms fitted to the soil respiration and temperature (a) and (b) respectively are shown the variograms of soil respiration on Day1 and Day2. (c) and (d) respectively are shown the cross-variograms of soil respiration and soil temperature on Day1 and Day2.

https://doi.org/10.1371/journal.pone.0146589.g003

thumbnail
Table 4. Models and parameters of the auto-variograms and cross-variograms fitted to the soil repiration and temperature.

https://doi.org/10.1371/journal.pone.0146589.t004

The spatial distribution of soil respiration in the study area was estimated using the three methods previously described. Fig 4 shows results of the mapping method for the samples collected over Day 1 and Day 2. In general, the three methods reflect the variations in soil respiration and its range in the study area, but certain difference in the local. The range of the spatial estimates, however, shows some differences among the three methods (Table 5). According to the BME method, the range on Day 1 and Day 2 was 3.543 and 5.038, respectively, which exceeded the ranges obtained using the Co-OK method (2.220 and 3.130, respectively) and the OK method (2.170 and 3.040, respectively). In fact, the results from the BME method showed an even greater range for both days than the corresponding values for the measured data (2.698 and 3.014, respectively). The estimations obtained using the Co-OK and OK methods fall in the same range as the measured values. Note that the values estimated using the Co-OK method are slightly larger than those obtained using the OK method. At the same time, the standard deviation of estimated value using the three methods also follow that the range of BME method is greater than Co-OK method than the OK method. In probability terms, it’s possible for the range of estimated value of soil respiration beyond the sampling data using only sampling data to estimate the value of soil respiration in the entire study area, so the BME method integrating more information and presenting the spatial variation of soil respiration in a probabilistic manner is more in line with real-world changes. Table 6 presents the validation results for the three methods. CR for Day 1 and Day 2 using the BME method (0.793 and 0.697, respectively) was significantly higher than the corresponding values obtained using the Co-OK and OK methods. The correlation between the estimation results from the BME method and the actual measurements is higher. Moreover, the RMSEs and absolute values of bias for both Day 1 and Day 2 were the lowest for the BME method, indicating that it can provide estimates of soil respiration in the study area with higher precision than the traditional Co-OK and OK methods.

thumbnail
Fig 4. Spatial Distribution of Soil Respiration Using the BME, Co-OK, and OK Methods.

https://doi.org/10.1371/journal.pone.0146589.g004

thumbnail
Table 6. Summary of Validation Results for the BME, Co-OK, and OK Methods.

https://doi.org/10.1371/journal.pone.0146589.t006

Effect of soil temperature as auxiliary information on the spatial estimation of soil respiration

In this study, we used only soil respiration samples for the OK method, while soil temperature data provided additional inputs to the BME and Co-OK methods. Table 6 shows that the RMSEs (using soil temperature data) obtained using the BME and Co-OK methods are 0.727 and 0.911, respectively, on Day 1, and 0.409 and 0.790, respectively, on Day 2, indicating that the inclusion of soil temperature as auxiliary information in the BME and Co-OK methods improves their overall performance compared to the OK method (its corresponding RMSE values being 0.979 and 2.042, respectively). The verification of the existing correlation indicates that the BME and Co-OK methods significantly outperform the OK method; thus, soil temperature data can effectively improve the reliability of the spatial pattern of soil respiration. However, the results for the deviation of validation (bias; Table 6) indicate only a small difference in results between the Co-OK and OK methods (-0.309 and -0.296, respectively, on Day 1), while the value for the BME method is -0.035, which is significantly better than those obtained using the other two methods. Thus, the auxiliary effect of soil temperature on the estimates would vary depending on the manner of its utilization.

It is notable that the estimations obtained using the BME and Co-OK methods with soil temperature data are different. The results of the index regression model for both days of the measured data show that soil temperature can explain 60.6% and 57.1% of soil respiration respectively, as shown in Table 3. After estimating the spatial variation of soil respiration in the study area using the BME and Co-OK methods, the generated soil respiration estimates are exponentially related to the soil temperature values, see Fig 5. According to the results, in the BME method, soil temperature can explain 57.6% and 47.8% of soil respiration on Day 1 and Day 2, respectively. For the Co-OK method, the corresponding values are 47.8% and 37%, using the same soil temperature as auxiliary information. Thus, the utilization degree and degree of influence of the auxiliary information differ between the BME and Co-OK methods. Thus, Table 5 clearly indicates that, statistically, the BME method outperforms the Co-OK method in terms of estimating the spatial variation in soil respiration when soil temperature serves as auxiliary data.

thumbnail
Fig 5. Fitting Relationship between Estimated Soil CO2 Efflux and Soil Temperature.

Note: The total number of estimated points for each sampling strategy is 1225, and the analysis of correlation is calculated using Eq 12.

https://doi.org/10.1371/journal.pone.0146589.g005

Effect of soil temperature as auxiliary information on soil respiration sampling points

According to the analysis presented above, soil temperature can serve as auxiliary information to improve the estimation accuracy of the spatial distribution of soil respiration. In the field, soil temperature data can be obtained by setting up a wide network of wireless sensor networks, but these data are cumbersome to measure over multiple points over a large area and over a long period. However, the collection of high-density soil temperature data to help estimate soil respiration has certain practical significance. Accordingly, we reduced the number of sampling points for soil respiration in our study, and as noted in previous sections, using soil temperature data, we then applied the BME method to estimate the spatial distribution of soil respiration in the study area. The results were compared with estimates from the OK method, which could not use the temperature data. In order to ensure an even distribution of measuring points throughout the study area, we chose the measuring points across four quadrants (Fig 6), including points on the horizontal and vertical lines (axes) dividing the quadrants. The number of measuring points was increased from 1 to 3, from 3 to 6, and from 6 to 9 in each quadrant and by 1 on each axis, ultimately resulting in the 9, 21, 37, and 49 measuring points scheme adopted by this study. Fig 7 shows the spatial distribution of soil respiration estimated by the BME and OK methods when the number of measuring points was 9, 21, 37, and 49. The BME results continue to show more detailed information even when the number of sampling points are reduced. Conversely, the results from the OK method, which cannot account for temperature data, depend strongly on the number of measuring points, and thus, it is difficult to obtain detailed information with a reduced number of measuring points. Table 7 shows the validation results in terms of the RMSE and CR when the number of sampling points is 9, 21, and 37. For Day 1, the RMSEs for the BME method are 0.972, 0.838, and 0.673, respectively, while those for the OK method (without using soil temperature as auxiliary information) are 2.759, 1.246, and 1.146, respectively. This proves that the BME method using soil temperature as auxiliary information provides more accurate results, which are not significantly affected by the reduced number of soil respiration samples (that are difficult to obtain). The values of CR using the BME method with soil temperature as auxiliary information are 0.778, 0.906, and 0.951 when the number of sampling points is 9, 21, and 37, respectively. Again, the results of the BME method are preferable to those of the OK method (the corresponding values being 0.289, 0.544, and 0.738, respectively). Moreover, the differences in CR using the BME method for the three above-mentioned sampling strategies are smaller (Fig 8). We see similar patterns for Day 2 also. Furthermore, when the number of sampling points is 9, the RMSEs using the BME method are 0.972 and 1.193 for Day 1 and Day 2, which continue to be superior to the RMSEs obtained using the OK method when the number of sampling points is 37 (1.146 and 1.539, respectively). This means that the accuracy of the BME method, coupled with soil temperature as auxiliary information, does not suffer when the number of soil samples is reduced, and the effect of reducing the number of sampling points on the estimation results of the study area as a whole is small.

thumbnail
Fig 6. Distributions of 9, 21, 37, and 49 Measured Points.

The solid circles depict the sampled points, and the clear circles, points where samples were not taken.

https://doi.org/10.1371/journal.pone.0146589.g006

thumbnail
Fig 7. Spatial Distribution of Soil CO2 Efflux Using the BME and OK Methods When the Number of Measured Points is 9, 21, 37, and 49 (μmg/m2s).

https://doi.org/10.1371/journal.pone.0146589.g007

thumbnail
Fig 8. Correlation Between Estimated Soil CO2 Effluxes When the Number of Sampled Points is 9, 21, 37, and 49, Using the BME and OK Methods.

Note: the Estimated Points is 1225 at Each Sampling Strategy, and the Analysis of Correlation Using Eq 12.

https://doi.org/10.1371/journal.pone.0146589.g008

thumbnail
Table 7. Validation Results for Number of Sampling Points.

https://doi.org/10.1371/journal.pone.0146589.t007

Discussion

Accuracy of soil respiration spatial interpolation based on Bayesian Maximum Entropy is high

Christakos proposed the BME method in the early 1990s demonstrated both its theoretical and practical significance. Christakos also demonstrated that Kriging was a particular situation of BME under the limited assumption of transcendental information and effective data [3236, 38]. By making the best use of auxiliary information, the BME method could effectively deal with real-world physical situations characterized by relatively high uncertainty. The BME method has been applied to the area of edaphology [3941] and the results have demonstrated less error and higher accuracy compared with traditional interpolation. Spatial estimation values of higher accuracy and more detailed local differences have been achieved in its application to environmental science [4247]. The BME method fittingly integrates into the soft data’s logarithm spatial analysis and improves the classification of results in data mining [4850]. The BME method has also obtained better effects than traditional methods in applications to the ecological field [24, 51, 52].Traditional geostatistics mainly relies on sampled data measurements to obtain estimates for the whole study area without bias. The BME method, however, blends more soft data and expert knowledge with the sampled data, making full use of this information to enhance the estimation accuracy of the study object. For example, the spatial distribution of soil respiration can be studied using auxiliary information from the existing literature; Teixeira et al. [6] used soil bulk density as auxiliary information to improve estimates of spatial distribution of soil C density in sugarcane fields. This article considers the exponential relationship between soil temperature and C flux, by applying the BME approach to soil respiration and using soil temperature data as auxiliary information. BME thus effectively improves the accuracy of soil respiration spatial interpolation.

Bayesian maximum entropy method coupled with soil temperature data (as soft data) can reduce the number of sampling points

Currently, the most widely used instruments in soil respiration monitoring are Li-8100 and Li-6400, which are so expensive that it is difficult to buy multiple instruments for simultaneous monitoring. The static air box method, although inexpensive, requires more manpower, and it is difficult to achieve simultaneous monitoring. Soil respiration typically exhibits strong spatial heterogeneity [19, 5356]. Therefore, an insufficient number of monitoring points cannot characterize soil respiration for an entire area. Soil temperature, however, is relatively easy to measure, and these measurements can compensate for the lack of adequate monitoring points. Thus, devising a method that requires fewer monitoring points to achieve more comprehensive measurements of soil respiration is of great significance.

The biggest advantage of the BME method is that it can integrate soft data (prior knowledge, expertise, etc.) into hard data (measured data). Many scholars [32, 5759] have concluded that, compared to conventional kriging, the BME method offers the advantages of higher accuracy and less error. Regarding the use of auxiliary information, the Co-OK method only uses weights to simply increase the influence of auxiliary information. The BME method, in contrast, is based on information entropy theory and using the LMM, it integrates the auxiliary information into the estimation of the natural attributes of interest. Then, using the Bayesian approach, it calculates the final estimate, thus making the whole process more systematic. During the integration of the auxiliary information, the BME method combines the measured data (the hard data Xhard = {X1,X2Xh}) and the auxiliary information (soft data Xsoft = {Xh+1,Xh+2Xh+s}) into available information sets Xdata = {Xhard, Xsoft}, namely, the elements are expanded from h to q = h + s. Generally, the available soft data are very large; in this study too, the extent of soil temperature data significantly exceeded the measured soil respiration data. Notably, the expandation in the amount of information increases the uncertainty of information. Using its base of information entropy and the ability of the Bayesian approach to deal with information uncertainty, the BME method effectively combines the auxiliary information and measured data in the same space estimation [25, 52, 60]. Christakos also inferred the sum of partial derivatives of product μqgq(Xk, Xhard, Xsoft) between Lagrange parameter μq and statistical matrix equation gq(Xk, Xhard, Xsoft) with respect to the estimation point Xk; in other words, the BME method entails the simultaneous inclusion of massive amounts of soft data and hard data into the calculation [38]. Similarly, in this study, high-density soil temperature data were integrated into the space estimates alongside soil respiration values. Meanwhile, in the absence of adequate amounts of hard (measured) data region, the integration of soft data compensates for this absence of hard data to some extent. Christakos et al. [32] also noted that the BME method uses less measured data to obtain better estimates than the simple kriging method, which uses the space of all measured data. In agreement with this result, this study showed that the BME method, when coupled with soil temperature data and measured data for only nine measurement points, had better effects than the OK method for 37 measurement points as the verification results showed in Table 6. Thus, the BME method, coupled with soil temperature as auxiliary information, can reduce the number of monitoring points in soil respiration studies by a considerable extent.

Conclusions

This paper coupled the BME method with auxiliary information of soil temperature, to study the spatial distribution of soil respiration. The results indicated that the BME method is superior to the Co-OK and OK methods. Moreover, such application can reduce the number of soil respiration sampling points, thus indicating its significance in the area of soil respiration monitoring, the equipment for which is typically expensive enough to restrict researchers to sparse single-point monitoring.

This study also has some limitations. We chose soil temperature as auxiliary information. However, the spatial heterogeneity of soil respiration is affected by many other factors, such as soil C storage, soil bulk density, and root and microbial biomass. These factors can be taken into consideration in future studies of the performance of the BME method in interpolating soil respiration distributions while using data on multiple factors as auxiliary information.

Acknowledgments

This study was supported by the NSF China Programs (Grant No. 31300539 and 31570629) and the Public Welfare Technology Application Research Program of Zhejiang province (Grant No. 2015C31004).

Author Contributions

Conceived and designed the experiments: ZG HJ ZJ. Performed the experiments: HJ ZJ. Analyzed the data: ZJ. Contributed reagents/materials/analysis tools: ZJ. Wrote the paper: ZJ. Participated in revision of manuscript: LY XX LP LJ.

References

  1. 1. Chen S, Zhang X, Liu Y, Hu Z, Shen X, Ren J. Simulated acid rain changed the proportion of heterotrophic respiration in soil respiration in a subtropical secondary forest. Appl Soil Ecol. 2015; 86: 148–157.
  2. 2. Gomez-Casanovas N, Anderson-Teixeira K, Zeri M, Bernacchi CJ, DeLucia EH. Gap filling strategies and error in estimating annual soil respiration. Glob Chang Biol. 2013; 19: 1941–1952. pmid:23504959
  3. 3. Bond-Lamberty B, Thomson A. Temperature-associated increases in the global soil respiration record. Nature. 2010; 464: 579–582. pmid:20336143
  4. 4. Reichstein M, Rey A, Freibauer A, Tenhunen J, Valentini R, Banza J. Modeling temporal and large‐scale spatial variability of soil respiration from soil water availability, temperature and vegetation productivity indices. Global Biogeochem Cycles. 2003; 17: 1104.
  5. 5. Yu L, Wang Y, Wang Y, Sun S, Liu L. Quantifying components of soil respiration and their response to abiotic factors in two typical subtropical forest stands, southwest china. PloS one. 2014; 10(2): e0117490.
  6. 6. Teixeira DDB, Bicalho EDS, Panosso AR, Cerri CEP, Pereira GT, La Scala Júnior N. Spatial variability of soil CO2 emission in a sugarcane area characterized by secondary information. Sci Agric. 2013; 70: 195–203.
  7. 7. Li J, Heap AD. Spatial interpolation methods applied in the environmental sciences: A review. Environ Model Softw. 2014; 53: 173–189.
  8. 8. Li J, Heap AD. A review of spatial interpolation methods for environmental scientists. Geoscience Australia, Canberra, 2008. Record 2008/023.
  9. 9. Elbasiouny H, Abowaly M, Abu Alkheir A, Allah Gad A. Spatial variation of soil carbon and nitrogen pools by using ordinary Kriging method in an area of north Nile Delta, Egypt. Catena. 2014; 113: 70–78.
  10. 10. Zhang S-W, Shen C-Y, Chen X-Y, Ye H-C, Huang Y-F, Shuang LAI. Spatial interpolation of soil texture using compositional kriging and regression kriging with consideration of the characteristics of compositional data and environment variables. J Integr Agric. 2013; 12: 1673–1683.
  11. 11. Kumar S, Lal R, Liu D. A geographically weighted regression kriging approach for mapping soil organic carbon stock. Geoderma. 2012; 189: 627–634.
  12. 12. Hengl T. A practical guide to geostatistical mapping of environmental variables; 2007. Available: http://spatial-analyst.net/book/system/files/Hengl_2009_GEOSTATe2c1w.pdf
  13. 13. Guo P-T, Li M-F, Luo W, Tang Q-F, Liu Z-W, Lin Z-M. Digital mapping of soil organic matter for rubber plantation at regional scale: An application of random forest plus residuals kriging approach. Geoderma. 2015; 237: 49–59.
  14. 14. Friedel MJ, Iwashita F. Hybrid modeling of spatial continuity for application to numerical inverse problems. Environ Model Softw. 2013; 43: 60–79.
  15. 15. Li J, Potter A, Huang Z, Daniell JJ, Heap AD. Predicting seabed mud content across the Australian margin: comparison of statistical and mathematical techniques using a simulation experiment. Geoscience Australia, 2010. Record 2010/011.
  16. 16. Junguo H, Guomo Z, Xiaojun X. Using an improved back propagation neural network to study spatial distribution of sunshine illumination from sensor network data. Ecol Modell. 2013; 266: 86–96.
  17. 17. Cressie N, Wikle CK. Statistics for spatio-temporal data. John Wiley & Sons; 2011.
  18. 18. Strebel K, Espinosa G, Giralt F, Kindler A, Rallo R, Richter M, et al. Modeling airborne benzene in space and time with self-organizing maps and Bayesian techniques. Environ Model Softw. 2013; 41: 151–162.
  19. 19. Teixeira DDB, Panosso AR, Cerri CEP, Pereira GT, La Scala Júnior N. Soil CO2 emission estimated by different interpolation techniques. Plant Soil. 2011; 345: 187–194.
  20. 20. Stoyan H, De-Polli H, Böhm S, Robertson GP, Paul EA. Spatial heterogeneity of soil respiration and related properties at the plant scale. Plant Soil. 2000; 222: 203–214.
  21. 21. Jordan A, Jurasinski G, Glatzel S. Small scale spatial heterogeneity of soil respiration in an old growth temperate deciduous forest. Biogeosci Discuss. 2009; 6: 9977–10005.
  22. 22. Huang N, Wang L, Guo Y, Hao P, Niu Z. Modeling spatial patterns of soil respiration in maize fields from vegetation and soil property factors with the use of remote sensing and geographical information system. PLoS One. 2014; 9: e105150. pmid:25157827
  23. 23. Jurasinski G, Jordan A, Glatzel S. Mapping soil CO2 efflux in an old-growth forest using regression kriging with estimated fine root biomass as ancillary data. For Ecol Manage. 2012; 263: 101–113.
  24. 24. Gao S, Zhu Z, Liu S, Zu Z, Liu S, Jin R, et al. Estimating the spatial distribution of soil moisture based on Bayesian maximum entropy method with auxiliary data from remote sensing. Int J Appl Earth Obs Geoinf. 2014; 32: 54–66.
  25. 25. Akita Y, Chen J-C, Serre ML. The moving-window Bayesian maximum entropy framework: estimation of PM2.5 yearly average concentration across the contiguous United States. J Expo Sci Environ Epidemiol. 2012; 22: 496–501. pmid:22739679
  26. 26. Fang C, Moncrieff J. The dependence of soil CO2 efflux on temperature. Soil Biol Biochem. 2001; 33: 155–165.
  27. 27. Lloyd J, Taylor J. On the temperature dependence of soil respiration. Funct Ecol. 1994; 8: 315–323.
  28. 28. Zhou W, Hui D, Shen W. Effects of soil moisture on the temperature sensitivity of soil heterotrophic respiration: a laboratory incubation study. PLoS One. 2014; 9: e92531. pmid:24647610
  29. 29. Lin X, Lufeng M, Ziwei X, Xiaojun X, Shenghui W, Guomo Z. System of measuring soil carbon flux based on wireless sensor network. Scientia Silvae Sinicae. 2013; 49: 122–126.
  30. 30. Bei Z, Weidong L, Yang Y, Shanqin W, Chongfa C. The Bayesian maximum entropy geostatistical approach and its application in soil and environmental sciences. Acta Pedologica Sinica. 2011; 48: 831–839.
  31. 31. Christakos G, Bogaert P, Serre M. Temporal GIS, with CD-ROM. New York: Springer-Verlag; 2002.
  32. 32. Christakos G, Li X. Bayesian maximum entropy analysis and mapping: A farewell to kriging estimators? Math Geol. 1998; 30: 435–462.
  33. 33. Christakos G, Serre ML, Kovitz JL. BME representation of particulate matter distributions in the state of California on the basis of uncertain measurements. J Geophys Res Atmos (1984–2012). 2001; 106: 9717–9731.
  34. 34. Younger MS. A first course in linear regression. PWS Publishing Company; 1985.
  35. 35. Boyer DG, Wright RJ, Feldhake CM, Bligh DP. Soil spatial variability relationships in a steeply sloping acid soil environment. Soil Sci. 1996; 161: 278–287.
  36. 36. Clayton V, Journel AG. GSLIB—Geostatistical software library and user’s guide. Technometrics; 1998.
  37. 37. Gamma Design Software. 2008. GS+ Geostatistics for the Environmental Sciences, Version 9.0. Gamma Design Software, Plainwell, MI, USA.
  38. 38. Christakos G. A Bayesian/maximum-entropy view to the spatial estimation problem. Math Geol. 1990; 22: 763–777.
  39. 39. D’or D, Bogaert P, Christakos G. Application of the BME approach to soil texture mapping. Stoch Environ Res Risk Assess. 2001; 15: 87–100.
  40. 40. Bogaert P, D’or D. Estimating soil properties from thematic soil maps. Soil Sci Soc Am J. 2002; 66: 1492–1500.
  41. 41. Brus D, Bogaert P, Heuvelink G. Bayesian maximum entropy prediction of soil categories using a traditional soil map as soft information. Eur J Soil Sci. 2008; 59: 166–177.
  42. 42. Christakos G, Serre ML. BME analysis of spatiotemporal particulate matter distributions in North Carolina. Atmos Environ. 2000; 34: 3393–3406.
  43. 43. Puangthongthub S, Wangwongwatana S, Kamens RM, Serre LM. Modeling the space/time distribution of particulate matter in Thailand and optimizing its monitoring network. Atmos Environ. 2007; 41: 7788–7805.
  44. 44. Pang W, Christakos G, Wang J- F. Comparative spatiotemporal analysis of fine particulate matter pollution. Environmetrics. 2010; 21: 305–317.
  45. 45. Lee S- J, Wentz EA, Gober P. Space–time forecasting using soft geostatistics: a case study in forecasting municipal water demand for Phoenix, Arizona. Stoch Environ Res Risk Assess. 2010; 24: 283–295.
  46. 46. Yu H- L, Chen J- C, Christakos G, Jerrett M. BME estimation of residential exposure to ambient PM10 and ozone at multiple time scales. Environ Health Perspect. 2009; 117: 537–544. pmid:19440491
  47. 47. Savelieva E, Demyanov V, Kanevski M, Demyanov V, Kanevski M, Serre M, et al. G. BME-based uncertainty assessment of the Chernobyl fallout. Geoderma. 2005; 128: 312–324.
  48. 48. Orton T, Lark R. The Bayesian maximum entropy method for lognormal variables. Stoch Environ Res Risk Assess. 2009; 23: 319–328.
  49. 49. Bonavito N, Gordon C, Inguva R, Serafino GN, Barnes RA. Maximum entropy-driven Bayesian reasoning in data classification. Telemat Inform. 1994; 11: 295–308.
  50. 50. Yari G, Chaji A. Maximum Bayesian entropy method for determining ordered weighted averaging operator weights. Comput Ind Eng. 2012; 63: 338–342.
  51. 51. Li A, Bo Y, Chen L. Bayesian maximum entropy data fusion of field-observed leaf area index (LAI) and Landsat Enhanced Thematic Mapper Plus-derived LAI. Int J Remote Sens. 2013; 34: 227–246.
  52. 52. Li A, Bo Y, Zhu Y, Guo P, Bi J, He Y. Blending multi-resolution satellite sea surface temperature (SST) products using Bayesian maximum entropy method. Remote Sens Environ. 2013; 135: 52–63.
  53. 53. Ngao J, Epron D, Delpierre N, Bréda N, Granier A, Longdoz B. Spatial variability of soil CO2 efflux linked to soil parameters and ecosystem characteristics in a temperate beech forest. Agric For Meteorol, 2012; 154: 136–146.
  54. 54. Wood TE, Detto M, Silver WL. Sensitivity of soil respiration to variability in soil moisture and temperature in a humid tropical forest. PLoS One. 2013; 8(12): e80965. pmid:24312508
  55. 55. Xiao W, Ge X, Zeng L, Huang Z, Lei J, Zhou B, et al. Rates of Litter Decomposition and Soil Respiration in Relation to Soil Temperature and Water in Different-Aged Pinus massoniana Forests in the Three Gorges Reservoir Area, China. PLoS One. 2014; 9(7): e101890. pmid:25004164
  56. 56. Dore S, Fry DL, Stephens SL. Spatial heterogeneity of soil CO2 efflux after harvest and prescribed fire in a California mixed conifer forest. For Ecol Manage. 2014; 319: 150–160.
  57. 57. Bayat B, Zahraie B, Taghavi F, Nasseri M. Evaluation of spatial and spatiotemporal estimation methods in simulation of precipitation variability patterns. Theor Appl Climatol. 2013; 113: 429–444.
  58. 58. Douaik A, Van Meirvenne M, Tóth T. Soil salinity mapping using spatio-temporal kriging and Bayesian maximum entropy with interval soft data. Geoderma. 2005; 128: 234–248.
  59. 59. Douaik A, Van Meirvenne M, Tóth T, Serre M. Space-time mapping of soil salinity using probabilistic Bayesian maximum entropy. Stoch Environ Res Risk Assess. 2004; 18: 219–227.
  60. 60. Yu H-L, Wang C-H. Quantile-based Bayesian maximum entropy approach for spatiotemporal modeling of ambient air quality levels. Environ Sci Technol. 2013; 47: 1416–1424. pmid:23252912