Effective Key Parameter Determination for an Automatic Approach to Land Cover Classification Based on Multispectral Remote Sensing Imagery

Yong Wang; Dong Jiang; Dafang Zhuang; Yaohuan Huang; Wei Wang; Xinfang Yu

doi:10.1371/journal.pone.0075852

Abstract

The classification of land cover based on satellite data is important for many areas of scientific research. Unfortunately, some traditional land cover classification methods (e.g. known as supervised classification) are very labor-intensive and subjective because of the required human involvement. Jiang et al. proposed a simple but robust method for land cover classification using a prior classification map and a current multispectral remote sensing image. This new method has proven to be a suitable classification method; however, its drawback is that it is a semi-automatic method because the key parameters cannot be selected automatically. In this study, we propose an approach in which the two key parameters are chosen automatically. The proposed method consists primarily of the following three interdependent parts: the selection procedure for the pure-pixel training-sample dataset, the method to determine the key parameters, and the optimal combination model. In this study, the proposed approach employs both overall accuracy and their Kappa Coefficients (KC), and Time-Consumings (TC, unit: second) in order to select the two key parameters automatically instead of using a test-decision, which avoids subjective bias. A case study of Weichang District of Hebei Province, China, using Landsat-5/TM data of 2010 with 30 m spatial resolution and prior classification map of 2005 recognised as relatively precise data, was conducted to test the performance of this method. The experimental results show that the methodology determining the key parameters uses the portfolio optimisation model and increases the degree of automation of Jiang et al.'s classification method, which may have a wide scope of scientific application.

Citation: Wang Y, Jiang D, Zhuang D, Huang Y, Wang W, Yu X (2013) Effective Key Parameter Determination for an Automatic Approach to Land Cover Classification Based on Multispectral Remote Sensing Imagery. PLoS ONE 8(10): e75852. https://doi.org/10.1371/journal.pone.0075852

Editor: Guy J-P. Schumann, NASA Jet Propulsion Laboratory, United States of America

Received: February 18, 2013; Accepted: August 22, 2013; Published: October 28, 2013

Copyright: © 2013 Wang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This study was partially funded by the China Postdoctoral Science Foundation (20060400496), the Chinese Academy of Sciences (KZZD-EW-08), the National Natural Science Foundation of China (41001279), and the National Scientific and Technological Support Projects of China (2011BAJ07B01). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

As research on global change has grown in depth and scope, Land Use and Land Cover Change (LUCC) has increasingly become a core part of global environmental change research [1], [2], [3]. Multispectral satellite imagery is an important data source for LUCC research [4], [5], [6], [7], [8], [9]. One of the most common uses of satellite images is the mapping of LUCC via image classification. Various methods or algorithms have been successfully employed in LUCC classification and change detection, including visual interpretation classification [10], unsupervised classification [11], supervised classification [12], [13] (e.g. artificial neural network algorithms [14], [15], support vector machine algorithms [16], [17]), object oriented classification [18], [19], and decision tree algorithms [20], [21]. Different methods have their own scope, advantages and disadvantages [22], [23], [24], [25], [26], [27]. Some new methods of land classification imagery that lack historical and coincidental ground information to either calibrate data, validate data or assess identification accuracy have been proposed [28], [29], [30], which can increase classification accuracy. However, some important classification steps including invariant feature identification, training samples establishment, classification accuracy assessment and so on all require human participation, which made the classification procedure hard to be carried out automatically.

To overcome the problems mentioned above, a promising solution in land cover classification is to better utilise a prior, high-precision classification map instead of independently classifying the remote sensing images. Jiang et al. proposed a simple but robust method for land cover classification using a prior classification map [1]. In that study, the prior high-precision classification map and the multispectral remote sensing image were first employed to obtain pure pixels and constitute a semi-automatic classification dataset of training samples. Principal component analysis (PCA) was then performed on the data in all spectral bands of each land cover class extracted from the region of interest. The satellite images in that study were automatically classified using only the prior land cover map, thus requiring less human interaction or interpretation. Jiang et al.'s classification results showed that the classification method is appropriate for different environmental condition land cover classification. Although Jiang et al.'s method was capable of producing a reasonably accurate land cover classification map in a cost-effective way, the method was only a semi-automatic approach, not an automatic one, because the key parameters used ( and ) could not be selected automatically. Two questions about these parameters arise to which clear answers are not available in the literature:

How should the parameters ( is the accumulation area threshold of a certain class of land cover) and ( is the area threshold for buffer analysis) be determined?
How should the optimal combination of (, ) be determined?

We address these two questions in this study, as they are the key and most important parts of the study of the semi-automatic approach to land cover classification based on multispectral remote sensing imagery [1]. The purpose of this article is to present the procedures of these analyses.

In spite of its limitations, an approach using “pure pixels” representative of the major land use classes as training samples to classify images is promising. Therefore, we developed a new approach for automatically selecting the key parameters using the efficient computer technique and the test algorithm of image classification accuracy based on the existing method, and we improved Jiang et al.'s strategy through automatically adjusting and choosing the vital two parameters. The proposed approach aims to achieve two objectives: (1) to select the optimal parameters and and (2) to determine the portfolio optimisation model (, ). In the proposed approach, the overall classification accuracy and their derived Kappa Coefficient (KC), which is widely used to assess the accuracy of classification results in application studies, is employed to select the optimal parameters and . TC (Time-Consuming), which signifies the classification efficiency, is employed to determine the best combination of the two parameters. The greatest challenge in our method is to properly select the training sample. We propose an iterated procedure to automatically select a different percentage of pure pixels as training samples based on a prior classification map to ensure that the method is completely automatic. This approach was evaluated by using it to generate a land cover classification of Weichang County in Hebei Province, China, applying a Landsat TM (Thematic Mapper) image and a prior land cover classification map. Our method is expected to be more practicable for automatic land cover classification than traditional classification methods or algorithms such as the Maximum Likelihood approach [13].

Methods

The proposed approach includes three main, interdependent components: the selection procedure for the pure-pixel training sample dataset, the method for determining the key parameters, and the optimal combination model. The general flowchart is shown in Fig. 1.

Download:

Figure 1. General flowchart of proposed approach (KC: Kappa Coefficient; TC: Time-Consuming).

https://doi.org/10.1371/journal.pone.0075852.g001

Automatic Selection of Pure-pixel Training Samples

Fundamentally, training samples provide descriptive statistical information for each class in multispectral remote sensing imagery that may be used to classify an image. The key step in each classification approach is the proper selection of the training samples. Traditionally, accurate training samples are selected manually depending entirely on the knowledge of the analyst or on field investigations, which reduce the automation of land cover classification [1]. Jiang et al. introduced a novel idea of extracting sterling pixels of land cover semi-automatically using an accurate, existing land cover dataset as prior knowledge. Similar to supervised classification or/and object-oriented classification, these selected pixels as training samples are used to characterise the classes and ultimately convey the information to three-dimensional feature space. Samples of different types of land cover selected have an accumulation area threshold () value as follows:(1)where is the accumulation area threshold of the ith class of land cover, is the area of the patches of the ith class of land cover sorted in descending order, x is the number of the ith class of land cover, and is the total area of the ith class of land cover. According to ecological theory, a joint region with different types of land cover is discarded during spatial buffer analysis, and the buffer analysis distance is variable. The buffer analysis distance is defined as follows:(2)where is area threshold for buffer analysis, is the buffer area of the patch with a distance of d. The variable d is negative, indicating that the representative area was reduced. A is the area of the patch. The pure pixels within buffer region are chosen as training samples, different buffer regions constitute the diversely automatic training samples collection, and the accuracy of the collection depends on the key parameters and . The combination of (, ) is used to determine which samples are selected optimally for land cover classification.

The drawback of Jiang et al.'s method was not an automatic one because the two critical parameters and were chosen to be 60% and 50%, respectively, by a series of experiments using data in 4 different test areas (the determination procedure we called test-decision), instead of by automatic calculation. These experiments used an exploratory rather than an automatic method because satisfactory classification results require the proper calibration of various model parameters. In order to choose and automatically, we propose an iterated procedure reliably based on computer technologies to ensure that the method is completely automatic. Additionally, an approach based on this iterated algorithm is employed to reduce the “salt-and-pepper” error that usually occurs in pixel-based classification methods. This iterated procedure refining more pure, sterling pixels within the changed/unchanged area as the training samples is expected to improve classification accuracy. After iteration, the final changed/unchanged sampling results are obtained and used for the images' classification. The iterated procedure used to determine the key parameters is shown as follows:

As described in Fig. 2, the key parameters are run through the classification process and determine the classification results by verifying the collection of the pure pixels that constitute the training samples dataset. This determination process is composed of a dual circulation. The purpose of the inner loop is to increase , which has an initial value of 10%, incrementally with 10% steps until it reaches 100%. In the outer loop, increases incrementally from 10% to 100% with 10% steps. In each re-circulation, the PCA, the establishment of three-dimensional feature space, classification, and post-classification is carried out in turn. Ultimately, there are many different levels of training sample datasets (are comprise of different buffer regions), and different levels of land cover classification results and different values of TC are generated, which help determine the key parameters. By evaluating the entire process, we adopt an enumeration method to determination the key parameters in the following part.

Download:

Figure 2. Flowchart of the iterated procedure used to determine the key parameters (

and

) (KC: Kappa Coefficient; TC: Time-Consuming).

https://doi.org/10.1371/journal.pone.0075852.g002

Determining the Key Parameters

Different values of and determine different classification results. Accurate classification results correspond to optimal classification parameters. In our strategy, we determined the key parameters through evaluating the classification results. The procedure of accuracy assessment of the classification results is carried out by comparing these results to a known, accurate classification map, which has been visually interpreted and classified using high spatial resolution images and validated by intensive field surveys. In LUCC research, KC is a very important index value to the accuracy assessment of land cover classification [31], [32], [33], [37] because KC value provides both a better overall measure of accuracy and incorporates information about the errors of omission and commission [34], [35].

Furthermore, the KC measures the association between the two inputs (the known classification map and the TM image) and helps to evaluate the output images (different classification images) [36], [37], [38]. KC value denotes the agreement degree between the two comparative maps/images. Blackman and Landis assigned a scale for Kappa values between 0 and 1 for the analysis of map agreement degree, and this scale has become the standard measure of agreement between maps in classification applications [39], [40]. According to the iterated procedure observed in Fig. 2, different accuracy levels of land cover classification results derived from variables and using the proposed approach are generated. Different group KC values are automatically calculated and a new matrix of KC values related to different combinations of and is also generated. For the Kappa value equal to or greater than 0.61 is considered to be in good agreement [39], [40]. Thus, we select the KCs whose values are equal to or greater than 0.61, and the key parameters and corresponding to those KC values are chosen for the alternative combination, implying that the KCs are in substantial agreement or perfect agreement.

Optimal Combination Model

In a general respect, the computer TC value is proportional to the image complexity or to the number of vector plaques of a map. A greater number of vector plaques or a more complex image results in a greater computer TC value. The manual multispectral remote sensing image classification by visual interpretation usually takes considerable time. In this sense, the computer TC value reflects the complexity of the vector graphics or remote sensing image as well as the artificial process.

We construct our portfolio optimisation model based on two principles:

If the requirements are met accurately and a small difference exists between the classification results (for example, the absolute value of difference of KC is no less than 0.01), we chose a less TC combination of and to form the optimal combination model.
If the requirements are met accurately and the increased rate of accuracy was significantly less than the time-lapse rate, we chose a combination of and with a time-lapse rate minimum to form the optimal combination model.

The selection process of portfolio optimisation model is shown in Fig. 3

Download:

Figure 3. Selection process of portfolio optimisation model (KC: Kappa Coefficient; TC: Time-Consuming).

https://doi.org/10.1371/journal.pone.0075852.g003

We choose the optimal combination model and determine the key parameters. Applying these two parameters to the semi-automatic classification proposed by Jiang et al., the fully automatic classification algorithm is formed. The problem of determination of the key parameters by test-decision is completely resolved.

Case Study

Study area and data sources

Study area.

A case study of Weichang County of Hebei Province, China (41°35′–42°40′N, 116°32′–118°14′E) is conducted to confirm the effectiveness of the proposed approach. This study area covers 9219 km² and encompasses over 25% of Zhangjiakou District, Hebei Province. Weichang is also a Manchu and Mongolian Autonomous County, the largest county in Hebei Province, and the most northern junction to the Inner Mongolia autonomous region (Fig. 4).

Download:

Figure 4. Location of the study area: Weichang County, Hebei Province, China.

https://doi.org/10.1371/journal.pone.0075852.g004

There are six types of land cover in the study area: cropland, forestland, grassland, water, residential/construction land, and bare land. Forestland and grassland is dominant, next is cropland, and residential/construction land, water and bare land are relatively fewer. The study area is located in the transition zone of the Inner Mongolia Plateau and the northern Hebei Mountains, with an elevation gradient ranging from 750 meters to 2067 meters above sea level [41]. There are three types of area distinguished based on changing degrees of land cover: dramatic change area, moderate changes area, and little changes area. In the central and southern regions of Weichang County, similar to other cities in China, around the county town of Weichang expanded rapidly in the last decade, and rapid economic growth in the areas with residential/construction land extended to over 100 km² from 1995 to 2010.This growth caused the dominant land change of the area to be a loss of cropland, grassland and forestland. These areas belong to dramatic changes area. In the north and northeast of Weichang County, there are lots of natural forest conservation regions growing with a large number of deciduous, pine, etc, with little land cover change, belonging to little change area. In the east and west of Weichang County, some types of land cover have changed, but the change is not very significant. For example, some cropland was restored to forestland since the implementation of the “Returning crops to forest” policy in 2000. Due to the three different types of land cover changing degrees of land use across the area, we select Weichang County as an ideal case study to evaluate the effective automatic approach.

Data sources.

TheLandsat-5/TM data of 2010 (WRS-2 123/31 for 2010/8/24) and 1∶100,000 land cover maps of two dates are used for this experiment. The known classification maps were produced by the Chinese Academy of Sciences (CAS) with consistent classification schemes that have an overall accuracy of 95% for all land use classes validated by intensive field surveys [1], [42], [43]. Here, we acquired the land cover maps of 2005 and 2010 from CAS, which were visually interpreted and classified using high spatial resolution images (QuickBird) and field surveys, respectively. The map of land cover of 2005 was used as prior knowledge for choosing the pure-pixel training samples, while the land cover map of 2010 was used as a reference map for assessing classification accuracy. The multispectral TM image was radiometrically corrected by CAS. The resulted image covers the whole area of the Weichang County that was used for land cover classification.

Determination and

As a critical component of the proposed methodology, the effect of the training sample automatic selection is determined by the two parameters and . These key parameters are determined using the iterative procedure described in the previous section. We assess classification accuracy by calculating the KCs. As observed in Figure 2. The and are interval of (10%, 100%)with 10% step increments independently to generate totals of 100 land cover classification maps. Using equation of Kappa Coefficient [31], [32], 100 KC results are calculated (Table 1).

Download:

Table 1. The results of KC (Kappa Coefficient).

https://doi.org/10.1371/journal.pone.0075852.t001

Table 1 shows the calculated results for the KCs of different combinations of and . Clearly, the KC increases with and until it reaches its maximum, and it then decreases with increasing and values. As observed in Table 1, the values 0.692, 0.628, 0.621, 0.762, 0.760, 0.612, 0.623 and 0.619 are considered to represent land cover classifications in good agreement, and the values 0.762 and 0.760 show the best agreement among the results. We choose the and values, = 60%, = 60% and = 70%, = 60%, with the chosen combinations of and being (60%, 60%) and (70%, 60%). Because the difference between 0.762 and 0.760 is very small, the optimal combination is not immediately clear. According to the principles of the portfolio optimisation model, we recorded the TC value of each parameter changed as shown in Table 2 and mapped the relationship among and TC value, as shown in Figure 5.

Download:

Figure 5. Relationship among

,

and TC (Time-Consuming, unit: second).

https://doi.org/10.1371/journal.pone.0075852.g005

Download:

Table 2. TC (Time-Consuming, unit: second) of the proposed methodology with different parameters.

https://doi.org/10.1371/journal.pone.0075852.t002

Table 2 clearly shows the TC of the proposed methodology with different values for and . Fig. 5 shows the relationship among , and TC. The TC value of land cover classification increases as the two parameters' incremental change increases, in spite of the ratio of TC difference. As shown in Fig. 5, if the threshold for buffer analysis is below 60% or over 80%, the TC is greatly increased. If is in the interval of 60% to 80% (as marked with carmine colour in Fig. 4), land cover classification does not significantly increase the computational cost, which also proves that the combinations (60%, 60%) and (70%, 60%) are the optimal combinations. For , the accuracies of the two combinations are similar, while the value of TC based on = 60% performs notably better. The tendencies of the two lines are also similar. In detail, the classification accuracy based on = 70% is 0.762, while the accuracy based on P_a = 60% is 0.760. Both of the combinations maintain high accuracy with slight differences. However, the TC of the two lines = 60% and = 70% have prominent changes of 236 s (second) and 370 s, respectively. According to the prerequisite of classification accuracy, the TC of = 60% is less than that of = 70%, so the combination (60%, 60%) is chosen using the portfolio optimisation model.

Results

In light of the aforementioned results, we selected the pure-pixel samples based on the portfolio optimisation model (60%, 60%) as training samples for automatic classification. The land cover classification results are shown in Figure 6. Five types of land cover maps were compared to evaluate the result of the final classification: (1) the visual interpretation of land cover classes of 2010, recognised as relatively precise data(Fig. 6-b); (2) the classification result using Maximum Likelihood(ML) approach (Fig. 6-d); (3) the automatic classification of land cover of 2010 based on the TM image of the same area using the portfolio optimisation model (60%, 60%)(Fig. 6-e); (4) the automatic classification result using the combination model (20%, 20%)(Fig. 6-f); and (5) the automatic classification result using the combination model (80%, 80%)(Fig. 6-g).. Fig. 6 exhibits the data sources including the Landsat-5/TM image of 2010 (Fig. 6-a), the visual interpretation results of 2010 as a standard classified map (Fig. 6-b)and visual interpretation results of 2005 as a prior, exact classification map (Fig. 6-c). Figs. 6-e, 6-f and 6-g show the classification results based on different combination models using our proposed approach. 6-d displays the classification results using ML approach. The classification results (Figs. 6-e, 6-f and 6-g) show that forests and grasslands in Weichang are more predominant than the other four types of land cover, which is consistent with the known, accurate classification map (Fig. 6-b).

Download:

Figure 6. Comparison of land cover classification in Weichang.

https://doi.org/10.1371/journal.pone.0075852.g006

For better quantitative assessment, absolute values (pixel number) were converted to percentage values in each error matrix.As shown in Tables 3,4, 5 and 6, each table uses a different combination model/classification approach. For overall classification accuracy evaluation, the overall accuracy are 83.4% (using the portfolio optimisation model (60%, 60%)), 37.7% (using the combination model (20%, 20%)), 62.2% (using the combination model (80%, 80%)) and 58.4% (using the ML approach), respectively. Apparently, using the portfolio optimisation model can improve the overall classification accuracy significantly. Similarly, the commission errors and the omission errors using the portfolio optimisation model are reduced significantly than using other combination models or using ML classification approach. In order to test whether the KC values are statistical significance, a Z-test on the portfolio optimisation model, other combination models and using ML classification approach were performed respectively, as shown in Table 7.The calculated results showed P values in bold were statistically significant (p<0.0001). The values of asymptotic standard error (ASE) were all less than 0.0002. Statistical comparisons against percentage of pixels reveal a significant difference between the portfolio optimisation model and other combination models and using ML classification approach with confidence intervals (CI) values ranging interval difference.

Download:

Table 3. Error matrix of the combination model (80%, 80%).

https://doi.org/10.1371/journal.pone.0075852.t003

Download:

Table 4. Error matrix of the combination model (20%, 20%).

https://doi.org/10.1371/journal.pone.0075852.t004

Download:

Table 5. Error matrix of the combination model (60%, 60%).

https://doi.org/10.1371/journal.pone.0075852.t005

Download:

Table 6. Error matrix of common classification approach (Maximum Likelihood Approach).

https://doi.org/10.1371/journal.pone.0075852.t006

Download:

Table 7. Comparison of the Z values in each model/approach.

https://doi.org/10.1371/journal.pone.0075852.t007

As shown in the tables, the accuracies of the three combinations and ML approach are significantly different; the result of the portfolio optimisation model is much more accurate than that of the other combination models or ML approach. The structures of the four error matrices are also different. Water showed the highest individual classification accuracy due to its lower reflectance values, whereas there was much misclassification between forest and cropland, forest and grassland, and residential and construction land and bare land because of the similar reflectance values of these land cover types. Residential and construction land and bare land occupied small proportions of the entire study area, which increased the inaccurate effect on the map agreement maybe one of the misclassification reasons. Another main reason for misclassification was the land cover classification system, each type of land cover includes many subcategories, i.e. cropland includes two subcategories of paddy field and dry farming field, residential and construction land includes three subcategories of urban land, rural residential and other construction land, etc.

In Tables 3 and 4, whether by omission or commission error, much misclassification is apparent using the combination model (20%, 20%) and the combination model (80%, 80%). The fundamental reason for these errors in classification is related to the pure-pixel training samples. When the combination model (20%, 20%) is used, there are not enough pure-pixel training samples, which causes classification error. If the combination model (20%, 20%) is chosen, some pixel training samples that are not pure (we refer to these as “Noise”) are used as training samples, leading to classification error. Fig. 7 describes this problem as follows:

Download:

Figure 7. Sketch map of automatic dataset of pure-pixel training samples.

https://doi.org/10.1371/journal.pone.0075852.g007

As shown in the Fig. 7, the primary type of land cover was forest in 2005 (the entire range of the blue line in Fig. 7). In 2010, more land cover changed to grassland and bare land, due to human activities and natural environmental changes, respectively, (adjacent to the outer blue line, within the red line in Fig. 7). The area of pure-pixel training samples was (the area within the yellow line in Fig. 7) or (the area within the green line in Fig. 7) after being buffered inward by the distances d₁ or d2, respectively. Different values were calculated by applying equation 2. We found that if d₁ is too small, the pure-pixel training samples include much “Noise” (forest samples include many grassland or bare land samples), which causes misclassification. In a similar manner, if d₂ is too big, the number of the pure-pixel training samples are too low, which also causes classification error (grassland or bareland was classified forestland). Therefore, the buffer distance (transformed into which is understood easily) is a key parameter in the selection of the pure-pixel training samples. By the same token, is also a key determinant in the selection of these training samples. The selection of these two key parameters, ultimately constituting the optimal combination model (, ), which is closely related to the classification accuracy, is satisfactorily accomplished by automatic selection.

In order to test the applicability of our approach, we also applied our approach in other different regions chosen based on their changing degrees of land cover. The four regions are: 1) Anshan city in Liaoning Province, China, with moderate changes; 2) Neijiang county in Sichuang Province, China, with moderate changes; 3) Shuangtaihe natural conservation area in Liaoning Province, China, with little changes; 4) Qingpu District in Shanghai city, China, with dramatic changes. The testing results show that our method performs well in the regions with normal-to-high rates of land cover change, especially in rapid changing area. Despite the accuracy is slightly lower in regions with little land cover change, but is acceptable. The mainly changed rules were described in literature 1 [1]. The land cover with little change was natural affected without rules which increased error of ultimate result of land cover classification. Some following work, such as improving upon our algorithm our strategies (i.e. consider double-kernel combination method, narrow the double-loop step), may be useful in improving the performance of the method in the future.

Conclusions and Discussion

This study improved Jiang et al.'s strategy and developed a new approach for automatically selecting the key parameters used in land cover automatic classification. Three main, interdependent parts consist of our approach: the selection procedure of the pure-pixel training-sample dataset, the method to determine the key parameters, and the optimal combination model. The main achievements in this study include: (1) Selection the two key parameters automatically instead of using a test-decision, which avoids subjective bias and (2) Determination the portfolio optimisation model (, ) used to select the pure-pixel samples as training samples for automatic classification.

The study area experimental results showed that the methodology determining the key parameters can automatically select the portfolio optimisation model and the classification results based on different combination models using our proposed approach and using the ML approach demonstrate that: (1) The portfolio optimisation model produces more precise results, higher overall classification accuracy and lower omission errors/commission errors; (2) The portfolio optimisation model performs well in the region with normal-to-high degree changes of land cover and may have a wide scope of scientific application; (3) The proposed iterated training-sample selection process can refine the training samples and improve classification accuracy while does not significantly increase the computational cost.

Of course, we recognise that this new methodology has possible limitations. First, this methodology is subject to the limitations of the Jiang et al.'s classification model. If the original assumption is flawed, the selection of the two key parameters is undoubtedly affected. For example, Jiang et al. assume that the classification system of the prior, exact classification map and the subsequent image is the same. A new type of land cover will not be accurately classified if it is not available on the prior, exact classification map but appears on the subsequent image. In this case, we must first improve upon our original method (i.e. consider double-kernel combination classification instead of PCA and three-dimensional feature analysis). One promising option is to use a one-class classifier [44] to identify a new type of land cover because pure-pixel training samples of the new class are needed. Another limitation is that we are using a method that may be limited in scope to obtain the and to form the optimal combination model. The intervals between and values determine the accuracy of the new methodology. In this study, we set the two steps both equal to 10%. In setting these steps to 10%, perhaps we have obtained the approximate optimal values, rather than the global optimums. In the future, we must improve our algorithm to a narrower step (for example, 5% or 1%) to obtain an accurate global optimum.

Supporting Information

Figure S1.

Comparison of land cover classification of region with much land cover change (Anshan): (a) TM image of 2010;(b) land cover of 2005;(c) land cover of 2010 from visual interpretation; (d) land cover classified by our method.

https://doi.org/10.1371/journal.pone.0075852.s001

(TIF)

Figure S2.

Comparison of land cover classification of region with normal land cover change degree (Neijiang): (a) TM image of 2010;(b) land cover of 2005;(c) land cover of 2010 from visual interpretation; (d) land cover classified by the proposed method.

https://doi.org/10.1371/journal.pone.0075852.s002

(TIF)

Figure S3.

Comparison of land cover classification around Shuangtaihe natural conservation region: (a) TM image of 2010;(b) land cover of 2005;(c) land cover of 2010 from visual interpretation; (d) land cover classified our method.

https://doi.org/10.1371/journal.pone.0075852.s003

(TIF)

Figure S4.

Comparison of land cover classification of region with much land cover change (Qinpu): (a) TM image of 2009;(b) land cover of 2005;(c) land cover of 2009 from visual interpretation; (d) land cover classified by our method.

https://doi.org/10.1371/journal.pone.0075852.s004

(TIF)

Table S1.

Statistics of six land cover classes of the three classification results in region with much land cover change (Anshan).

https://doi.org/10.1371/journal.pone.0075852.s005

(DOCX)

Table S2.

Confusion matrix of two classification algorithms of Anshan, 2010.

https://doi.org/10.1371/journal.pone.0075852.s006

(DOCX)

Table S3.

Statistics of four land cover classes of the three classification results in region with normal land cover change (Neijiang).

https://doi.org/10.1371/journal.pone.0075852.s007

(DOCX)

Table S4.

Confusion matrix of two classification algorithms of Neijiang, 2010.

https://doi.org/10.1371/journal.pone.0075852.s008

(DOCX)

Table S5.

Statistics of six land cover classes of the three classification results in natural conservation region with little land cover change (Shuangtaihe).

https://doi.org/10.1371/journal.pone.0075852.s009

(DOCX)

Table S6.

Confusion matrix of two classification algorithms of Shuangtaihe natural conservation region, 2010.

https://doi.org/10.1371/journal.pone.0075852.s010

(DOCX)

Table S7.

Statistics of five land cover classes of the three classification results in natural conservation region with dramatic land cover change (Qinpu district).

https://doi.org/10.1371/journal.pone.0075852.s011

(DOCX)

Table S8.

Confusion matrix of two classification algorithms of QinPu district, 2009.

https://doi.org/10.1371/journal.pone.0075852.s012

(DOCX)

Acknowledgments

We thank Dr. Xiaopeng Qi in National Center for Public Health Surveillance and Information Services, Chinese Center for Disease Control and Prevention for her help in Z-test statistical analyses. We like to thank the editors and anonymous reviewers for their helpful remarks.

Author Contributions

Conceived and designed the experiments: YW DJ YH DZ. Performed the experiments: YW DJ YH. Analyzed the data: YW DJ WW. Contributed reagents/materials/analysis tools: YH DZ XY. Wrote the paper: YW DJ.

References

1. Jiang D, Huang Y, Zhuang D, Zhu Y, Xu X, et al. (2012) A Simple Semi-Automatic Approach for Land Cover Classification form Multispectral Remote Sensing Imagery. PLoS ONE 7 ((9)) e45889
- View Article
- Google Scholar
2. Liu JY, Deng XZ (2010) Progress of the research methodologies on the temporal and spatial process of LUCC. Chinese Sci Bull 55: 1354–1362.
- View Article
- Google Scholar
3. Denis R, Antoine T, Christian P, Claudine D (2011) Comparison of methods for LUCC monitoring over 50 years from aerial photographs and satellite images in a Sahelian catchment. International Journal of Remote Sensing 32 ((6)) 1747–1777.
- View Article
- Google Scholar
4. Daniel EO, Bethany AB, Jeff A, John FM, Steven PH (2011) How much is built? Quantifying and interpreting patterns of built space from different data sources. International Journal of Remote Sensing 32 ((9)) 2621–2644.
- View Article
- Google Scholar
5. Gutman G, Janetos AC, Justice CO, Moran EF, Mustard J, et al.. (2004) Land change science: Observing, monitoring and understanding trajectories of change on the earth's surface. New York: Kluwer Academic Publishers. pp. 108–118.
6. Abdullah SA, Nakagoshi N (2006) Changes in landscape spatial pattern in the highly developing state of Selangor, peninsular Malaysia. Landscape Urban Planning 77: 263–275.
- View Article
- Google Scholar
7. Veldkamp A, Lambin EF (2001) Predicting land-use change. Agriculture. Ecosystems and Environment 85: 1–6.
- View Article
- Google Scholar
8. Robert F, Ian O, Mélanie C, Alice D, Darren P (2012) A method for trend-based change analysis in Arctic tundra using the 25-year Landsat archive. Polar Record 48 ((1)) 83–93.
- View Article
- Google Scholar
9. Berberoğlu S, Akin A, Atkinson PM, Curran PJ (2010) Utilizing image texture to detect land-cover change in Mediterranean coastal wetlands. International Journal of Remote Sensing 31 ((11)) 2793–2815.
- View Article
- Google Scholar
10. Liu J, Liu M, Tian H, Zhuang D, Zhang Z, et al. (2005) Spatial and temporal patterns of China's cropland during 1990–2000: An analysis based on Landsat TM data. Remote Sensing of Environment 98: 442–456.
- View Article
- Google Scholar
11. Loveland T, Merchant J, Ohlen D, Brown J (1991) Development of a land-cover characteristics database for the conterminous U.S.. Photogrammetric Engineering and Remote Sensing 57: 1453–1463.
- View Article
- Google Scholar
12. Zhao Y (2003) The Application Principle and Method of Remote Sensing. Beijing: Science Press. pp. 57–73.
13. Richards JA, Jia X (1999) Remote Sensing Digital Imaging Analysis: an Introduction, third ed. Springer, Berlin. pp. 102–115.
14. Foody GM, Lucas RM, Curran PJ, Honzak M (1997) Nonlinear mixture modelling without end-members using an artificial neural network. International Journal of Remote Sensing 18 ((4)) 937–953.
- View Article
- Google Scholar
15. Wang YC, Feng CC (2011) Patterns and trends in land use/land cover change research explored using self-organizing map. International Journal of Remote Sensing 32 ((13)) 3765–3790.
- View Article
- Google Scholar
16. Robin P, Antoine C (2013) Spatial location and ecological content of support vectors in an SVM classification of tropical vegetation. Remote Sensing Letters 4 ((7)) 686–695.
- View Article
- Google Scholar
17. Jaime PG, Jean FM, Gerard M, Jordi C, Martí OM, et al. (2013) Enhanced land use/cover classification of heterogeneous tropical landscapes using support vector machines and textural homogeneity. International Journal of Applied Earth Observation and Geoinformation 23: 372–383.
- View Article
- Google Scholar
18. Geneletti D, Gorte BGH (2003) A method for object-oriented land cover classification combining Landsat TM data and aerial photographs. International Journal of Remote Sensing 24 ((6)) 1273–1286.
- View Article
- Google Scholar
19. Francisco FDS, John MK, Patrick L (2013) An object-oriented classification method for mapping mangroves in Guinea, West Africa, using multipolarized ALOS PALSAR L-band data. International Journal of Remote Sensing 34 ((2)) 563–586.
- View Article
- Google Scholar
20. Schneider A, Friedl MA, Potere D (2010) Mapping global urban areas using MODIS 500 - m data: New methods and datasets based on ‘urban ecoregions’. Remote Sensing of Environment 114: 1733–1746.
- View Article
- Google Scholar
21. Hansen M, Dubayah R, Defries R (1996) Classification trees: an alternative to traditional land cover classifiers. International Journal of Remote Sensing 17: 1075–1081.
- View Article
- Google Scholar
22. Weng Q (2011) Advances in Environmental Remote Sensing: Sensors, Algorithms and Applications. CRC Press/Taylor and Francis, Boca Raton, FL, USA. pp. 63–78.
23. Aitkenhead MJ, Aalders IH (2011) Automating land cover mapping of Scotland using expert system and knowledge integration methods. Remote Sensing of Environment 115 ((5)) 1285–1295.
- View Article
- Google Scholar
24. Chen J, Gong P, He C, Pu R, Shi P (2003) Land-use/land-cover change detection using improved change-vector analysis. Photogrammetric Engineering & Remote Sensing 69 ((4)) 369–379.
- View Article
- Google Scholar
25. Gong P, Mahler S, Biging G, Newburn D (2003) Vineyard identification in an oak woodland landscape with airborne digital camera imagery. International Journal of Remote Sensing 24: 1303–1315.
- View Article
- Google Scholar
26. Canty MJ (2006) Image Analysis, Classification and Change Detection in remote Sensing with Algorithms for ENVI/IDL. Taylor & Francis, CRC Press. pp. 77–82.
27. Lambin EF, Geist H (2006) Land-use and land-cover change: local processes with global impacts. New York: Springer. pp. 38–49.
28. Xie Y, Sha Z, Bai Y (2010) Classifying historical remotely sensed imagery using a tempo-spatial feature evolution (T-SFE) model. ISPRS Journal of Photogrammetry and Remote Sensing 65 ((2)) 182–190.
- View Article
- Google Scholar
29. Fortier J, Rogan J, Woodcock C, Runfola DM (2011) Utilizing temporally invariant calibration sites to classify multiple dates of satellite imagery. Photogrammetric Engineering & Remote Sensing 77 ((2)) 181–189.
- View Article
- Google Scholar
30. Xian G, Collin H, Fry J (2009) Updating the 2001 National Land Cover Database land cover classification to 2006 by using Landsat imagery change detection methods. Remote Sensing of Environment 113 ((6)) 1133–1147.
- View Article
- Google Scholar
31. Cohen J (1960) A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20: 37–46.
- View Article
- Google Scholar
32. Yvonne MMB, Stephen EF, Paul WH, Richard JL, Frederick M (1975) Discrete Multivariate Analysis: Theory and Practice. MIT Press, Cambridge, MA. pp. 58–67.
33. Jensen JR (2005) Introductory Digital Image Processing: A Remote Sensing Perspective. New York: Pearson Prentice Hall, 3rd ed., 495–515.
34. Allouche O, Tsoar A, Kadmon R (2006) Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology 43: 1223–1232.
- View Article
- Google Scholar
35. Foody G (2007) Map comparison in GIS. Progress in Physical Geography 31: 439–445.
- View Article
- Google Scholar
36. Congalton RG (1991) A review of assessing the accuracy of classifications of remotely sensed data. Remote Sensing of Environment 37: 35–46.
- View Article
- Google Scholar
37. Congalton R, Green K (1999) Assessing the accuracy of remotely sensed data: principles and practices, Boca Raton: CRR/Lewis Press. 137p.
38. Gupta M, Srivastava PK (2010) Integrating GIS and remote sensing for identification of groundwater potential zones in the hilly terrain of Pavagarh, Gujarat, India. Water International 35: 233–245.
- View Article
- Google Scholar
39. Landis J, Koch G (1977) The measurement of observer agreement for categorical data. Biometrics 33: 159–174.
- View Article
- Google Scholar
40. Blackman N, Koval J (2000) Interval estimation for Cohen's Kappa as a measure of agreement. Statistics in Medicine 19: 723–741.
- View Article
- Google Scholar
41. Baidu. Introduction of Mongolia Weichang Manchu Autonomous County, China. Available: http://baike.baidu.com/view/1214694.htm.
42. Liu J, Liu M, Deng X, Zhuang D, Zhang Z, et al. (2002) The land use and land cover change database and its relative studies in China. Journal of Geographical Sciences 12: 275–282.
- View Article
- Google Scholar
43. Liu J, Liu M, Zhuang D, Zhang Z, Deng X (2003) Study on spatial pattern of land-use change in China during 1995–2000. Science in China, Series D 46: 373–384.
- View Article
- Google Scholar
44. Chen J, Chen X, Cui X, Chen J (2011) Change vector analysis in posterior probability space: a new method for land cover change detection. IEEE Geoscience and Remote Sensing Letters 8 ((2)) 317–321.
- View Article
- Google Scholar

[ref1] 1. Jiang D, Huang Y, Zhuang D, Zhu Y, Xu X, et al. (2012) A Simple Semi-Automatic Approach for Land Cover Classification form Multispectral Remote Sensing Imagery. PLoS ONE 7 ((9)) e45889
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Liu JY, Deng XZ (2010) Progress of the research methodologies on the temporal and spatial process of LUCC. Chinese Sci Bull 55: 1354–1362.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Denis R, Antoine T, Christian P, Claudine D (2011) Comparison of methods for LUCC monitoring over 50 years from aerial photographs and satellite images in a Sahelian catchment. International Journal of Remote Sensing 32 ((6)) 1747–1777.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Daniel EO, Bethany AB, Jeff A, John FM, Steven PH (2011) How much is built? Quantifying and interpreting patterns of built space from different data sources. International Journal of Remote Sensing 32 ((9)) 2621–2644.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Gutman G, Janetos AC, Justice CO, Moran EF, Mustard J, et al.. (2004) Land change science: Observing, monitoring and understanding trajectories of change on the earth's surface. New York: Kluwer Academic Publishers. pp. 108–118.

[ref6] 6. Abdullah SA, Nakagoshi N (2006) Changes in landscape spatial pattern in the highly developing state of Selangor, peninsular Malaysia. Landscape Urban Planning 77: 263–275.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Veldkamp A, Lambin EF (2001) Predicting land-use change. Agriculture. Ecosystems and Environment 85: 1–6.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Robert F, Ian O, Mélanie C, Alice D, Darren P (2012) A method for trend-based change analysis in Arctic tundra using the 25-year Landsat archive. Polar Record 48 ((1)) 83–93.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. Berberoğlu S, Akin A, Atkinson PM, Curran PJ (2010) Utilizing image texture to detect land-cover change in Mediterranean coastal wetlands. International Journal of Remote Sensing 31 ((11)) 2793–2815.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Liu J, Liu M, Tian H, Zhuang D, Zhang Z, et al. (2005) Spatial and temporal patterns of China's cropland during 1990–2000: An analysis based on Landsat TM data. Remote Sensing of Environment 98: 442–456.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref11] 11. Loveland T, Merchant J, Ohlen D, Brown J (1991) Development of a land-cover characteristics database for the conterminous U.S.. Photogrammetric Engineering and Remote Sensing 57: 1453–1463.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref12] 12. Zhao Y (2003) The Application Principle and Method of Remote Sensing. Beijing: Science Press. pp. 57–73.

[ref13] 13. Richards JA, Jia X (1999) Remote Sensing Digital Imaging Analysis: an Introduction, third ed. Springer, Berlin. pp. 102–115.

[ref14] 14. Foody GM, Lucas RM, Curran PJ, Honzak M (1997) Nonlinear mixture modelling without end-members using an artificial neural network. International Journal of Remote Sensing 18 ((4)) 937–953.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref15] 15. Wang YC, Feng CC (2011) Patterns and trends in land use/land cover change research explored using self-organizing map. International Journal of Remote Sensing 32 ((13)) 3765–3790.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref16] 16. Robin P, Antoine C (2013) Spatial location and ecological content of support vectors in an SVM classification of tropical vegetation. Remote Sensing Letters 4 ((7)) 686–695.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref17] 17. Jaime PG, Jean FM, Gerard M, Jordi C, Martí OM, et al. (2013) Enhanced land use/cover classification of heterogeneous tropical landscapes using support vector machines and textural homogeneity. International Journal of Applied Earth Observation and Geoinformation 23: 372–383.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref18] 18. Geneletti D, Gorte BGH (2003) A method for object-oriented land cover classification combining Landsat TM data and aerial photographs. International Journal of Remote Sensing 24 ((6)) 1273–1286.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref19] 19. Francisco FDS, John MK, Patrick L (2013) An object-oriented classification method for mapping mangroves in Guinea, West Africa, using multipolarized ALOS PALSAR L-band data. International Journal of Remote Sensing 34 ((2)) 563–586.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref20] 20. Schneider A, Friedl MA, Potere D (2010) Mapping global urban areas using MODIS 500 - m data: New methods and datasets based on ‘urban ecoregions’. Remote Sensing of Environment 114: 1733–1746.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref21] 21. Hansen M, Dubayah R, Defries R (1996) Classification trees: an alternative to traditional land cover classifiers. International Journal of Remote Sensing 17: 1075–1081.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref22] 22. Weng Q (2011) Advances in Environmental Remote Sensing: Sensors, Algorithms and Applications. CRC Press/Taylor and Francis, Boca Raton, FL, USA. pp. 63–78.

[ref23] 23. Aitkenhead MJ, Aalders IH (2011) Automating land cover mapping of Scotland using expert system and knowledge integration methods. Remote Sensing of Environment 115 ((5)) 1285–1295.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref24] 24. Chen J, Gong P, He C, Pu R, Shi P (2003) Land-use/land-cover change detection using improved change-vector analysis. Photogrammetric Engineering & Remote Sensing 69 ((4)) 369–379.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref25] 25. Gong P, Mahler S, Biging G, Newburn D (2003) Vineyard identification in an oak woodland landscape with airborne digital camera imagery. International Journal of Remote Sensing 24: 1303–1315.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref26] 26. Canty MJ (2006) Image Analysis, Classification and Change Detection in remote Sensing with Algorithms for ENVI/IDL. Taylor & Francis, CRC Press. pp. 77–82.

[ref27] 27. Lambin EF, Geist H (2006) Land-use and land-cover change: local processes with global impacts. New York: Springer. pp. 38–49.

[ref28] 28. Xie Y, Sha Z, Bai Y (2010) Classifying historical remotely sensed imagery using a tempo-spatial feature evolution (T-SFE) model. ISPRS Journal of Photogrammetry and Remote Sensing 65 ((2)) 182–190.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref29] 29. Fortier J, Rogan J, Woodcock C, Runfola DM (2011) Utilizing temporally invariant calibration sites to classify multiple dates of satellite imagery. Photogrammetric Engineering & Remote Sensing 77 ((2)) 181–189.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref30] 30. Xian G, Collin H, Fry J (2009) Updating the 2001 National Land Cover Database land cover classification to 2006 by using Landsat imagery change detection methods. Remote Sensing of Environment 113 ((6)) 1133–1147.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref31] 31. Cohen J (1960) A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20: 37–46.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref32] 32. Yvonne MMB, Stephen EF, Paul WH, Richard JL, Frederick M (1975) Discrete Multivariate Analysis: Theory and Practice. MIT Press, Cambridge, MA. pp. 58–67.

[ref33] 33. Jensen JR (2005) Introductory Digital Image Processing: A Remote Sensing Perspective. New York: Pearson Prentice Hall, 3rd ed., 495–515.

[ref34] 34. Allouche O, Tsoar A, Kadmon R (2006) Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology 43: 1223–1232.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref35] 35. Foody G (2007) Map comparison in GIS. Progress in Physical Geography 31: 439–445.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref36] 36. Congalton RG (1991) A review of assessing the accuracy of classifications of remotely sensed data. Remote Sensing of Environment 37: 35–46.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref37] 37. Congalton R, Green K (1999) Assessing the accuracy of remotely sensed data: principles and practices, Boca Raton: CRR/Lewis Press. 137p.

[ref38] 38. Gupta M, Srivastava PK (2010) Integrating GIS and remote sensing for identification of groundwater potential zones in the hilly terrain of Pavagarh, Gujarat, India. Water International 35: 233–245.
View Article
Google Scholar

[95] View Article

[96] Google Scholar

[ref39] 39. Landis J, Koch G (1977) The measurement of observer agreement for categorical data. Biometrics 33: 159–174.
View Article
Google Scholar

[98] View Article

[99] Google Scholar

[ref40] 40. Blackman N, Koval J (2000) Interval estimation for Cohen's Kappa as a measure of agreement. Statistics in Medicine 19: 723–741.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref41] 41. Baidu. Introduction of Mongolia Weichang Manchu Autonomous County, China. Available: http://baike.baidu.com/view/1214694.htm.

[ref42] 42. Liu J, Liu M, Deng X, Zhuang D, Zhang Z, et al. (2002) The land use and land cover change database and its relative studies in China. Journal of Geographical Sciences 12: 275–282.
View Article
Google Scholar

[105] View Article

[106] Google Scholar

[ref43] 43. Liu J, Liu M, Zhuang D, Zhang Z, Deng X (2003) Study on spatial pattern of land-use change in China during 1995–2000. Science in China, Series D 46: 373–384.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref44] 44. Chen J, Chen X, Cui X, Chen J (2011) Change vector analysis in posterior probability space: a new method for land cover change detection. IEEE Geoscience and Remote Sensing Letters 8 ((2)) 317–321.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

Figures

Abstract

Introduction

Methods

Automatic Selection of Pure-pixel Training Samples

Determining the Key Parameters

Optimal Combination Model

Case Study

Study area and data sources

Study area.

Data sources.

Determination and

Results

Conclusions and Discussion

Supporting Information

Acknowledgments

Author Contributions

References