Developing and Integrating Advanced Movement Features Improves Automated Classification of Ciliate Species

Ali Soleymani; Frank Pennekamp; Owen L. Petchey; Robert Weibel

doi:10.1371/journal.pone.0145345

Abstract

Recent advances in tracking technologies such as GPS or video tracking systems describe the movement paths of individuals in unprecedented details and are increasingly used in different fields, including ecology. However, extracting information from raw movement data requires advanced analysis techniques, for instance to infer behaviors expressed during a certain period of the recorded trajectory, or gender or species identity in case data is obtained from remote tracking. In this paper, we address how different movement features affect the ability to automatically classify the species identity, using a dataset of unicellular microbes (i.e., ciliates). Previously, morphological attributes and simple movement metrics, such as speed, were used for classifying ciliate species. Here, we demonstrate that adding advanced movement features, in particular such based on discrete wavelet transform, to morphological features can improve classification. These results may have practical applications in automated monitoring of waste water facilities as well as environmental monitoring of aquatic systems.

Citation: Soleymani A, Pennekamp F, Petchey OL, Weibel R (2015) Developing and Integrating Advanced Movement Features Improves Automated Classification of Ciliate Species. PLoS ONE 10(12): e0145345. https://doi.org/10.1371/journal.pone.0145345

Editor: Jesus Malo, Universitat de Valencia, SPAIN

Received: May 5, 2015; Accepted: December 2, 2015; Published: December 17, 2015

Copyright: © 2015 Soleymani et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

Data Availability: All relevant data of ciliate movement trajectories will be publicly available upon the acceptance of the manuscript.

Funding: AS and FP were supported by the University of Zurich. Contributions of OLP were supported by the University of Zurich Research Priority Program on ‘Global Change and Biodiversity’ (URPP GCB).

Competing interests: The authors have declared that no competing interests exist.

Introduction

Over the past decades, various tracking technologies such as the Global Positioning System (GPS) and sophisticated video techniques have become accessible to scientists and enabled the recording of large amounts of data about the movement paths of individual organisms [1–4]. GPS tags or collars have the advantage that auxiliary information on the individual can be collected when the device is attached, which can subsequently help in understanding the differences between collected movement paths. Typically the target of these movement analyses is to infer movement patterns corresponding to behaviors such as foraging or dispersal [5,6] or to link the genotype and behavioral phenotype of organisms [7]. However for inferring other sorts of information such as gender or species, remote techniques such as video tracking are neither capturing nor marking the individual and hence auxiliary information on the species or gender of the tracked individual is not known. Previous studies found that it is possible to distinguish trajectories based on individual features such as their genotype and gender [7], the degree of light availability [8] or whether individuals were in resource poor or rich environments [9]. Moreover, individual movement may also be indicative of the internal state of the moving individual, which can be used to evaluate the effect of toxic substances in the environment or laboratory based toxicity assays [10]. Developing techniques to infer behavior from movement paths is an active field of research [11–14] especially for GPS-based movement data, but here we focus on classifying trajectories regarding species identity. This problem is less well studied as remote tracking studies, where multiple species (or genotypes) interact, are still relatively rare in ecology, but are expected to increase rapidly with high-throughput analysis based on image and video analysis [3]. Regardless of the tracking technique used, all these applications have in common that characteristic features of the movement have to be associated to known classes, such as behavior or species identity, which is generally referred to as movement classification.

Movement classification represents a particular set of problems, where either entire movement paths (trajectories) or parts of trajectories (i.e. subtrajectories) are assigned to classes with respect to some a priori unknown criterion. As in all classification problems, training in which the class is known is used to infer criteria (characteristic features of the data) that are able to reliably predict the class of unknown cases. Here, we address how different features of the movement data contribute to classification accuracy.

In particular, we examine how movement data can contribute to classifying different species of ciliates (Kingdom Protozoa, Alveolata, Ciliophora). Ciliates are widely found in all types of fresh-water and marine environments and fulfill important functions in natural ecosystems such as controlling the abundance of bacteria by predation and are themselves important food for small invertebrates such as crustaceans (e.g. Daphnia water fleas) [15,16]. Ciliates are also widely used as model organisms in studies in ecology and evolutionary biology where experimental microcosms (i.e. small-sized standardized containers with tight environmental control) are used [17]. Only recently due to the advent of automated video analysis, quantitative traits such as movement (e.g. speed, linearity) and morphology (e.g. cell size, cell shape) can be measured on large numbers of individuals automatically and hence are explicitly considered in such microcosm studies [17].

Morphological attributes are commonly used to classify ciliate species [18–21]. Our goal here is to investigate how movement of ciliates can contribute to their classification, as well as the performance of movement features only in the classification. We make this distinction to draw general conclusions accounting for cases where information on morphology is missing and only movement features as classification inputs are available. Microbial species are often characterized by little morphological differentiation, even though they are known to be physiologically and genetically diverse [22]. Hence, movement behavior may be a better indicator of taxonomy than morphology, or at least assist with morphological based classification. Automated video based classification of ciliate species has potential application in different fields, for instance for the automated monitoring of waste water facilities as well as environmental monitoring of aquatic systems more broadly [21].

Whereas previous analysis of the data has shown that movement can improve classification [23], here we aim to systematically explore the contributions of more sophisticated movement analysis techniques to classification. Feature extraction from movement data is complicated by two characteristics of movement. First, considering that movement operates through space and time, representing and integrating both of these domains remains a challenge [24]. Respectively in movement classification, relevant features in the spatial and temporal domains should be extracted in order to capture spatiotemporal (as opposed to separate spatial, or temporal) characteristics of the moving individual under study. Second, the patterns underlying the movement classes might relate to multiple spatial and temporal scales (i.e. instantaneous, diurnal or seasonal) and using only the original temporal granularity for calculating MPs is a strong oversimplification of actual movement patterns [25]. Thus, distinguishing features may only become apparent if multiple analysis scales are considered [26].

In this study, wavelet analysis is investigated as a cross-scale analysis approach for extracting features in movement classification. While the related technique of Fourier transform is helpful for identifying periodicities in stationary time series, it will fail on time series where periodicity occurs only irregularly through the data set [27,28]. This is the case for most movement time series, as these are often non-homogeneous, made up of a combination of discrete behaviors. For example, animals may spend more time in a nesting or resting place and thus show only limited movement [29]. In contrast, other places may be used intermittently for foraging and animals may show more movements and hence higher activity [30]. Hence, we test whether integrating features based on wavelet analysis could improve classification due to its ability to detect non-stationary patterns in movement data, where transient types of activity occur. Moreover, features based on the wavelet transform can also be useful for relating these activities to other factors (e.g. physiological, ecological, contextual, etc.) affecting movement [31,32]. Thus, features based on wavelet transform are considered as a complementary tool for identifying the elements of periodic patterns in the movement data.

The contributions of this paper are two-fold. First, we develop a model for movement classification purely based on quantitative features, where each feature measures particular aspects of movement. Three sets of movement features are used (movement parameters only, approximate entropy (ApEn), and wavelet coefficients) and compared to the baseline model that uses only morphological features. We show how gradually adding features improves the performance of the classification model. Secondly, we demonstrate that careful selection and integration of movement features will lead us to comparable results, irrespective of the classification method employed, i.e. decision trees (DT) vs. support vector machines (SVM). Although the results of the classification method might differ among the individual sets of features used, once all features are integrated, the obtained results are comparable between the two classification methods.

Methods and Materials

Overview of movement classification

As in any general classification problem, several steps need to be taken in movement classification in order to make the transition from the observational movement data to the final classes, which we have schematized in Fig 1. The movement parameters (MP, e.g. speed, acceleration, turning angle, etc. [33]) are calculated from the raw movement data. Since trajectories are ordered by time, we get a time series of MP values, which we call an MP profile. The obtained MP profiles are converted to a set of feature vectors, on which statistical descriptors (i.e. mean, standard deviation, median, etc.) may be computed. In this study, we use approximate entropy and discrete wavelet transform to provide additional features. The classification model is built by using the relevant extracted features as quantitative inputs for the model and relating these to the known classes.

Download:

Fig 1. Overview of the movement classification process.

a) raw trajectories of two species of ciliates consisting of time-stamped X- and Y coordinates; b) movement parameters are calculated from the locations and MP profiles through time are obtained; c) extraction of features, for instance, summary statistics of movement parameters (upper panel) or wavelet coefficients (lower panel); d) classification of species based the movement features extracted.

https://doi.org/10.1371/journal.pone.0145345.g001

Extraction of movement features.

Seven movement parameters (i.e. distance travelled, speed, acceleration, turning angle, angular velocity, meandering and sinuosity) were calculated. These values were aggregated into concise representations (i.e. features) to be used in the classification. Features can be related to individual fixes [25], to a short series of fixes, for instance by segmenting the trajectories [34], or to all fixes in a recorded trajectory [26]. Here, we consider two categories of features: aggregate features computed on the whole trajectories and features based on the wavelet transform.

Aggregate movement features.

Moment statistics of movement parameters are the most common form of aggregate features used in classification models. By describing general variations present in the movement data, these features may already differentiate between movement classes at a certain scale. Different moment statistics may be used, such as minimum, maximum, median, mean, standard deviation, etc. However, in the transition from the raw movement trajectories to the summarized representation of classification features, an information loss will be introduced: by the use of only aggregated features at a certain scale, clearly not all aspects of movement can be detected [26].

Therefore, we also used ApEn values as an added feature in the classification model [35,36]. ApEn is a method from time series analysis for quantifying regularities and fluctuations in sequential data [35]. Since moment statistics might ignore subtle changes in the structure of MP profiles, ApEn values are calculated to investigate the regularity or to detect dominant fluctuations in such profiles. As a measure of system complexity, higher values of ApEn suggest a more random distribution (i.e. less predictable profile with complex structure), while a smaller value implies less complexity and more regularity (i.e. highly structural profiles containing many repetitive patterns). In order to better distinguish between movement classes, approximate entropy of MP profiles can be used to show how the structural complexity of particular movement parameters varies over time [36].

Feature extraction based on wavelet analysis.

Based on the MP profiles, the discrete wavelet transform (DWT) was used in order to decompose the movement signal into different levels (see S1 Text, S1 and S2 Figs for a detailed description of the wavelet transform performed). Wavelet analysis can reveal features such as temporal autocorrelation or periodicity patterns in the movement data [37,38], which may go undetected through the aggregate features mentioned before. In each decomposition level, an approximation and a detail sub-band is obtained, yielding two sets of wavelet coefficients. These two sets of information are sufficient to reconstruct the signal [39,40]. Three moment statistics of wavelet coefficients in each sub-band were considered as input features. These include the mean of the absolute values of the coefficients in each sub-band; average power of the wavelet coefficients in each sub-band; and standard deviation of the coefficients in each sub-band. There are two parameters to be set in a DWT analysis: the first is the choice of mother wavelet function, through which the signal is passed in order to characterize the variations. All the wavelets used at different levels of decompositions are scaled and shifted versions of the same mother wavelet function. A Daubechies wavelet (db4) was chosen as the mother wavelet function, due to its superior performance, and order 4 selected to detect the discontinuities in the signals [30,41]. The second parameter is the number of the decomposition levels to provide approximation and detail sub-band at different scales. Considering that possible decomposition levels depend on the length of the trajectories, this was chosen to be 5 in this study.

Other studies have used the distance travelled and speed in the wavelet analysis [30,31,38]. For both, periodic patterns in the profile may be expected whereas for other parameters, it would be rather difficult to interpret the periodicity occurring in the profiles. In the experiments reported here, we used the profiles of the distance travelled to extract the wavelet-based features.

Experiment

Model species

We used 8 species of small, single-celled ciliates as model species for this study: Paramecium caudatum, Paramecium aurelia, Blepharisma japonicum, Colpidium striatum, Colpidium campylum, Cyclidium glaucoma, Tetrahymena thermophila and Loxocephalus sp.

Each ciliate stock was cultured separately in a jar of 240 ml volume covered by aluminum cover to allow air exchange but prevent contaminations. Jars contained protist pellet medium (Carolina Biological Supplies), at a concentration of 0.55 g per liter of Chalkley’s medium and two wheat seeds for slow nutrient release. In addition, the medium contained three bacterial species (Serratia fonticola, Brevibacillus brevis and Bacillus subtilis) as food source for the ciliates. Jars were kept in a temperature-controlled incubator at 15° Celsius. Stocks were transferred monthly by pipetting a small subsample of the previous culture into a jar prepared as described above. Because the different ciliates used show quite pronounced intrinsic differences in cell density under the same culture conditions [17], variable numbers of trajectories were obtained per species.

Data collection

Sampling was done on two dates (24.03.2014 and 07.04.2014) with cultures being 20 days old and thus in the stationary phase. We collected movement trajectories by videoing subsamples of the cultures. To do so, we transferred 1 ml of ciliate culture into a Sedgewick Rafter counting chamber, which was placed under the objective of a stereomicroscope (Leica M205 C) at 25x magnification. We took 20 second video sequences at a frame rate of 25 frames per second using a mounted digital CMOS camera (Hamamatsu C11440) resulting in a total of 500 frames. Dark field illumination was used such that ciliates, transparent in bright field microscopy, appear white on black background; this greatly facilitates the segmentation of videos. We used the software BEMOVI to extract morphological features and movement trajectories of individual cells [23]. Six morphological attributes were extracted for each fix: grey value (pixel intensity from 0 [black] to 255 [white]), area (i.e., cross section), the perimeter, major and minor axes of a fitted ellipse and the aspect ratio (i.e. minor axis/major axis [AR]). Trajectories were filtered by a standardized procedure to get rid of spurious trajectories due to swimming debris: trajectories for analysis were required to show a minimum net displacement of at least 50 pixel, 10 fixes per trajectory and a detection rate of 80% (i.e. a trajectory with a duration of 10 frames has to have at least 8 fixes) and a median step length of greater than 2 pixels. This resulted in 3957 trajectories in total.

Analysis

Different movement features sets were first tested to assess their predictive power for finding species classes. These include all combinations of aggregate movement parameter (MP), approximate entropy (ApEn) and wavelet (Wav) features, leading to 7 movement models including: MP, ApEn, Wav, MP+ApEn, MP+Wav, ApEn+Wav, MP+ApEn+Wav. According to their performance, selected movement feature sets are later integrated to morphological features. The feature sets selected for this study (after initial performance evaluation) and the numbers of features are listed below:

Morphology: Mean and standard deviation for the 6 morphological attributes along the trajectory (12 features per trajectory)
MP only: Mean, standard deviation and median values for 7 movement parameters, i.e. distance travelled, speed, acceleration, turning angle, angular velocity, meandering and sinuosity (21 features per trajectory)
MP+ApEn: Adding ApEn values for all the movement parameter profiles to the MP model (7 additional features per trajectory; total of 28 [= 21+7]).
MP+ApEn+Wav: Adding wavelet features using profiles of the distance travelled (30 additional features; total of 58 [= 28+30])
MP+Morph: Integrates 21 MP only features and 12 morphology features (total of 33 [= 21+12])
MP+ApEn+Wav+Morph: Integrates all features, i.e. morphology and all features based on movement (total of 70 [= 58+12])

Since the number of the features notably increases, a feature selection process was employed to determine the ultimately relevant features in the classification. An evolutionary feature selection process by Genetic Algorithms (GA) in conjunction with the classifier (i.e. DT and SVM) was used to evaluate the significance of the added features in the classification. For SVM, we applied a radial basis function (RBF) with two kernel parameters of C = 20, which is a penalty parameter imposing a tradeoff between training error and generalization performance of SVM classifier and γ = 0.001, which is an exponent factor in the RBF function. In case of DT, a top-down procedure is applied based on the CART learner to traverse the tree, using the following parameter setting: maximal depth of tree = 20, minimal size for split = 4 and confidence value of 0.25. The reported results are based on a 10-fold cross-validation for both classifiers in the feature selection process, with the following parameter settings for GA: population size: 10, number of generations = 30, probability of cross-over = 0.5 and probability of mutation = 1 / (number of features).

For the evaluation of the performance of classification models, the overall classification accuracy and the Kappa coefficient are used. Kappa values are helpful when there is an imbalance in the number of instances between the classes [42], which is the case in our dataset. In case of individual species classes, precision and recall values were measured. Precision is calculated as True Positive / (True Positive + False Positive), whereas recall is defined as True Positive / (True Positive + False Negative).

Results

Contrasting morphology and movement features

The individual confusion matrices shown in Fig 2, as well as the overall accuracy and kappa values for different models (shown below the confusion matrices), allow to contrast movement and morphology features. The baseline Morphology model is quite successful in classifying most of the species, except for Blepharisma, C. campylum and P. aurelia which have low recall and precision values in both SVM and DT cases (Fig 2a). Overall, the Morphology model based on SVM reaches a classification accuracy of 86% and Kappa value of 0.82, which is comparable to the result of the decision tree with an accuracy of 85% and Kappa of 0.81 (Fig 2a). In contrast, classification accuracy based on the MP only model had a considerably lower accuracy of 70% and Kappa value of 0.61 (SVM), and 59% and Kappa of 0.44 (DT) than the baseline morphology only model (Fig 2b). Adding ApEn features led to a small increase using SVM, whereas classification accuracy of the DT slightly decreased (Fig 2c). Further adding wavelet features led to further classification improvement for both classification methods (Fig 2d). The combination of simple aggregate movement features and morphology improved the accuracy by 8% for both classifiers (Kappa 0.92 and 0.9 for SVM and DT, respectively) compared to the morphology only baseline. (Fig 2e). Importantly, the final classification model, which integrates both morphological and all movement features, resulted in similar performances of both classifiers: 95% classification accuracy and Kappa of 0.94 in case of SVM, and 94% accuracy and Kappa of 0.93 for DT (Fig 2f). Whereas the increase due to wavelet and ApEn features in addition to simple MPs looks small with only about 1–2% overall, species-specific improvements (especially for underrepresented species like Blepharisma and P. aurelia) in accuracy and recall may justify the inclusion of advanced features such as wavelets (Fig 2e and 2f). Although DT performed generally less well than SVM for all movement-based features, once morphological features were integrated it performed as well as SVM. Although the two classification methods used different numbers of features (29 vs. 43 for SVM and DT, respectively) to reach such similar classification success, individual features from all feature sets were used in both cases, highlighting the complementary information content in each feature set (Table 1).

Download:

Fig 2. Confusion matrices obtained from SVM (left) and DT (right) by using different feature sets (sections a-f).

The classification precision and recall values are shown for each class in all the tables. The cells are colored in order to indicate the classification precision for each class. Overall classification accuracy and Kappa values are shown below each confusion matrix. Although SVM generally outperforms DT, once both movement and morphology features are integrated, the results are very much comparable (section e and f).

https://doi.org/10.1371/journal.pone.0145345.g002

Download:

Table 1. Number of selected features using SVM and DT in the final classification stage.

https://doi.org/10.1371/journal.pone.0145345.t001

In order to compare the performance of different models to the morphology baseline, we looked at the difference in the overall classification accuracy and Kappa values between particular movement feature sets and morphology (Fig 3). The movement-based features, on their own, were inferior in both accuracy and Kappa compared to the baseline. However, there is an improvement compared to the baseline once complementary features sets are added to the classification model. When morphology and all movement features were integrated, the reported classification accuracy (9%) and Kappa coefficients (0.12) improved substantially, for both SVM and DT, compared to morphology only.

Download:

Fig 3. Comparison of the overall classification accuracy and Kappa coefficient using SVM and DT.

Kappa values are scaled from [0–1] to [0–100], in order to make them comparable with accuracy values. The morphological model is considered as the baseline (0 on the Y axis) and the deviation of models using different feature sets are compared (model—baseline). The error bars shown for each bar plot are derived from the different folds of the cross validation and assist to judge the significance of the increase. Classification based on movement features fares less well than morphology alone, but once integrated, movement features increase both the classification accuracy and Kappa coefficient by about 10%.

https://doi.org/10.1371/journal.pone.0145345.g003

Species classification based on movement features alone

As classification of ciliate species based on movement features is relatively uncommon in the literature, we here compare the models based on movement features only, which would be useful if no information on morphology is available, or the morphology information is unreliable. Classification based on these sets of movement features revealed that certain features perform better than others when classifying species (Fig 4). Unsurprisingly, ApEn and the wavelet features alone were less successful in predicting species compared to the MP features, as they only characterize specific aspects of movement. However, once integrated with MP features, they increased the classification performance in almost all cases. Three species (i.e., Cyclidium, Tetrahymena and Loxocephalus) were quite well predicted by movement features alone, regardless of the classification method, whereas two classes (i.e., Blepharisma and P. aurelia) failed to be correctly predicted by any of the movement feature sets alone (Fig 4). Advanced movement features seem most important for Colpidium striatum and C. campylum, although the performance increase is only subtle when only movement features are considered. However, overall there is a steady improvement in the performance of both classifiers when movement features are added.

Download:

Fig 4. Precision values of predicted ciliate species based on different movement feature sets.

MP is based on 21 general movement features, ApEn are the 7 ApEn features and Wav is based on 30 wavelet features. For the overlapping cases (e.g. when the precision values are zero), only the dots for one class (e.g. Blepharisma in grey) are shown.

https://doi.org/10.1371/journal.pone.0145345.g004

Discussion

Our results demonstrate that: 1) although classification models based only on movement features do not perform as well as morphological features, the integration of both feature sets results in better classification performance than each set alone; 2) adding movement features that are complementary to simple MPs aggregated on the trajectory level increases the classification success overall only slightly, although their contribution can be important for particular classes (especially underrepresented species in the case study here) and may vary with the classifier used; 3) once feature sets are integrated, performance of different classification methods (i.e. SVM or DT) are comparable and allow accurate and robust classification.

Classification performance based on the MP only model is comparable to classification based on morphology in other studies. For instance, a study looking at the automated classification of cichlid fish from Lake Malawi classified on average about 78% correctly [43], whereas another study comparing functional groups of plankton classified about 82% correctly [44]. It should be noted, however, that the numbers of classes were higher (12 combinations of species and sex in the former and 53 functional groups in the latter) in these two studies than in our case. Another study aiming for classification of 6 different movement classes of fish (i.e. a lower number as in our case) had an accuracy of 74% [45], comparable to our results based on movement only. This shows that movement features on their own, are a worse proxy than morphology for species classification in our particular case, but may still provide a worthwhile information gain in other systems, especially when automated classification outperforms human observers [43]. A possible explanation for movement being a less good predictor on its own, is the inherent variability of movement compared to morphology, which may only vary in the restricted range of morphological development. It is known for instance that phenotypic plasticity is larger for behavioral traits (which would include movement) than for instance morphological traits, as shown in a study by [46]. What would be fruitful avenues to improve species classification based on movement behavior? The temporal scale covered by our case study (20 seconds) is still relatively small compared to the lifetime of a cell (several hours to a day) and potentially the temporal scale of behavior. Hence, if we capture only a fraction of the actual behavioral mode, it may be difficult to characterize the species with that information because species identity and behavioral mode may be confounded. A study looking at the movement behavior of cows has shown that the temporal resolution and length of the trajectory determined whether behaviors could be reliably detected or not [47]. Increasing the overall length of the trajectory may help in better capturing the characteristic features of ciliate movement and hence its classification. In addition, the frequency and composition of behavioral modes expressed during the lifetime of a cell may have higher predictive power regarding species identity, as species may show specific signatures of behavioral modes when compared to each other. It was, for instance, shown that movement behavior does vary over the lifetime of cells, although most of the variation can still be summarized in two major behavioral modes [48]. It has to be noted, however, that longer videos have increasing demands in processing power and storage, which may only be justified when higher resolution in terms of behavioral modes is desired and classification success has to exceed the already high success rate shown in our study.

Slight classification performance improvements when features based on ApEn were added to the MP only model might be due to the fact that in the dataset used, dominant fluctuations or regularities are not really present, or that these are similar among classes. The movement of different ciliate species is rather similar to each other (as can be seen in the results of the MP only model) and detecting any dominant regularity in the MP values (captured through ApEn) is rather difficult in our case. One reason for the strongly converging movement behavior among species may be the shared foraging mode. The 8 ciliate species used are all bacterivorous species feeding by phagocytosis, i.e. the engulfing of food particles such as bacteria during swimming [49]. Because they share similar bacterial prey, natural selection may have led to the evolution of very similar movement strategies that allow similar foraging success among species. Although the ApEn features were not largely contributing to classification success, they still yielded a slight improvement to the classification in the case of SVM. Thus, we retained ApEn features in the classification to test if they were considered in the final feature selection process.

The third classification model including wavelet features further improved classification success. This shows that the wavelet features have been successful in capturing periodic movements in ciliate trajectories. These periodic patterns in at least some of the ciliate species could, for instance, be due to a looping behavior, where individuals move away from their departure point and return within a given time period [50]. Such a movement pattern would lead to periodic changes in the net displacement. It is most likely that these movements are performed on a small spatial scale such that they were captured by the wavelet analysis. In other applications such as the classification of EEG signals, wavelet analysis has been successfully applied, owing precisely to the periodic nature of the signals [41,51]. Our study shows that wavelet analysis provides complementary information to static movement parameters and hence improves classification success by capturing an additional aspect of movement. Importantly, adding the complementary wavelets and ApEn also improved the overall classification success from 89% using static movement parameters and morphology to 95% in this study [23]. However, as shown in Fig 4 wavelet or ApEn features on their own are less meaningful in movement based classification problems, since they will only capture specific aspects of movement such as periodic patterns.

Contrasting the morphology + MP model with the morphology + MP + advanced movement features model shows that the advanced features have merit in terms of improving species-specific accuracy and recall. Both species with the lowest number of cases (Blepharisma and P. aurelia) had improved accuracy and recall and even the abundant Tetrahymena was better classified. The increased effort of calculating advanced movement features hence pays off due to the improvements, but simple movement metrics may be preferred if the movement expressed does not show temporal structure (as for other species such as Colpidium, Cyclidium and Loxocephalus). Interestingly, the advanced movement features contribute only to improved classification in the case of these species, when combined with morphology, as classification only based on movement failed completely. This suggests that combined features can have synergistic effects on classification performance and the right combination of features is key for a successful overall classification.

Another achievement of this study is demonstrated by the results of the final classification model that integrates all the movement and morphology features: Careful selection of input features to obtain a set of features that collectively capture the varied aspects of movement will result in the highest classification performance, regardless of the classification method used. In this study, two classification methods with different theoretical background were employed (i.e. SVM and DT). While we were not comparing the performance of those methods, we would like to point out that selecting relevant movement features capturing different aspects of movement is of utmost importance. Such a classification approach can ensure reliable results, as can be seen through the comparable sets of selected features for building the SVM and DT models. This also could be seen in the range of selected features shown in Table 1, where SVM achieves similar results with fewer features. In the MP only model, all the movement parameters are used for both DT and SVM models. In the case of ApEn, SVM uses only 3 features (ApEn of distance travelled, acceleration and turning angle), compared to 5 used by DT (distance travelled, acceleration, speed, meandering and sinuosity). The selected features based on the wavelet transform show that features corresponding to different approximations and sub-bands are intermittently used, confirming the importance of both of these sub-bands in the classification. Although the two sets of selected features for SVM and DT are not exactly the same, all the developed groups of movement features showed up in the feature selection process, indicating their contribution to classifying between species. This is in accordance with the findings of other studies [26], where different combinations of features may end up in comparable results. The final message is that the combination of relevant features—movement and morphology in our case study—can ultimately build reliable classification models with high precision and recall.

Previous classification based on random forest classification showed that imbalance in the abundance of classes would influence the outcome for specific pairs of species. For instance, P. aurelia being less abundant than P. caudatum would get completely lumped into P. caudatum [23]. Whereas not unexpected due to the working principle of the random forest algorithm, the classification is unreliable for the minority class. Here we show that other classification methods such as SVM can accommodate for such imbalances better and may therefore be better suited when dealing with datasets that show large imbalances as the one used in this study.

We also employed the approach presented in [26], where a moving window of different (temporal) sizes is employed for the computation of MPs and then imported to the classification model. The results of this simple cross-scale analysis method, although not presented here, suggested that the original temporal granularity at which the data was captured was the most reliable temporal scale for the calculation of MPs. Consequently, when we employed wavelet analysis, we saw that adding features based on the DWT indeed contributed to improving the performance of the classification. We conclude from this that since scale issues manifest themselves in different ways in movement analysis, appropriate methods need to be used in order to provide complementary measures to scale-specific techniques.

As part of future work, the capability of the discrete wavelet transform will be investigated in other relevant problems in movement research, including trajectory segmentation. Movement classification and segmentation share common characteristics, given that they both aim at grouping parts of trajectories with respect to the similarity in movement properties. Due to similar conceptual backgrounds, the features extracted from movement trajectories can be used towards both classification and segmentation. Hence, features developed for the classification of entire trajectories could also be applied to subtrajectories, with little modification. The focus in the case of segmentation is to divide the trajectories into segments (subtrajectories) with homogeneous movement characteristics, which can point out the particular behaviors to be mined from movement trajectories. Since DWT decomposes the input signals at different levels, it can be used to investigate the variation of behaviors across different scales. This can be particularly interesting in different application domains, where sophisticated methods are needed to automate the process of segmenting large volumes of movement data.

Conclusions

In this study, the contribution of different movement features in a classification problem was investigated. Different ciliate species were considered as the target classes, to assess whether features based on movement can be employed as a complementary proxy to morphology in the classification problem. Our results demonstrate the value of exploring wavelet analysis, together with general movement features, in order to better distinguish the ciliate species. Such features have not been used yet in studies related to automated classification of species in the context of video analysis, and are so far rarely employed for feature extraction in movement classification studies in general. We believe that our findings are applicable to movement ecology studies in general, since they show that movement paths can be automatically classified according to classes such as species, but may also be useful to infer biological states such as behavioral modes. Our results also have potential application for instance in the field of automated monitoring of waste water.

Supporting Information

S1 Fig. Working principle of the discrete wavelet transform.

https://doi.org/10.1371/journal.pone.0145345.s001

(EPS)

S2 Fig. Decomposition of the movement parameter profile through wavelet analysis at different levels.

https://doi.org/10.1371/journal.pone.0145345.s002

(EPS)

S1 Text. Movement classification by the discrete wavelet transform.

https://doi.org/10.1371/journal.pone.0145345.s003

(DOCX)

Author Contributions

Conceived and designed the experiments: AS FP OLP RW. Performed the experiments: FP. Analyzed the data: AS FP. Contributed reagents/materials/analysis tools: AS FP OLP RW. Wrote the paper: AS FP OLP RW.

References

1. Cagnacci F, Boitani L, Powell RA, Boyce MS. Animal ecology meets GPS-based radiotelemetry: a perfect storm of opportunities and challenges. Philos Trans R Soc London B Biol Sci. 2010 Jan 23;365(1550):2157–62. pmid:20566493
- View Article
- PubMed/NCBI
- Google Scholar
2. Nathan R, Spiegel O, Fortmann-Roe S, Harel R, Wikelski M, Getz WM. Using tri-axial acceleration data to identify behavioral modes of free-ranging animals: general concepts and tools illustrated for griffon vultures. J Exp Biol. 2012 Mar 15;215:986–96. pmid:22357592
- View Article
- PubMed/NCBI
- Google Scholar
3. Dell AI, Bender JA, Branson K, Couzin ID, de Polavieja GG, Noldus LPJJ, et al. Automated image-based tracking and its application in ecology. Trends Ecol Evol. 2014 Jul 8;29(7):417–28. pmid:24908439
- View Article
- PubMed/NCBI
- Google Scholar
4. Demšar U, Buchin K, Cagnacci F, Safi K, Speckmann B, Van de Weghe N, et al. Analysis and visualisation of movement: an interdisciplinary review. Mov Ecol. 2015; 3(1):1–24
- View Article
- Google Scholar
5. Gurarie E, Andrews RD, Laidre KL. A novel method for identifying behavioural changes in animal movement data. Ecol Lett. 2009 Apr 9;12(5):395–408. pmid:19379134
- View Article
- PubMed/NCBI
- Google Scholar
6. Nams VO. Combining animal movements and behavioural data to detect behavioural states. Ecol Lett. 2014 Jul 15;17:1228–1237 pmid:25040789
- View Article
- PubMed/NCBI
- Google Scholar
7. Branson K, Robie AA, Bender J, Perona P, Dickinson MH. High-throughput ethomics in large groups of Drosophila. Nat Methods. 2009 May 21;6(6):451–7 pmid:19412169
- View Article
- PubMed/NCBI
- Google Scholar
8. Chen M-R, Moison M, Molinero JC, Hwang J-S. Assessing the effect of food and light on Calanus sinicus swimming behavior through video-recording experiments. J Exp Mar Bio Ecol, 2012 Sep 24;422–423:14–9
- View Article
- Google Scholar
9. Chang Y-C, Yan J-C, Hwang J-S, Wu C-H, Lee M-T. Data-oriented analyses of ciliate foraging behaviors. Hydrobiologia. 2010 Sep 16;666(1):223–37.
- View Article
- Google Scholar
10. Hennessey TM, Lampert TJ. Chapter 15—Behavioral Bioassays and Their Uses in Tetrahymena. In: Collins K, editor. Methods in Cell Biology. Academic Press; 2012. p. 393–410.
11. Fauchald P, Tveraa T. Using first-passage time in the analysis of area-restricted search and habitat selection. Ecology. 2003;84(2):282–8.
- View Article
- Google Scholar
12. Webb SL, Riffell SK, Gee KL, Demarais S. Using Fractal Analyses to Characterize Movement Paths of White-Tailed Deer and Response to Spatial Scale. J Mammal. 2009;90(5):1210–7.
- View Article
- Google Scholar
13. Beyer HL, Morales JM, Murray D, Fortin MJ. The effectiveness of Bayesian state-space models for estimating behavioural states from movement paths. Methods Ecol Evol. 2013;4(5):433–41.
- View Article
- Google Scholar
14. Gurarie E, Bracis C, Delgado M, Meckley TD, Kojola I, Wagner CM. What is the animal doing? Tools for exploring behavioral structure in animal movements. J Anim Ecol. 2015.
- View Article
- Google Scholar
15. Lynn D. The Ciliated Protozoa: Characterization, Classification, and Guide to the Literature. 3rd edition. New York: Springer; 2011. 605 p.
16. Pernthaler J. Predation on prokaryotes in the water column and its ecological implications. Nat Rev Microbiol. 2005 Jan 9;3(7):537–46. pmid:15953930
- View Article
- PubMed/NCBI
- Google Scholar
17. Altermatt F, Fronhofer EA, Garnier A, Giometto A, Hammes F, Klecka J, et al. Big answers from small worlds: a user’s guide for protist microcosms as a model system in ecology and evolution. Methods Ecol Evol. 2015 Jan 6; 6:218–231.
- View Article
- Google Scholar
18. Amaral AL, da Motta M, Pons MN, Vivier H, Roche N, Mota M, et al. Survey of Protozoa and Metazoa populations in wastewater treatment plants by image analysis and discriminant analysis. Environmetrics. 2004 May 28;15(4):381–90.
- View Article
- Google Scholar
19. Ginoris YP, Amaral AL, Nicolau A, Coelho MAZ, Ferreira EC. Development of an image analysis procedure for identifying protozoa and metazoa typical of activated sludge system. Water Res. 2007 May 22;41(12):2581–9. pmid:17399764
- View Article
- PubMed/NCBI
- Google Scholar
20. Amaral AL, Ginoris YP, Nicolau A, Coelho MZ, Ferreira EC. Stalked protozoa identification by image analysis and multivariable statistical techniques. Anal Bioanal Chem. 2008 May 28;391(4):1321–5. pmid:18327573
- View Article
- PubMed/NCBI
- Google Scholar
21. Mesquita DP, Amaral AL, Ferreira EC. Activated sludge characterization through microscopy: A review on quantitative image analysis and chemometric techniques. Anal Chim Acta. 2013 May 28;802:14–28. pmid:24176501
- View Article
- PubMed/NCBI
- Google Scholar
22. Finlay BJ. Protist taxonomy: an ecological perspective. Philos Trans R Soc London Ser B Biol Sci. 2004 Aug 23;359(1444):599–610.
- View Article
- Google Scholar
23. Pennekamp F, Schtickzelle N, Petchey OL. BEMOVI, software for extracting behavior and morphology from videos, illustrated with analyses of microbes. Ecology and Evolution. 2015; 5(13):2584–95 pmid:26257872
- View Article
- PubMed/NCBI
- Google Scholar
24. Long JA, Nelson TA. A review of quantitative methods for movement data. Int J Geogr Inf Sci. 2013;27(2):292–318.
- View Article
- Google Scholar
25. Laube P, Purves R. How fast is a cow? Cross-Scale Analysis of Movement Data. Trans GIS. 2011;15(3):401–18.
- View Article
- Google Scholar
26. Soleymani A, Cachat J, Robinson K, Dodge S, Kalueff A V, Weibel R. Integrating cross-scale analysis in the spatial and temporal domains for classification of behavioral movement. J Spat Inf Sci. 2014;8(8):1–25.
- View Article
- Google Scholar
27. Daubechies I. The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory. 1990;36(5):961–1005.
- View Article
- Google Scholar
28. Fagan WF, Lewis M a, Auger-Méthé M, Avgar T, Benhamou S, Breed G, et al. Spatial memory and animal movement. Ecol Lett. 2013 Oct 15;16(10):1316–29. pmid:23953128
- View Article
- PubMed/NCBI
- Google Scholar
29. March D, Palmer M, Alós J, Grau A, Cardona F. Short-term residence, home range size and diel patterns of the painted comber Serranus scriba in a temperate marine reserve. Mar Ecol Prog Ser. 2010 Feb 11;400:195–206.
- View Article
- Google Scholar
30. Sur M, Skidmore AK, Exo K-M, Wang T, Ens J B, Toxopeus G. Change detection in animal movement using discrete wavelet analysis. Ecol Inform. Elsevier B.V.; 2014 Mar 5;20:47–57.
- View Article
- Google Scholar
31. Wittemyer G, Polansky L, Douglas-hamilton I, Getz WM. Disentangling the effects of forage, social rank, and risk on movement autocorrelation of elephants using Fourier and wavelet analyses. Proc Natl Acad Sci. 2008;105(49):19108–13. pmid:19060207
- View Article
- PubMed/NCBI
- Google Scholar
32. Gaucherel C. Wavelet analysis to detect regime shifts in animal movement. Comput Ecol Softw. 2011;1(2):69–85.
- View Article
- Google Scholar
33. Dodge S, Weibel R, Lautenschütz A. Towards a taxonomy of movement patterns. Inf Vis. 2008;7:240–52.
- View Article
- Google Scholar
34. Dodge S, Weibel R, Forootan E. Revealing the physics of movement: Comparing the similarity of movement characteristics of different types of moving objects. Comput Environ Urban Syst. 2009 Nov 24.;33(6):419–34.
- View Article
- Google Scholar
35. Pincus SM. Approximate entropy as a measure of system complexity. Proc Natl Acad Sci. 1991 Mar 15;88(6):2297–301. pmid:11607165
- View Article
- PubMed/NCBI
- Google Scholar
36. Li X. Using Complexity Measures of Movement for Automatically Detecting Movement Types of Unknown GPS Trajectories. Am J Geogr Inf Syst. 2014;3(2):63–74.
- View Article
- Google Scholar
37. Dray S, Royer-Carenzi M, Calenge C. The exploratory analysis of autocorrelation in animal-movement studies. Ecol Res. 2010 Apr 16;25(3):673–81.
- View Article
- Google Scholar
38. Riotte-Lambert L, Benhamou S, Chamaillé-Jammes S. Periodicity analysis of movement recursions. J Theor Biol. 2013 Jan 21;317:238–43. pmid:23108209
- View Article
- PubMed/NCBI
- Google Scholar
39. Mallat SG. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans PATTERN Anal Mach Intell. 1989;I(7):1604–16.
- View Article
- Google Scholar
40. Mallat SG. Multifrequency channel decompositions of images and wavelet models. IEEE Trans Acoust. 1989;37(12):2091–110.
- View Article
- Google Scholar
41. Subasi A. EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst Appl. 2007 May;32(4):1084–93.
- View Article
- Google Scholar
42. Kuhn M. Building Predictive Models in R Using the caret Package. J Stat Softw. 2008;28(5):1–26.
- View Article
- Google Scholar
43. Joo D, Kwan Y, Song J, Pinho C, Hey J, Won Y-J. Identification of cichlid fishes from Lake Malawi using computer vision. PLoS One. 2013;8(10).
- View Article
- Google Scholar
44. Bell JL, Hopcroft RR. Assessment of ZooImage as a tool for the classification of zooplankton. J Plankton Res. 2008 Jul 9;30(12):1351–67.
- View Article
- Google Scholar
45. Amer M, Bilgazyev E, Todorovic S, Shah S, Kakadiaris I, Ciannelli L. Fine-grained categorization of fish motion patterns in underwater videos. 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops). 2011. p. 1488–95.
46. Relyea RA. Morphological and Behavioral Plasticity of Larval Anurans in Response to Different Predators. Ecology. 2001 Jan 28;82(2):523–40.
- View Article
- Google Scholar
47. de Weerd N, van Langevelde F, van Oeveren H, Nolet B, Kölzsch A, Prins HHT, et al. Deriving Animal Behaviour from High-Frequency GPS: Tracking Cows in Open and Forested Habitat. PLoS One. 2015;10(6):e0129030. pmid:26107643
- View Article
- PubMed/NCBI
- Google Scholar
48. Jordan D, Kuehn S, Katifori E, Leibler S. Behavioral diversity in microbes and low-dimensional phenotypic spaces. Proc Natl Acad Sci. 2013 Jul 10;110:14018–23. pmid:23898201
- View Article
- PubMed/NCBI
- Google Scholar
49. Fenchel T, Blackburn N. Motile chemosensory behaviour of phagotrophic protists: Mechanisms for and efficiency in congregating at food patches. Protist. 1999;150(3):325–36. pmid:10575704
- View Article
- PubMed/NCBI
- Google Scholar
50. Conradt L, Bodsworth EJ, Roper TJ, Thomas CD. Non-random dispersal in the butterfly Maniola jurtina: implications for metapopulation models. Proc R Soc B Biol Sci. 2000;267:1505–10.
- View Article
- Google Scholar
51. Güler I, Ubeyli ED. Adaptive neuro-fuzzy inference system for classification of EEG signals using wavelet coefficients. J Neurosci Methods. 2005 Oct 30;148(2):113–21. pmid:16054702
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Cagnacci F, Boitani L, Powell RA, Boyce MS. Animal ecology meets GPS-based radiotelemetry: a perfect storm of opportunities and challenges. Philos Trans R Soc London B Biol Sci. 2010 Jan 23;365(1550):2157–62. pmid:20566493
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Nathan R, Spiegel O, Fortmann-Roe S, Harel R, Wikelski M, Getz WM. Using tri-axial acceleration data to identify behavioral modes of free-ranging animals: general concepts and tools illustrated for griffon vultures. J Exp Biol. 2012 Mar 15;215:986–96. pmid:22357592
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Dell AI, Bender JA, Branson K, Couzin ID, de Polavieja GG, Noldus LPJJ, et al. Automated image-based tracking and its application in ecology. Trends Ecol Evol. 2014 Jul 8;29(7):417–28. pmid:24908439
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Demšar U, Buchin K, Cagnacci F, Safi K, Speckmann B, Van de Weghe N, et al. Analysis and visualisation of movement: an interdisciplinary review. Mov Ecol. 2015; 3(1):1–24
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref5] 5. Gurarie E, Andrews RD, Laidre KL. A novel method for identifying behavioural changes in animal movement data. Ecol Lett. 2009 Apr 9;12(5):395–408. pmid:19379134
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Nams VO. Combining animal movements and behavioural data to detect behavioural states. Ecol Lett. 2014 Jul 15;17:1228–1237 pmid:25040789
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Branson K, Robie AA, Bender J, Perona P, Dickinson MH. High-throughput ethomics in large groups of Drosophila. Nat Methods. 2009 May 21;6(6):451–7 pmid:19412169
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Chen M-R, Moison M, Molinero JC, Hwang J-S. Assessing the effect of food and light on Calanus sinicus swimming behavior through video-recording experiments. J Exp Mar Bio Ecol, 2012 Sep 24;422–423:14–9
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref9] 9. Chang Y-C, Yan J-C, Hwang J-S, Wu C-H, Lee M-T. Data-oriented analyses of ciliate foraging behaviors. Hydrobiologia. 2010 Sep 16;666(1):223–37.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref10] 10. Hennessey TM, Lampert TJ. Chapter 15—Behavioral Bioassays and Their Uses in Tetrahymena. In: Collins K, editor. Methods in Cell Biology. Academic Press; 2012. p. 393–410.

[ref11] 11. Fauchald P, Tveraa T. Using first-passage time in the analysis of area-restricted search and habitat selection. Ecology. 2003;84(2):282–8.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref12] 12. Webb SL, Riffell SK, Gee KL, Demarais S. Using Fractal Analyses to Characterize Movement Paths of White-Tailed Deer and Response to Spatial Scale. J Mammal. 2009;90(5):1210–7.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref13] 13. Beyer HL, Morales JM, Murray D, Fortin MJ. The effectiveness of Bayesian state-space models for estimating behavioural states from movement paths. Methods Ecol Evol. 2013;4(5):433–41.
View Article
Google Scholar

[42] View Article

[43] Google Scholar

[ref14] 14. Gurarie E, Bracis C, Delgado M, Meckley TD, Kojola I, Wagner CM. What is the animal doing? Tools for exploring behavioral structure in animal movements. J Anim Ecol. 2015.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref15] 15. Lynn D. The Ciliated Protozoa: Characterization, Classification, and Guide to the Literature. 3rd edition. New York: Springer; 2011. 605 p.

[ref16] 16. Pernthaler J. Predation on prokaryotes in the water column and its ecological implications. Nat Rev Microbiol. 2005 Jan 9;3(7):537–46. pmid:15953930
View Article
PubMed/NCBI
Google Scholar

[49] View Article

[50] PubMed/NCBI

[51] Google Scholar

[ref17] 17. Altermatt F, Fronhofer EA, Garnier A, Giometto A, Hammes F, Klecka J, et al. Big answers from small worlds: a user’s guide for protist microcosms as a model system in ecology and evolution. Methods Ecol Evol. 2015 Jan 6; 6:218–231.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref18] 18. Amaral AL, da Motta M, Pons MN, Vivier H, Roche N, Mota M, et al. Survey of Protozoa and Metazoa populations in wastewater treatment plants by image analysis and discriminant analysis. Environmetrics. 2004 May 28;15(4):381–90.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref19] 19. Ginoris YP, Amaral AL, Nicolau A, Coelho MAZ, Ferreira EC. Development of an image analysis procedure for identifying protozoa and metazoa typical of activated sludge system. Water Res. 2007 May 22;41(12):2581–9. pmid:17399764
View Article
PubMed/NCBI
Google Scholar

[59] View Article

[60] PubMed/NCBI

[61] Google Scholar

[ref20] 20. Amaral AL, Ginoris YP, Nicolau A, Coelho MZ, Ferreira EC. Stalked protozoa identification by image analysis and multivariable statistical techniques. Anal Bioanal Chem. 2008 May 28;391(4):1321–5. pmid:18327573
View Article
PubMed/NCBI
Google Scholar

[63] View Article

[64] PubMed/NCBI

[65] Google Scholar

[ref21] 21. Mesquita DP, Amaral AL, Ferreira EC. Activated sludge characterization through microscopy: A review on quantitative image analysis and chemometric techniques. Anal Chim Acta. 2013 May 28;802:14–28. pmid:24176501
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref22] 22. Finlay BJ. Protist taxonomy: an ecological perspective. Philos Trans R Soc London Ser B Biol Sci. 2004 Aug 23;359(1444):599–610.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref23] 23. Pennekamp F, Schtickzelle N, Petchey OL. BEMOVI, software for extracting behavior and morphology from videos, illustrated with analyses of microbes. Ecology and Evolution. 2015; 5(13):2584–95 pmid:26257872
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref24] 24. Long JA, Nelson TA. A review of quantitative methods for movement data. Int J Geogr Inf Sci. 2013;27(2):292–318.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref25] 25. Laube P, Purves R. How fast is a cow? Cross-Scale Analysis of Movement Data. Trans GIS. 2011;15(3):401–18.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref26] 26. Soleymani A, Cachat J, Robinson K, Dodge S, Kalueff A V, Weibel R. Integrating cross-scale analysis in the spatial and temporal domains for classification of behavioral movement. J Spat Inf Sci. 2014;8(8):1–25.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref27] 27. Daubechies I. The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory. 1990;36(5):961–1005.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref28] 28. Fagan WF, Lewis M a, Auger-Méthé M, Avgar T, Benhamou S, Breed G, et al. Spatial memory and animal movement. Ecol Lett. 2013 Oct 15;16(10):1316–29. pmid:23953128
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref29] 29. March D, Palmer M, Alós J, Grau A, Cardona F. Short-term residence, home range size and diel patterns of the painted comber Serranus scriba in a temperate marine reserve. Mar Ecol Prog Ser. 2010 Feb 11;400:195–206.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref30] 30. Sur M, Skidmore AK, Exo K-M, Wang T, Ens J B, Toxopeus G. Change detection in animal movement using discrete wavelet analysis. Ecol Inform. Elsevier B.V.; 2014 Mar 5;20:47–57.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref31] 31. Wittemyer G, Polansky L, Douglas-hamilton I, Getz WM. Disentangling the effects of forage, social rank, and risk on movement autocorrelation of elephants using Fourier and wavelet analyses. Proc Natl Acad Sci. 2008;105(49):19108–13. pmid:19060207
View Article
PubMed/NCBI
Google Scholar

[100] View Article

[101] PubMed/NCBI

[102] Google Scholar

[ref32] 32. Gaucherel C. Wavelet analysis to detect regime shifts in animal movement. Comput Ecol Softw. 2011;1(2):69–85.
View Article
Google Scholar

[104] View Article

[105] Google Scholar

[ref33] 33. Dodge S, Weibel R, Lautenschütz A. Towards a taxonomy of movement patterns. Inf Vis. 2008;7:240–52.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref34] 34. Dodge S, Weibel R, Forootan E. Revealing the physics of movement: Comparing the similarity of movement characteristics of different types of moving objects. Comput Environ Urban Syst. 2009 Nov 24.;33(6):419–34.
View Article
Google Scholar

[110] View Article

[111] Google Scholar

[ref35] 35. Pincus SM. Approximate entropy as a measure of system complexity. Proc Natl Acad Sci. 1991 Mar 15;88(6):2297–301. pmid:11607165
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref36] 36. Li X. Using Complexity Measures of Movement for Automatically Detecting Movement Types of Unknown GPS Trajectories. Am J Geogr Inf Syst. 2014;3(2):63–74.
View Article
Google Scholar

[117] View Article

[118] Google Scholar

[ref37] 37. Dray S, Royer-Carenzi M, Calenge C. The exploratory analysis of autocorrelation in animal-movement studies. Ecol Res. 2010 Apr 16;25(3):673–81.
View Article
Google Scholar

[120] View Article

[121] Google Scholar

[ref38] 38. Riotte-Lambert L, Benhamou S, Chamaillé-Jammes S. Periodicity analysis of movement recursions. J Theor Biol. 2013 Jan 21;317:238–43. pmid:23108209
View Article
PubMed/NCBI
Google Scholar

[123] View Article

[124] PubMed/NCBI

[125] Google Scholar

[ref39] 39. Mallat SG. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans PATTERN Anal Mach Intell. 1989;I(7):1604–16.
View Article
Google Scholar

[127] View Article

[128] Google Scholar

[ref40] 40. Mallat SG. Multifrequency channel decompositions of images and wavelet models. IEEE Trans Acoust. 1989;37(12):2091–110.
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref41] 41. Subasi A. EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst Appl. 2007 May;32(4):1084–93.
View Article
Google Scholar

[133] View Article

[134] Google Scholar

[ref42] 42. Kuhn M. Building Predictive Models in R Using the caret Package. J Stat Softw. 2008;28(5):1–26.
View Article
Google Scholar

[136] View Article

[137] Google Scholar

[ref43] 43. Joo D, Kwan Y, Song J, Pinho C, Hey J, Won Y-J. Identification of cichlid fishes from Lake Malawi using computer vision. PLoS One. 2013;8(10).
View Article
Google Scholar

[139] View Article

[140] Google Scholar

[ref44] 44. Bell JL, Hopcroft RR. Assessment of ZooImage as a tool for the classification of zooplankton. J Plankton Res. 2008 Jul 9;30(12):1351–67.
View Article
Google Scholar

[142] View Article

[143] Google Scholar

[ref45] 45. Amer M, Bilgazyev E, Todorovic S, Shah S, Kakadiaris I, Ciannelli L. Fine-grained categorization of fish motion patterns in underwater videos. 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops). 2011. p. 1488–95.

[ref46] 46. Relyea RA. Morphological and Behavioral Plasticity of Larval Anurans in Response to Different Predators. Ecology. 2001 Jan 28;82(2):523–40.
View Article
Google Scholar

[146] View Article

[147] Google Scholar

[ref47] 47. de Weerd N, van Langevelde F, van Oeveren H, Nolet B, Kölzsch A, Prins HHT, et al. Deriving Animal Behaviour from High-Frequency GPS: Tracking Cows in Open and Forested Habitat. PLoS One. 2015;10(6):e0129030. pmid:26107643
View Article
PubMed/NCBI
Google Scholar

[149] View Article

[150] PubMed/NCBI

[151] Google Scholar

[ref48] 48. Jordan D, Kuehn S, Katifori E, Leibler S. Behavioral diversity in microbes and low-dimensional phenotypic spaces. Proc Natl Acad Sci. 2013 Jul 10;110:14018–23. pmid:23898201
View Article
PubMed/NCBI
Google Scholar

[153] View Article

[154] PubMed/NCBI

[155] Google Scholar

[ref49] 49. Fenchel T, Blackburn N. Motile chemosensory behaviour of phagotrophic protists: Mechanisms for and efficiency in congregating at food patches. Protist. 1999;150(3):325–36. pmid:10575704
View Article
PubMed/NCBI
Google Scholar

[157] View Article

[158] PubMed/NCBI

[159] Google Scholar

[ref50] 50. Conradt L, Bodsworth EJ, Roper TJ, Thomas CD. Non-random dispersal in the butterfly Maniola jurtina: implications for metapopulation models. Proc R Soc B Biol Sci. 2000;267:1505–10.
View Article
Google Scholar

[161] View Article

[162] Google Scholar

[ref51] 51. Güler I, Ubeyli ED. Adaptive neuro-fuzzy inference system for classification of EEG signals using wavelet coefficients. J Neurosci Methods. 2005 Oct 30;148(2):113–21. pmid:16054702
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

Developing and Integrating Advanced Movement Features Improves Automated Classification of Ciliate Species

Developing and Integrating Advanced Movement Features Improves Automated Classification of Ciliate Species

Correction

Figures

Abstract

Introduction

Methods and Materials

Overview of movement classification

Extraction of movement features.

Aggregate movement features.

Feature extraction based on wavelet analysis.

Experiment

Model species

Data collection

Analysis

Results

Contrasting morphology and movement features

Species classification based on movement features alone

Discussion

Conclusions

Supporting Information

S1 Fig. Working principle of the discrete wavelet transform.

S2 Fig. Decomposition of the movement parameter profile through wavelet analysis at different levels.

S1 Text. Movement classification by the discrete wavelet transform.

Author Contributions

References