Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Differences in Abundances of Cell-Signalling Proteins in Blood Reveal Novel Biomarkers for Early Detection Of Clinical Alzheimer's Disease

  • Mateus Rocha de Paula,

    Affiliation Centre for Bioinformatics, Biomarker Discovery & Information-Based Medicine, The University of Newcastle, Callaghan, Australia

  • Martín Gómez Ravetti,

    Affiliation Departamento de Engenharia de Produção, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Brazil

  • Regina Berretta,

    Affiliation Centre for Bioinformatics, Biomarker Discovery & Information-Based Medicine, The University of Newcastle, Callaghan, Australia

  • Pablo Moscato

    Pablo.Moscato@newcastle.edu.au

    Affiliation Centre for Bioinformatics, Biomarker Discovery & Information-Based Medicine, The University of Newcastle, Callaghan, Australia

Abstract

Background

In November 2007 a study published in Nature Medicine proposed a simple test based on the abundance of 18 proteins in blood to predict the onset of clinical symptoms of Alzheimer's Disease (AD) two to six years before these symptoms manifest. Later, another study, published in PLoS ONE, showed that only five proteins (IL-1, IL-3, EGF, TNF- and G-CSF) have overall better prediction accuracy. These classifiers are based on the abundance of 120 proteins. Such values were standardised by a Z-score transformation, which means that their values are relative to the average of all others.

Methodology

The original datasets from the Nature Medicine paper are further studied using methods from combinatorial optimisation and Information Theory. We expand the original dataset by also including all pair-wise differences of z-score values of the original dataset (“metafeatures”). Using an exact algorithm to solve the resulting Feature Set problem, used to tackle the feature selection problem, we found signatures that contain either only features, metafeatures or both, and evaluated their predictive performance on the independent test set.

Conclusions

It was possible to show that a specific pattern of cell signalling imbalance in blood plasma has valuable information to distinguish between NDC and AD samples. The obtained signatures were able to predict AD in patients that already had a Mild Cognitive Impairment (MCI) with up to 84% of sensitivity, while maintaining also a strong prediction accuracy of 90% on a independent dataset with Non Demented Controls (NDC) and AD samples. The novel biomarkers uncovered with this method now confirms ANG-2, IL-11, PDGF-BB, CCL15/MIP-1; and supports the joint measurement of other signalling proteins not previously discussed: GM-CSF, NT-3, IGFBP-2 and VEGF-B.

Introduction

In November 2007, a study published in Nature Medicine [1] immediately attracted both scientific and media attention. A multidisciplinary team led by Stanford researchers proposed a simple test, based on the abundance of 18 plasma signalling proteins, for early detection of clinical Alzheimer's disease (AD).

They showed that a molecular signature can be used to predict the onset of clinical symptoms of AD as early as two to six years before these symptoms manifest. These initial findings have important consequences as the scientific and social significance of being able to predict the onset of AD before clinical symptoms appear is of unquestionable benefit. The relative simplicity of the proposed method and the quality of the execution of the study contributed to the immediate interest in the scientific community.

The basic experimental design was remarkably simple. Using the abundance of 120 signalling proteins on a training set of 83 archived plasma samples, Ray et al. [1] identified an 18-protein signature, a subset of the set of 120 signalling proteins they were measuring, which proved to be useful to predict clinical symptoms of AD. The signature was able to show an overall effectiveness of 91% and 81% for AD predictability on two separate test sets: one comparing patients who developed clinical AD with Non-Demented Controls (NDC), and another comparing patients with Mild Cognitive Impairment (MCI) that developed AD with those who did not. Predicting AD within patients with a MCI as early as possible is particularly important because during the observation period of memory testing, which can take up to several months, profound neuropathological damage may occur [2].

Soon after this discovery was published, and using the same datasets available on the public domain, Gómez Ravetti and Moscato [3] showed that the abundance of only five proteins was sufficient to obtain an even better total prediction accuracy. They used an integrative bioinformatics approach, based on the ()--Feature Set methodology [4][12], that reduced the search of predictive biomarkers to only a subset of Ray et al.'s [1]: IL-1 (interleukin 1), IL-3 (Interleukin 3 (colony-stimulating factor, multiple), EGF (epidermal growth factor (-urogastrone)), TNF- (Tumour Necrosis Factor ) and G-CSF (colony stimulating factor 3 (granulocyte)).

Their results indicated that, using the abundance of just these five proteins together with simple established logistic-type classifiers, it was possible to distinguish NDC from samples with AD with a higher accuracy than that of the signature proposed by Ray et al. [1].

However, it is important to understand the context in which such accuracies are to be interpreted. As Ray et al. [1] already stated in their supplementary material, since many of the patients are still alive, it is not possible to be completely sure that the study participants that were labelled as AD samples are indeed individuals that will develop AD. An accurate AD diagnosis can only be obtained post-mortem with the histological analysis of brain material. The same can probably be stated of NDC, based on the same argument. Therefore, by “accuracy” what is actually reported in these performance tests is the overall percentage agreement with the current clinical diagnosis. That means that the existing classifiers have a high level of agreement with current clinical diagnosis but, as some of the samples might have been assigned an inaccurate label, they might also not be as robust as they could.

Motivation

One of the most relevant characteristics of Gómez Ravetti and Moscato's study [3] is that they report results of not one, but 24 different classifiers available in the Weka software package [13]. They proposed that the consensus of the prediction of different classifiers, inspired by different mathematical principles, would provide a more reliable prediction than the results of a single classifier. This allowed to establish the relevance of the 5-protein signature since it was able to distinguish between AD samples and NDC with a higher overall percent agreement with clinical diagnosis. Since this average was obtained from the results of 24 different types of classifiers, instead of a single one, the study provided strong evidence that the 5-protein signature is indeed a useful biomarker panel. Figure 1 illustrates the performance of the uncovered 5-protein signature.

thumbnail
Figure 1. Stacked values of the Z-Scores of the 5-protein signature introduced by Gómez Ravetti and Moscato [3].

Figures 1A–B present the stacked values of the Z-Scores of samples in the training and independent test set respectively. The 5-protein signature includes the relative abundances of IL-1 (interleukin 1 ), IL-3 (Interleukin 3 (colony-stimulating factor, multiple)), EGF (epidermal growth factor (-urogastrone)), TNF- (Tumour Necrosis Factor ) and G-CSF (colony stimulating factor 3 (granulocyte)) on a panel of 120 proteins used as reference set. Both Figures 1A–B shows that the stacked values of the Z-Scores of this panel of five proteins are lower in those patients that will develop clinical symptoms of AD in two to five years. The figures have samples as ordered in the original publication by Ray et al. [1]. In Figure 1A the leftmost 44 values correspond to those samples labelled ‘AD’, and in Figure 1B the leftmost 42 are labelled in the same way. The samples marked in red developed AD while the ones marked in blue are ‘Non-AD’ samples (Non-Demented Controls plus Other Dementias). Since a Z-Score transformation was performed on the dataset, the measured values of each protein are, in fact, relative to the variation of the other 119.

https://doi.org/10.1371/journal.pone.0017481.g001

Three facts are worth mentioning from the previous works by Gómez Ravetti and Moscato [3] and Ray et al. [1]: first, the majority of the classifiers performed better using the 5-protein signature than the 18-protein signature. Second, both in Ray et al.'s [1] and Gómez Ravetti and Moscato's [3] studies, some classifiers disagreed with the clinical diagnosis labels on the same samples of the datasets. Third, as Figure 1 clearly illustrates, the average of the Z-scores of their 5-protein biomarker is already a simple, yet powerful, discriminator between the two groups. However, all 5 proteins have, on average, a smaller Z-score in AD samples than in NDC. Since the measured protein abundances were standardised by a Z-score transformation, a positive value indicates the excess of a particular protein over the average value of 120 proteins. In essence, this means that the measures of each protein are, in fact, only relative to the variation of the other 119. Figure 2 shows that the average value of all proteins, excluding the 5 identified by Gómez Ravetti and Moscato's [3], also distinguishes well between the classes. Therefore, it is difficult to state whether the 5 proteins proposed by Gómez Ravetti and Moscato [3] or all other 115 are displaced with respect to the average.

thumbnail
Figure 2. Stacked values of the Z-Scores of the set of all proteins except for those proposed by Gómez Ravetti and Moscato [3].

Figures 2A–B present the stacked values of the Z-Scores of samples in the training and independent test sets, respectively. The samples marked in red samples developed AD the ones marked in blue are ‘Non-AD’ samples (Non-Demented Controls plus Other Dementias). Like in Figure 1, since a Z-Score transformation was performed on the dataset, the measured values of each protein are, in fact, relative to the variation of the other 119. Because of that, and the fact that both Figures 12 distinguish well between classes, it is difficult to state whether the 5 proteins of Gómez Ravetti and Moscato [3] or the other 115 are the ones that are displaced with respect to the average.

https://doi.org/10.1371/journal.pone.0017481.g002

Also, as both previous studies provide only aggregated results, this manuscript proposes a case-by-case analysis of the samples, with a methodology inspired by personalised medicine, using robust diagnostic methods. Although the overall performance of several classifiers is still reported in this work, the results under consideration are also systematically analysed for the samples individually.

Relative pair-wise protein variation of abundance levels are explored by expanding the original set of biomarkers with new “artificial” features, called “meta-features”, that model the relative protein imbalance. Using the meta-features, the relative proteins variations become explicit, providing useful information.

As illustrated by Figure 3, since the difference of values between two features might be interesting to distinguish between two classes, even when those features are not useful for that purpose alone, the working hypothesis is that the use of meta-features might reveal if there exists a characteristic signature of the imbalance of cell signalling processes for AD prediction. Such a characteristic imbalance could also be regarded as a new molecular signature for predicting AD, which might add new information to other early detection tests or inspire entirely new ones. Indeed, the analysis of Figure 4 suggests that there is useful information within the meta-features that can distinguish between AD and NDC samples as most of the AD samples cluster together and only a few AD samples remain in the control group. The 290-protein Z-score differences presented in Figure 4 are a signature obtained with ()--Feature Set methodological approach [3], [6][11], [14] on the set of meta-features only, and ordered using the Memetic Algorithm proposed by Moscato et al. [15]. The original 120 (single) features, which do not represent imbalance information, were not considered to guarantee that the discriminative information was indeed brought by imbalance information. These results motivate further study on the topic. The heatmap on the left of Figure 4 represents the samples from the training set, while the heatmap on the right represents the samples from the test set. The Non-AD (The test set samples include Other Dementia (OD) samples, which have not developed AD but are still demented controls) samples are marked in green, and the samples labelled AD are marked in blue.

thumbnail
Figure 3. The difference of values of two features might be interesting to distinguish between two classes, even when those features alone are not useful for that purpose.

In this example, the samples on the left hand side belong to Class A and the samples on the right hand side belong to Class B. The lines represent the Z-scored abundance levels of feature f1,f2 and the meta-feature f1-f2 for each sample. In this case, f1 and f2 are not effective at distinguishing between Class A and Class B, and would not pass the discretization algorithm's entropy filter [17]. However, if the difference between them is considered, we have a clear distinction, and the resulting meta-feature would be interesting and would pass the entropy filter. Roughly speaking, a feature is interesting to distinguish between two classes if it is possible to determine a pattern of up/down regulation of the samples' Z-scored abundance levels that characterizes each class uniquely (i.e.: in feature ‘f1-f2’, all the samples Z-scored abundance levels are down regulated for Class A and up regulated for Class B. Such a distinction cannot be made either with features ‘f1’ or ‘f2’ alone.).

https://doi.org/10.1371/journal.pone.0017481.g003

thumbnail
Figure 4. Heatmaps of the training (left) and test (right) sets, considering only the meta-features.

The ordering of both rows and columns was done using the Memetic Algorithm presented in [15]. The Non-AD samples are marked in green, and the samples labelled AD are marked in blue. The ordering shows that there seems to be a robust molecular genetic signature that can be obtained by pattern recognition algorithms that explore all possible protein abundances differences in this panel of 120 proteins as variables of interest, a mechanism that quantifies the imbalance of cell signalling in plasma. An annotated version of this heatmap is available in the supplementary material. Please refer to it for detailed information about the ordering, selected features and values. On the training set, the samples that did not cluster with their associated clinical diagnosis group are samples s47, s77, s54, s50, s66, s68, s3 and s1, respectively.

https://doi.org/10.1371/journal.pone.0017481.g004

Materials and Methods

Datasets

The modified datasets, used in the experiments, are based on those provided by the recent work of Ray et al. [1]. Quoting from their supplementary information: “Autoradiographic films were scanned and digitized spots were quantified with the Imagene 6.0 data extraction software (BioDiscovery Inc.). Local background intensities were subtracted from each spot, and the average of the duplicate spots for each protein was normalized to the average of six positive controls on each membrane. For statistical analysis expression data from the two filters per sample were normalized to the median expression of all 120 proteins followed by Z score transformation (data file is available online).”

score transformation has the effect of transforming the original distribution to one in which the mean becomes zero and the standard deviation becomes one. A Z-score quantifies the original score in terms of the number of standard deviations that the score is from the mean of the distribution. In other words, this means that a positive value in the original dataset indicates the excess of a particular protein over the average value of 120 proteins. That is, each value is relative to the variation of the other 119.

Equation 1 calculates the Z-score of the abundance level of protein for a sample , where is the mean of the values of all features, for sample , and is the associated standard deviation.(1)

The original dataset consisted of a Training Set with 43 AD samples and 40 NDC samples, a Test Set with 42 AD samples and 39 NDC samples and another Test Set with 22 samples that had MCI and developed AD and 17 samples that had MCI but did not develop AD. The Other Dementia (OD) samples present in the original test sets are disconsidered for classification purposes, as they might have characteristics that could mask the pursued patterns. They are, however, still present on Figures 4 and 5. The OD samples found on the test set with NDC and AD samples have either Frontotemporal dementia or Corticobasal degeneration, while those found on the test set with samples that already had a MCI may have either a Lewy-Body dementia, Vascular dementia or Frontotemporal dementia.

thumbnail
Figure 5. Histogram of the number of classifiers, out of the 25 under consideration, that disagreed with the clinical diagnosis attributed to the samples, when performing on the training set using the proposed 5 feature and 5 meta-feature signature.

Samples with no such disagreement are omitted. Interestingly, many classifiers consistently disagree with the clinical diagnosis on the same samples, which hints for the questioning of their usefulness in the training set to distinguish between AD and NDC. One of the reasons for that is that the cell signalling could be altered by other medical conditions, such as other diseases and use of medication. Using an arbitrarily chosen threshold of 30% of the classifiers, or 8 classifiers or more (rounded up) disagreeing with the clinical diagnosis label of a sample, it is reasonable to suspect that samples s3, s7, s47, s66 and s77 are not suitable to be part of the training set. The signature used for this experiment includes the following features and meta-features: EGF, IL-1, IL-3, TNF-, G-CSF, “BLC-RANTES”, “MIP-1d-IL-11”, “TNF- - ANG-2”, “TNF- - FAS” and “IL-11-I-TAC”.

https://doi.org/10.1371/journal.pone.0017481.g005

The training and test sets considered in this paper are “enlarged” versions of the original. They include the original 120 features for each sample plus 7,140 “meta-features”, generated by applying the difference operator between each possible pair of features. Symmetric meta-features are not considered, as they are equivalent (e.g.: the information provided by a meta feature obtained by subtracting the value of two features F1-F2 is equivalent to the one given by F2-F1).

As depicted by Equation 2, the meta-features model imbalance information, as each value is the displacement of one score with respect to the other involved in the meta-feature. Moreover, as illustrated by Figure 3, such a displacement may reveal interesting information to distinguish between classes that would not be obvious through the analysis of the features alone.(2)

Methods

The proposed computational methodology includes four basic steps on the expanded datasets, in this order: (1) feature selection, (2) classification, (3) analysis and (4) filtering.

Feature selection is performed using the same methodology presented in [3], [6][11], [14], [16]: first, the dataset is pre-filtered and discretised using Fayyad and Irani's [17] entropy-based algorithm, which minimizes the class entropy and discards features according to the Minimum Description Length principle. The result is an instance of the ()-k-Feature Set problem [3], [6][11], [14]. In this combinatorial optimization problem, three parameters are necessary: , which determines the number of features that must explain the dichotomy between samples in different classes; , which determines how many features must explain the similarities between samples in the same class; and , which specifies the size of the desired signature.

In this work, the five features of Gómez Ravetti and Moscato [3] are forced into the signature and is set to 10, aiming to obtain a signature of about the same size as Ray et al.'s [1], while doubling Gómez Ravetti and Moscato's [3] and allowing the same number of new features or meta-features to be introduced. The rationale behind this is to guide the search towards a small signature with features and meta-features that are related to the ones that are already known to be effective in distinguishing between NDC and samples with AD, therefore helping to guide the search towards an even more effective signature. The parameter is chosen to be the maximum possible such that the combinatorial optimization problem admits at least one feasible (i.e.: it is possible to explain the differences between the samples in different classes with at least features, and the similarities between those in the same class with at least features, for all pairs of samples) solution, assuming a fixed signature size (a defined ) and not considering any restrictions imposed by a value (. One way to determine this is to count the number of features that differ from each other on pairs of samples with different clinical diagnosis labels. The value of would therefore be the smallest of these counts. The value of is chosen in a similar way, considering the features that do not differ from each other on samples with the same clinical diagnosis label, such that the combinatorial optimization problem also admits at least one feasible solution and this choice does not force a change on the value of or . The best features that explain the dichotomy between classes are chosen such that they not only satisfy the and values, but also explain the differences and similarities of a greater number of pairs of samples. The determination of and selection of the best features that satisfy and are done by solving the associated Integer Program (IP) using the ILOG CPLEX optimization package version 11.2. See [3], [6][11], [14], [16] for details on the IP formulations and other previous applications.

Next, classification is performed with 25 different classifiers of different types. Table 1 lists all the classifiers under consideration and their types. They are the same 24 classifiers that Gómez Ravetti and Moscato [3] considered plus the Bagging algorithm, which is considered one of the best classifiers [18] available in the Weka package [13]. Each classifier is run on the selected panel of features and meta-features, using the continuous (non-discretized) values, and all default parameters of Weka as of version 3.6.1 (The only exception is IBk, in which the parameter k is set to 2, to distinguish it from IB1. PAM's [19] threshold is also set to zero, to avoid the shrinkage process, which is also a feature selection procedure that we wish to avoid, as a more sophisticated method is already previously applied.) No fine tuning is done. Since an independent test set is already provided and the average performance of all classifiers is already considered, no cross-validation is performed.

The case-by-case analysis is done by plotting the histogram of the number of classifiers that disagree with the clinical diagnosis label given to each sample. With the objective of providing the classifiers only with training examples that characterise their class well, which would provide them with better hints for pattern searching, the samples for which more than 30% of the classifiers do not agree with the clinical diagnosis are removed from the training set. The process is then repeated with this (new) reduced training set until no more than than 30% of the classifiers disagree with the clinical diagnosis of all samples, the reduced training set yields the same signature as in the previous iteration, or the number of available training samples gets too low. The methodology of reducing the size of the training set by excluding samples, both in semi-supervised or unsupervised settings, is called “data pruning” and has been previously used to avoid overfitting and improve generalisation [20], [21].

Results

The first fact worth mentioning is that, considering the expanded dataset, 118 out of the 120 proteins passed the entropy filter as part of a meta-feature, while only 12 of them pass when considered alone in the original dataset. More interestingly, 91 out of these 118 features passed the entropy filter only in metafeatures that do not include any of the 12 features that passed the entropy filter alone. In other words, these features are only interesting to distinguish between cases and controls when the imbalance information is considered, and their importance is not dominated by any of the features that are already known to be interesting.

Tables 2, 3, 4 compare the average results of three signatures obtained in this work and the two signatures previously identified known ones by Gómez Ravetti and Moscato [3] and Ray et al. [1] when performing on the Training Set, Test Set with NDC and samples with AD (referred simply as Test Set), and Test Set with MCI samples that developed and did not develop AD (Test Set MCI), respectively. The values shown are the average results of the 25 classifiers under consideration.

thumbnail
Table 2. Average results of classification performed on the Training Set.

https://doi.org/10.1371/journal.pone.0017481.t002

thumbnail
Table 3. Average results of classification performed on the Test Set with NDC and samples that developed AD.

https://doi.org/10.1371/journal.pone.0017481.t003

thumbnail
Table 4. Average results of classification performed on the Test Set with samples that already had a MCI that developed AD or not.

https://doi.org/10.1371/journal.pone.0017481.t004

First Iteration of the Method

The first signature, obtained in the first iteration of the method depicted in the previous section, included the following features and meta-features: EGF, IL-1, IL-3, TNF-, G-CSF, “BLC (chemokine (C-X-C motif) ligand 13) –RANTES (chemokine (C-C motif) ligand 5)”, “MIP-1d (chemokine (C-C motif) ligand 15) – IL-11 (interleukin 11)”, “TNF-–ANG-2 (angiopoietin-2)”, “TNF-–FAS (Tumor Necrosis Factor receptor superfamily, member 6)” and “IL-11–I-TAC (chemokine (C-X-C motif) ligand 11)”. Referred as “S1”, it improved Ray et al.'s [1] classification accuracy of 93% to 94.4%. Gómez Ravetti and Moscato's [3] signature still holds the best results when performing on the test set with NDC and samples that developed AD, with 92.3% of accuracy in average. Their results still outperform all others both in sensitivity (94.8%) and specificity (90.2%).

When performing on the test set with MCI samples that developed AD and MCI samples that did not, this signature improved Ray et al.'s [1] accuracy of 66.2% to 70.4% mainly due to an improvement of 6.5% in sensitivity, which reached 82%. Although an improvement in specificity was not expected, since there are no MCI samples in the training set, it was also raised to 60.2%.

Figure 6 shows the number of classifiers that disagree with the clinical diagnosis label for each sample, when performing against the training set. It is interesting to notice that there is a set of samples that several classifiers, of a wide-range of different types, consistently disagree with the clinical diagnosis label attributed to a sample. Therefore, it is reasonable to think that these samples might either be mislabelled, that the clinical diagnosis is inadequate, or other latent clinical factors, such as the presence of other existing patient conditions (diseases, medication, or other factors) affected the cell signalling proteins present in this signature. This is consistent with Ray et al.'s [1] remark that there might be mislabelled samples in the dataset, due to the fact that the patients were still alive and an accurate diagnosis could only be issued with the post-mortem analysis of the brain cortex.

thumbnail
Figure 6. Histogram of the number of classifiers, out of the 25 under consideration, that disagreed with the clinical diagnosis attributed to the samples, when performing on the training set using the proposed 4 feature and 6 meta-feature signature.

The performance of the signature obtained after the removal of samples s4, s7, s47, s66 and s77, against the training set, improved significantly. More than 30% of the classifiers disagreed with the clinical diagnosis of only one sample s15. However, since only 8 classifiers got it wrong, which was the threshold limit rounded up, and no classifier disagreed with the clinical diagnosis of this sample on the previous iteration, it was not considered problematic and the training set was not reduced even more. The signature used for this experiment includes the following features and meta-features: EGF, IL-1, TNF-, G-CSF, “EGF-IGFBP-2”, “GM-CSF-IL-1”, “IL-1-IL-11”, “MIP-1d-NT-3”, “PDGF-BB-VEGF-B” and “TNF--ANG-2”.

https://doi.org/10.1371/journal.pone.0017481.g006

Second Iteration of the Method

Following the hypothesis that some of the training samples might not have the correct label or might not characterise their target classes well, a new training set was generated by disconsidering samples for which more than 30% of the classifiers disagreed about the label: s3, s7, s47, s66 and s77. This threshold was determined through the visual analysis of Figure 6, aiming to cut off the highest histogram peaks while observing the samples that did not cluster to their clinical diagnosis label group also in Figure 4 (samples s47, s77, s54, s50, s66, s68, s3 and s1), and trying not to discard too many samples. It is interesting to notice that amongst the samples that did not cluster well on Figure 4 only sample s54 did not appear on Figure 6.

Using this new training set, a new signature was obtained on a second iteration of the proposed method. The new signature, referred as “S2” included the following features and meta-features: EGF, IL-1, TNF-, G-CSF, “EGF–IGFBP-2 (insulin-like growth factor binding protein 2, 36 kDa)”, “GM-CSF (colony stimulating factor 2 (granulocyte-macrophage)) –IL-1”, “IL-1–IL-11”, “MIP-1d–NT-3 (neurotrophin 3)”, “PDGF-BB (platelet-derived growth factor beta polypeptide (simian sarcoma viral (v-sis) oncogene homologue)) –VEGF-B (vascular endothelial growth factor B)” and “TNF-–ANG-2”. It is worth noticing that, after filtering the training set, the obtained signature no longer included IL-3 because the entropy filter discarded it. That could be because most of the information provided by IL-3 that distinguished cases and controls was in the excluded samples. Even though some of the meta-features were also replaced in this signature, that was not because they did not pass the entropy filter, but because the feature selection method chose to select different ones.

The performance of this second signature on the training set still outperforms the other signatures, reaching an average of 98% of learning accuracy, against Ray et al.'s [1] 93%. This second signature matches Ray et al.'s [1] accuracy of 90% against the Test Set with AD and NDC samples, but is still outperformed by Gómez Ravetti and Moscato's [3] accuracy of 92.3%.

The most remarkable characteristic of this new signature, however, is not the improvement in total accuracy, but in sensitivity, when the Test Set with samples that already had a MCI is used as benchmark.

Even though a loss in specificity is observed, a good performance for this item is not expected, as no training samples had MCI. Figure 7 shows that, using the second signature, more than 30% of the classifiers disagree about the clinical diagnosis label of only 1 sample (s15). Therefore a third iteration takes place with a new training set that disconsiders this sample. However, the obtained signature is the same as that obtained in the previous iteration, which interrupts the procedure.

thumbnail
Figure 7. Stacked values of the differences of Z-Scores of only the meta-features of the 4 feature and 6 meta-feature signature.

The signature used in this experiment includes the following meta-features: “EGF-IGFBP-2”, “IL-1-GM-CSF”, “IL-1-IL-11”, “MIP-1d-NT-3”, “PDGF-BB-VEGF-B” and “TNF--ANG-2”. Figures 7A–C present the stacked values of of the differences of Z-Scores in the training, independent test set of samples with AD and NDC samples, and the independent test set with MCI samples that developed and did not develop AD, respectively. The figures show that the stacked values of this panel of 6 meta-features are lower in those patients that will develop clinical symptoms of AD in two to five years. In Figures 7A–C the samples marked in red are those that developed AD, the samples marked in blue are those that did not develop AD (NDC Figures 7A–B, and samples that had a MCI and did not develop AD on the Figure 7C) and the ones marked in green are OD. Since these values correspond to a difference of Z-Scored values, the average is common and cancelled, thus the measured values of each protein are no longer relative to the variation of the other 119, but only of the other protein involved in the meta-feature. It is interesting to notice that this signature, composed only of meta-features, distinguishes well between AD and NDC on both training (see Figure 7A) and test with AD and NDC samples (see Figure 7B) sets and also on the test set with MCI samples that developed and did not develop AD (see Figure 7C). As shown on Figures 7B–C, it is also remarkable that the signature also distinguishes fairly well between samples that developed AD and OD.

https://doi.org/10.1371/journal.pone.0017481.g007

A Step Further

Finally, a signature consisted of the six meta-features of the previous signature is taken under consideration to evaluate their contribution on the observed performance. The meta feature “GM-CSF–IL-1” was replaced by its equivalent meta feature “IL-1–GM-CSF”. They are equivalent from the feature selection point of view because they have the same absolute value. In other words, the Feature Set feature selection approach could select either meta-feature and discard the other, as it would be redundant. Therefore only one is present on the dataset, as this halves the size of the associated problem. This modification is proposed only to help with the visualisation on Figure 5.

Figures 5A–C present the stacked values of of the differences of Z-Scores in the training, independent test set of samples with AD and NDC samples, and the independent test set with MCI samples that developed and did not develop AD, respectively. The figures show that the stacked values of this panel of 6 meta-features are lower in those patients that will develop clinical symptoms of AD in two to five years.

It is interesting to notice that this signature, composed only of meta-features, distinguishes well between AD and NDC on both training (see Figure 5A) and test with AD and NDC samples (see Figure 5B) sets and also on the test set with MCI samples that developed and did not develop AD (see Figure 5C). As shown on Figures 5B–C, it is also remarkable that the signature also distinguishes well between samples that developed AD and OD.

The fact that the average results of this set of tables differed from those of the previous signature by less than 1% on the training set, 2% on the Test Set and even yielded a better sensitivity on the Test Set MCI suggests that these particular single features were not playing a key role to distinguish between AD and NDC in this signature, and supports the theory that there is useful information within the meta-features to distinguish between the classes. Also, since the results for Test Set MCI also did not change significantly, it is reasonable to say that the sample pruning and usage of metafeatures introduced a good generalization in the signature.

Discussion

ANG-2 (ANGPT2, Angiopoietin 2) is a regulator of angiogenesis. Ahmed et al. [22] have recently shown that apoE(−/−) mice that were fed a Western diet had a significant reduction of atherosclerotic lesion size and oxidized LDL and macrophage content of the plaques after a single systemic administration of ANG-2 adenovirus [22]. Thirumangalakudi et al. observed that ANG-2 levels in microvessels were increased in AD patients but not in age-matched controls [23].

Neurons and neuronal nuclei in hippocampus have been reported to express RANTES (CCL5, chemokine (C-C motif) ligand 5) which could induce an inflammatory cell infiltration in AD [24] (see also [25]). RANTES has also been observed as upregulated in the cerebral microcirculation of AD patients (in another study by members of the same team [26]), as well as by other groups of researchers [25], [27]. Even though this biomarker does not appear in any of the selected meta-features of the 10 feature signature, it appeared quite intensely in the 290 feature signature of Figure 4.

FAS/CD95, the Tumor Necrosis Factor Receptor Super Family 6 gene (TNFRSF6), has also appeared in our signature. Increased levels in cerebrospinal fluid in AD patients have been reported in [28]. The upregulation has been motivating several disease mechanistic explanations [29][37]. Several researchers have then tried to find polymorphisms that may have been correlated with AD and that avenue of research has not been highly prosperous [38][42], but there exist some studies with relatively positive results [34], [41], [42].

Serum levels of BLC have been reported as being elevated in multiple sclerosis [43]. Weiss et al. have shown that neural precursors cells express a receptor for BLC [44]. Upregulation of BLC was observed in scrapie-infected brain tissue in [45]. Baker, Martin and Manuelidis also reported in 2002 that microglia of Creutzfeldt-Jakob disease-infected brains characteristically present an upregulation of BLC [46]. In contrast, the selection of a metafeature involving RANTES and BLC indicates that the difference of z-scores of RANTES and BLC are differentially observed in AD and NDC participants of this study. As RANTES upregulation in AD has been put forward as a mechanism for neuroprotection [26] the concurrent lack of upregulation of BLC may point to a protective response that is not properly functioning in early AD worth investigating.

NT-3 (neurotrophin 3, Nerve growth factor 2) [47][50] also appears in a meta feature with MIP-1delta (CCL15, chemokine (C-C motif) ligand 15). This is a novel biomarker that may interest several AD researchers as the selective targeting of several neurotrophin receptors has been proposed as a viable mechanism of intervention for neuroprotection [51][56] (with much of the attention being on the p75(NTR), the common neurotrophin receptor [53], [57][66]). Hippocampal upregulation of NT-3 has been observed in mouse models of AD [67]. The ratio of NGF/NT-3 (NGF is the Nerve growth factor) was observed to be significantly upregulated in AD (in a comparison with control samples) in hippocampus and frontal cortex [68]. Lesne et al. propose that NT-3 reduces Abeta-induced apoptosis by limiting the cleavage of caspase-3, caspase-8, and caspase-9 [69].

The joint identification of PDGFB/PDGF-BB (platelet-derived growth factor beta polypeptide (simian sarcoma viral (v-sis) oncogene homologue), a member of the neurotrophic factor family [70] and VEGFB/VEGF-B (vascular endothelial growth factor B) is intriguing. At the time of the publication of Ray et al.'s [1] manuscript, on which database our work is based, VEGF-B was generally recognized as an angiogenic factor, although of low activity. Almost a year later, Poesen et al. proposed that the 60 kDa VEGF-B isoform is a neuroprotective factor [71] and Falk et al. later shown that exogenous VEGF-B is neuroprotective in a culture model of Parkinson's disease [72]. New roles for VEGF-B are being discovered, like those on lipid uptake, more specifically, on controlled endothelial uptake of fatty acids [73].

A link with CSF2/GM-CSF (colony stimulating factor 2 (granulocyte-macrophage)) with AD is more well-established [36], [74][78]. A similar remark could be applied to IGFBP-2 [79], [80].

It is also interesting to note that TNF-alpha was present in all signatures. A very recent study by O'Bryant et al. with serum protein based multiplex biomarker data from 197 patients diagnosed with AD and 203 controls showed a 0.74 fold change in AD patients [81]. They have also observed a 0.7 fold change on G-CSF (colony stimulating factor 3 (granulocyte)). These results may somewhat indicate that two of the proteins in Gómez Ravetti and Moscato's 5-protein signature [3], and the signature obtained on the first iteration of our method may indeed change in both studies.

The work of [82] showed that there are indications that plasma levels of EGF are linked with cognitive decline in Parkinsons disease, indicating it may not be entirely AD-specific as single biomarker.

Conclusions

In this paper we modelled the relative protein imbalance using “artificial” features, called “metafeatures”. Selecting features and metafeatures using the Feature Set Problem approach it was possible to show that a specific pattern of cell signalling imbalance in blood plasma provided valuable information for distinguishing between NDC and AD patients. Moreover, the obtained signatures were able to predict AD in patients that already had MCI with up to 84% sensitivity, while also maintaining a strong prediction accuracy of 90% on a independent dataset with NDC and AD samples.

Using a data-pruning strategy, we found good evidence that, as already remarked by Ray et al. [1], the dataset indeed had “suspicious” training samples, that could have the wrong diagnosis label or did not characterise their classes well due to other clinical factors. We believe that their removal could have introduced better generalisation to the obtained signatures. That also supports the theory that, even though our reported accuracy for predicting AD and NDC is lower than the best reported [3], it does not necessarilly mean that the signature does not perform well, as there might also be test samples with the wrong clinical diagnosis or that also do not characterise their classes well due to other clinical factors.

The novel biomarkers uncovered with the proposed method now confirms ANG-2, IL-11, PDGF-BB, CCL15/MIP-1; and supports the joint measurement of other signalling proteins in plasma not previously discussed: GM-CSF, NT-3, IGFBP-2 and VEGF-B.

Acknowledgments

The authors would like to thank L. Milward, D. Johnstone and J. Marsden for their comments on a preliminary version of this paper.

Author Contributions

Conceived and designed the experiments: MRP PM. Performed the experiments: MRP. Analyzed the data: MRP MGR RB PM. Contributed reagents/materials/analysis tools: RB PM. Wrote the paper: MRP.

References

  1. 1. Ray S, Britschgi M, Herbert C, Takeda-Uchimura Y, Boxer A, et al. (2007) Classification and prediction of clinical Alzheimer's diagnosis based on plasma signaling proteins. Nat Med 13: 1359–1362.
  2. 2. Soares HD, Chen Y, Sabbagh M, Rohrer A, Schrijvers E, et al. (2009) Identifying Early Markers of Alzheimer's Disease using Quantitative Multiplex Proteomic Immunoassay Panels. Annals of the New York Academy of Sciences 1180: 56–67.
  3. 3. Gómez Ravetti M, Moscato P (2008) Identification of a 5-protein biomarker molecular signature for predicting Alzheimer's disease. PLoS One 3: e3111.
  4. 4. Berretta R, Costa W, Moscato P (2008) Combinatorial optimization models for finding genetic signatures from gene expression datasets. Methods Mol Biol 453: 363–377.
  5. 5. Hourani M, Berretta R, Mendes A, Moscato P (2008) Genetic Signatures for a Rodent Model of Parkinson's Disease Using Combinatorial Optimization Methods. pp. 379–392.
  6. 6. Walker JM, Mendes A, Scott RJ, Moscato P (2008) Microarrays–identifying molecular portraits for prostate tumors with different gleason patterns. In: Trent RJ, editor. pp. 131–151. Clinical Bioinformatics, Humana Press, volume 141 of Methods in Molecular Medicine.
  7. 7. Rosso OA, Mendes A, Berretta R, Rostas JA, Hunter M, et al. (2009) Distinguishing childhood absence epilepsy patients from controls by the analysis of their background brain electrical activity (II): a combinatorial optimization approach for electrode selection. J Neurosci Methods 181: 257–267.
  8. 8. Gómez Ravetti M, Berretta R, Moscato P (2009) Novel Biomarkers for Prostate Cancer Revealed by (α,β)-k-Feature Sets. pp. 149–175.
  9. 9. Cotta C, Moscato P (2003) The k-Feature Set Problem is W[2]-complete. Journal of Computer and System Sciences 67: 686–690.
  10. 10. Cotta C, Sloper C, Moscato P (2004) Evolutionary Search of Thresholds for Robust Feature Set Selection: Application to the Analysis of Microarray Data. In: Raidl GR, Cagnoni S, Branke J, Corne DW, Drechsler R, et al., editors. pp. 21–30. Applications of Evolutionary Computing, Springer Berlin/Heidelberg, Lecture Notes in Computer Science.
  11. 11. Berretta R, Mendes A, Moscato P (2007) Selection of Discriminative Genes in Microarray Experiments Using Mathematical Programming. Journal of Research and Practice in Information Technology 39: 287–299.
  12. 12. Cotta C, Langston MA, Moscato P (2007) Combinatorial and Algorithmic Issues for Microarray Analysis. pp. 74.1–74.14.
  13. 13. Witten IH, Frank E (2005) Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann, 2nd edition.
  14. 14. Moscato P, Mathieson L, Mendes A, Berretta R (2005) The electronic primaries: predicting the U.S. presidency using feature selection with safe data reduction. ACSC '05: Proceedings of the Twenty-eighth Australasian conference on Computer Science. Darlinghurst, Australia, Australia: Australian Computer Society, Inc. pp. 371–379.
  15. 15. Moscato P, Mendes A, Berretta R (2007) Benchmarking a memetic algorithm for ordering microarray data. Biosystems 88: 56–75.
  16. 16. Gómez Ravetti M, Rosso OA, Berretta R, Moscato P (2010) Uncovering Molecular Biomarkers That Correlate Cognitive Decline with the Changes of Hippocampus' Gene Expression Profiles in Alzheimer's Disease. PloS one 5: e10153.
  17. 17. Fayyad UM, Irani KB (1993) Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning. 13th International Joint Conference on Artificial Intelligence. Morgan Kaufmann.
  18. 18. Bauer E, Kohavi R (1999) An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants. Mach Learn 36: 105–139.
  19. 19. Tibshirani R, Hastie T, Narasimhan B, Chu G (2002) Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proceedings of the National Academy of Sciences of the United states of America 99: 6567–6572.
  20. 20. Vezhnevets A, Barinova O (2007) Avoiding boosting overfitting by removing confusing samples. Machine Learning: ECML 2007 4701/2007 430–441.
  21. 21. Angelova A, Abu-Mostafa Y, Perona P (2005) Pruning training sets for learning of object categories. pp. 494–501.
  22. 22. Ahmed A, Fujisawa T, Niu XL, Ahmad S, Al-Ani B, et al. (2009) Angiopoietin-2 confers atheroprotection in apoE−/−mice by inhibiting LDL oxidation via nitric oxide. Circulation research 1333–1336.
  23. 23. Thirumangalakudi L, Samany PG, Owoso A, Wiskar B, Grammas P (2006) Angiogenic proteins are expressed by brain blood vessels in Alzheimer's disease. Journal of Alzheimer's Disease 10: 111–118.
  24. 24. Fialaa M, Lina J, Ringmanb J, Kermani-Arabc V, Tsaoa G, et al. (2005) Ineffective phagocytosis of amyloid-{ß} by macrophages of Alzheimer's disease patients. Journal of Alzheimer's Disease 7: 221–232.
  25. 25. Iarlori C, Gambi D, Gambi F, Lucci I, Feliciani C, et al. (2005) Expression and production of two selected beta-chemokines in peripheral blood mononuclear cells from patients with Alzheimer's disease. Experimental gerontology 40: 605–611.
  26. 26. Tripathy D, Thirumangalakudi L, Grammas P (2010) RANTES upregulation in the Alzheimer's disease brain: A possible neuroprotective role. Neurobiology of Aging 31: 8–16.
  27. 27. Reale M, Iarlori C, Feliciani C, Gambi D (2008) Peripheral chemokine receptors, their ligands, cytokines and Alzheimer's disease. Journal of Alzheimer's Disease 14: 147–159.
  28. 28. Richartz E, Noda S, Schott K, Günthner A, Lewczuk P, et al. (2000) Increased serum levels of CD95 in Alzheimer's disease. Dementia and geriatric cognitive disorders 13: 178–182.
  29. 29. Frey C, Bonert A, Kratzsch T, Rexroth G, Rösch W, et al. (2006) Apolipoprotein E epsilon 4 is associated with an increased vulnerability to cell death in Alzheimer's disease. Journal of Neural Transmission 113: 1753–1761.
  30. 30. Lombardi VR, Fernandez-Novoa L, Etcheverria I, Seoane S, Cacabelos R (2004) Association between APOE epsilon4 allele and increased expression of CD95 on T cells from patients with Alzheimer's disease. Methods Find Exp Clin Pharmacol 26: 523–529.
  31. 31. Yew DT, Li P, Others (2004) Fas and activated caspase 8 in normal, Alzheimer and multiple infarct brains. Neuroscience letters 367: 113–117.
  32. 32. Su JH, Anderson AJ, Cribbs DH, Tu C, Tong L, et al. (2003) Fas and Fas Ligand are associated with neuritic degeneration in the AD brain and participate in [beta]-amyloid-induced neuronal death. Neurobiology of disease 12: 182–193.
  33. 33. de la Monte SM, Chiche JD, von dem Bussche A, Sanyal S, Lahousse SA, et al. (2003) Nitric oxide synthase-3 overexpression causes apoptosis and impairs neuronal mitochondrial function: relevance to Alzheimer's-type neurodegeneration. Laboratory investigation 83: 287–298.
  34. 34. Feuk L, Prince J, Breen G, Emahazion T, Carothers A, et al. (2000) Apolipoprotein-E dependent role for the FAS receptor in early onset Alzheimer's disease: finding of a positive association for a polymorphism in the TNFRSF6 gene. Human genetics 107: 391–396.
  35. 35. Nishimura T, Akiyama H, Yonehara S, Kondo H, Ikeda K, et al. (1995) Fas antigen expression in brains of patients with Alzheimer-type dementia. Brain research 695: 137–145.
  36. 36. Tarkowski E, Blennow K, Wallin A, Tarkowski A (1999) Intracerebral production of tumor necrosis factor-alpha, a local neuroprotective agent. Alzheimer disease and vascular dementia J Clin Immunol 19: 223–230.
  37. 37. de La Monte SM, Wands JR (2001) Alzheimer-associated neuronal thread protein-induced apoptosis and impaired mitochondrial function in human central nervous system-derived neuronal cells. Journal of Neuropathology & Experimental Neurology 60: 195–207.
  38. 38. Andreoli V, Nicoletti G, Romeo N, Condino F, La Russa A, et al. (2007) Fas antigen and sporadic Alzheimer's disease in Southern Italy: evaluation of two polymorphisms in the TNFRSF6 gene. Neurochemical Research 32: 1445–1449.
  39. 39. He X, Zhang Z, Zhang J, Zhou Y, Tang M, et al. (2006) The Fas gene A-670G polymorphism is not associated with sporadic Alzheimer disease in a Chinese Han population. Brain research 1082: 192–195.
  40. 40. Rosenmann H, Meiner Z, Kahana E, Aladjem Z, Friedman G, et al. (2000) The Fas Promoter Polymorphism at Position–670 Is Not Associated with Late-Onset Sporadic Alzheimer's Disease. Dementia and geriatric cognitive disorders 17: 143–146.
  41. 41. Feuk L, Prince JA, Blennow K, Brookes AJ (2002) Further evidence for role of a promoter variant in the TNFRSF6 gene in Alzheimer disease. Human mutation 21: 53–60.
  42. 42. Chiappelli M, Nasi M, Cossarizza A, Porcellini E, Tumini E, et al. (2006) Polymorphisms of Fas Gene: Relationship with Alzheimer's Disease and Cognitive Decline. Dementia and geriatric cognitive disorders 22: 296–300.
  43. 43. Festa E, Hankiewicz K, Kim S, Skurnick J, Wolansky L, et al. (2009) Serum levels of CXCL13 are elevated in active multiple sclerosis. Multiple Sclerosis 15: 1271–1279.
  44. 44. Weiss N, Deboux C, Chaverot N, Miller F, Baron-Van Evercooren A, et al. (2010) IL8 and CXCL13 are potent chemokines for the recruitment of human neural precursor cells across brain endothelial cells. Journal of Neuroimmunology 223: 131–134.
  45. 45. Riemer C, Queck I, Simon D, Kurth R, Baier M (2000) Identification of upregulated genes in scrapie-infected brain tissue. Journal of Virology 74: 10245–10248.
  46. 46. Baker C, Martin D, Manuelidis L (2002) Microglia from Creutzfeldt-Jakob disease-infected brains are infectious and show specific mRNA activation profiles. Journal of virology 76: 10905–10913.
  47. 47. Arenas E, Persson H (1994) Neurotrophin-3 prevents the death of adult central noradrenergic neurons in vivo. Nature 368–371.
  48. 48. Hock C, Heese K, Müller-Spahn F, Huber P, Riesen W, et al. (2000) Increased cerebrospinal fluid levels of neurotrophin 3 (NT-3) in elderly patients with major depression. Molecular psychiatry 5: 510–513.
  49. 49. Durany N, Michel T, Kurt J, Cruz-Sánchez FF, Cervós-Navarro J, et al. (2000) Brain-derived neurotrophic factor and neurotrophin-3 levels in Alzheimer's disease brains. International Journal of Developmental Neuroscience 18: 807–813.
  50. 50. Narisawa-Saito M, Wakabayashi K, Tsuji S, Takahashi H, Nawa H (1996) Regional specificity of alterations in NGF, BDNF and NT-3 levels in Alzheimer's disease. Neuroreport 7: 2925–2928.
  51. 51. Saragovi HU, Hamel E, Di Polo A (2009) A Neurotrophic Rationale for the Therapy of Neurodegenerative Disorders. Current Alzheimer Research 6: 419–423.
  52. 52. Poon WW, Blurton-Jones M, Tu CH, Feinberg LM, Chabrier MA, et al. (2009) [beta]-amyloid impairs axonal bdnf retrograde trafficking. Neurobiology of Aging In Press, Corrected Proof: -.
  53. 53. Knowles JK, Rajadas J, Nguyen TVV, Yang T, LeMieux MC, et al. (2009) The p75 Neurotrophin Receptor Promotes Amyloid-{beta}(1–42)-Induced Neuritic Dystrophy In Vitro and In Vivo. Journal of Neuroscience 29: 10627–10637.
  54. 54. Schulte-Herbrüggen O, Jockers-Scherübl MC, Hellweg R (2008) Neurotrophins: from pathophysiology to treatment in Alzheimer's disease. Current Alzheimer research 5: 38–44.
  55. 55. Longo FM, Yang T, Knowles JK, Xie Y, Moore LA, et al. (2007) Small Molecule Neurotrophin Receptor Ligands: Novel Strategies for Targeting Alzheimers Disease Mechanisms. Current Alzheimer Research 4: 503–506.
  56. 56. Cole GM, Frautschy SA (2007) The role of insulin and neurotrophic factor signaling in brain aging and Alzheimer's Disease. Experimental gerontology 42: 10–21.
  57. 57. Fombonne J, Rabizadeh S, Banwait S, Mehlen P, Bredesen DE (2009) Selective vulnerability in Alzheimer's disease: Amyloid precursor protein and p75NTR interaction. Annals of neurology 65: 294–303.
  58. 58. Masoudi R, Ioannou MS, Coughlin MD, Pagadala P, Neet KE, et al. (2009) Biological activity of nerve growth factor precursor is dependent upon relative levels of its receptors. Journal of Biological Chemistry 284: 18424.
  59. 59. Diarra A, Geetha T, Potter P, Babu JR (2009) Signaling of the neurotrophin receptor p75 in relation to Alzheimer's disease. Biochemical and biophysical research communications 390: 352–356.
  60. 60. Coulson EJ, May LM, Sykes AM, Hamlin AS (2009) The Role of the p75 Neurotrophin Receptor in Cholinergic Dysfunction in Alzheimer's Disease. The Neuroscientist 15: 317–323.
  61. 61. Bengoechea TG, Chen Z, O'Leary D, Masliah E, Lee KF (2009) p75 reduces β-amyloid-induced sympathetic innervation deficits in an Alzheimer's disease mouse model. Proceedings of the National Academy of Sciences 106: 7870–7875.
  62. 62. Sotthibundhu A, Sykes AM, Fox B, Underwood CK, Thangnipon W, et al. (2008) Beta-amyloid(1–42) induces neuronal death through the p75 neurotrophin receptor. J Neurosci 28: 3941–3946.
  63. 63. Coulson EJ (2006) Does the p75 neurotrophin receptor mediate Abeta-induced toxicity in Alzheimer's disease? J Neurochem 98: 654–660.
  64. 64. Chiarini A, Dal Pra I, Whitfield JF, Armato U (2006) The killing of neurons by beta-amyloid peptides, prions, and pro-inflammatory cytokines. Italian journal of anatomy and embryology 111: 221–246.
  65. 65. Mirnics ZK, Yan C, Portugal C, Kim TW, Saragovi HU, et al. (2005) P75 neurotrophin receptor regulates expression of neural cell adhesion molecule 1. Neurobiology of disease 20: 969–985.
  66. 66. Yaar M, Zhai S, Pilch PF, Doyle SM, Eisenhauer PB, et al. (1997) Binding of beta-amyloid to the p75 neurotrophin receptor induces apoptosis. A possible mechanism for Alzheimer's disease. Journal of Clinical Investigation 100: 2333–2340.
  67. 67. Wolf SA, Kronenberg G, Lehmann K, Blankenship A, Overall R, et al. (2006) Cognitive and physical activity differently modulate disease progression in the amyloid precursor protein (APP)-23 model of Alzheimer's disease. Biological psychiatry 60: 1314–1323.
  68. 68. Hock C, Heese K, Hulette C, Rosenberg C, Otten U (2000) Region-specific neurotrophin imbalances in Alzheimer disease: decreased levels of brain-derived neurotrophic factor and increased levels of nerve growth factor in hippocampus and cortical areas. Archives of neurology 57: 846–851.
  69. 69. Lesne S, Gabriel C, Nelson DA, White E, MacKenzie ET, et al. (2005) Akt-dependent expression of NAIP-1 protects neurons against amyloid-β toxicity. Journal of Biological Chemistry 280: 24941–24947.
  70. 70. Xiyang YB, Liu S, Liu J, Hao CG, Wang ZJ, et al. (2009) Roles of Platelet-Derived Growth Factor-B Expression in the Ventral Horn and Motor Cortex in the Spinal Cord–Hemisected Rhesus Monkey. Journal of neurotrauma 26: 275–287.
  71. 71. Poesen K, Lambrechts D, Van Damme P, Dhondt J, Bender F, et al. (2008) Novel role for vascular endothelial growth factor (VEGF) receptor-1 and its ligand VEGF-B in motor neuron degeneration. Journal of Neuroscience 28: 10451–10459.
  72. 72. Falk T, Zhang S, Sherman SJ (2009) Vascular endothelial growth factor B (VEGF-B) is up-regulated and exogenous VEGF-B is neuroprotective in a culture model of Parkinson's disease. Molecular Neurodegeneration 1: 49.
  73. 73. Hagberg CE, Falkevall A, Wang X, Larsson E, Huusko J, et al. (2010) Vascular endothelial growth factor B controls endothelial fatty acid uptake. Nature 917–921.
  74. 74. Sanchez-Ramos J, Song S, Sava V, Catlow B, Lin X, et al. (2009) Granulocyte colony stimulating factor decreases brain amyloid burden and reverses cognitive impairment in Alzheimer's mice. Neuroscience 163: 55–72.
  75. 75. Manczak M, Mao P, Nakamura K, Bebbington C, Park B, et al. (2009) Neutralization of granulocyte macrophage colony-stimulating factor decreases amyloid beta 1–42 and suppresses microglial activity in a transgenic mouse model of Alzheimer's disease. Human molecular genetics 18: 3876–3893.
  76. 76. Cao C, Arendash GW, Dickson A, Mamcarz MB, Lin X, et al. (2009) A [beta]-specific Th2 cells provide cognitive and pathological benefits to Alzheimer's mice without infiltrating the CNS. Neurobiology of disease 34: 63–70.
  77. 77. Danielyan L, Klein R, Hanson LR, Buadze M, Schwab M, et al. (2010) Protective Effects of Intranasal Losartan in the APP/PS1 Transgenic Mouse Model of Alzheimer Disease. Rejuvenation Research 541–546.
  78. 78. Murphy GM Jr, Yang L, Cordell B (1998) Macrophage colony-stimulating factor augments beta-amyloid-induced interleukin-1, interleukin-6, and nitric oxide production by microglial cells. J Biol Chem 273: 20967–20971.
  79. 79. Tham A, Nordberg A, Grissom FE, Carlsson-Skwirut C, Viitanen M, et al. (1993) Insulin-like growth factors and insulin-like growth factor binding proteins in cerebrospinal fluid and serum of patients with dementia of the Alzheimer type. Journal of Neural Transmission: Parkinson's Disease and Dementia Section 5: 165–176.
  80. 80. Moloney AM, Griffin RJ, Timmons S, O'Connor R, Ravid R, et al. (2010) Defects in IGF-1 receptor, insulin receptor and IRS-1/2 in Alzheimer's disease indicate possible resistance to IGF-1 and insulin signalling. Neurobiol Aging 31: 224–243.
  81. 81. O'Bryant SE, Xiao G, Barber R, Reisch J, Doody R, et al. (2010) A Serum Protein-Based Algorithm for the Detection of Alzheimer Disease. Archives of Neurology 67: 1077–1081.
  82. 82. Chen-Plotkin A, Hu W, Siderowf A, Weintraub D, Goldmann Gross R, et al. (2009) Plasma epidermal growth factor levels predict cognitive decline in Parkinson disease. Annals of Neurology.