Whole exome sequencing provides unprecedented opportunities to identify causative DNA variants in rare Mendelian disorders. Finding the responsible mutation via traditional methods in families with hearing loss is difficult due to a high degree of genetic heterogeneity. In this study we combined autozygosity mapping and whole exome sequencing in a family with 3 affected children having nonsyndromic hearing loss born to consanguineous parents. Two novel missense homozygous variants, c.508C>A (p.H170N) in GIPC3 and c.1328C>T (p.T443M) in ZNF57, were identified in the same ~6 Mb autozygous region on chromosome 19 in affected members of the family. Both variants co-segregated with the phenotype and were absent in 335 ethnicity-matched controls. Biallelic GIPC3 mutations have recently been reported to cause autosomal recessive nonsyndromic sensorineural hearing loss. Thus we conclude that the hearing loss in the family described in this report is caused by a novel missense mutation in GIPC3. Identified variant in GIPC3 had a low read depth, which was initially filtered out during the analysis leaving ZNF57 as the only potential causative gene. This study highlights some of the challenges in the analyses of whole exome data in the bid to establish the true causative variant in Mendelian disease.
Citation: Sirmaci A, Edwards YJK, Akay H, Tekin M (2012) Challenges in Whole Exome Sequencing: An Example from Hereditary Deafness. PLoS ONE 7(2): e32000. doi:10.1371/journal.pone.0032000
Editor: Iris Schrijver, Stanford University School of Medicine, United States of America
Received: October 27, 2011; Accepted: January 17, 2012; Published: February 21, 2012
Copyright: © 2012 Sirmaci et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported by National Institutes of Health - National Institute on Deafness and Other Communication Disorders grant R01DC009645 to MT. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Hearing loss is one of the most common sensorial disorders in humans. Genetic factors account for more than 50% of cases with congenital or prelingual hearing loss, with autosomal recessive (77%), autosomal dominant (22%), and X-linked inheritance (1%) , . Identification of the responsible mutation in families with autosomal recessive nonsyndromic hearing loss is difficult since there are mutations in 40 different genes identified for this common form of deafness (Hereditary Hearing Loss Homepage_http://hereditaryhearingloss.org/). A two step approach combining linkage analysis and whole exome sequencing based on the next generation technologies have been applied previously in other studies for Mendelian disease ,  including deafness , . The first step uses the genome wide SNP genotyping to identify autozygous regions when parental consanguinity is present and narrows down the search space for possible loci. The second step examines exome sequences to identify genetic variation at base-pair resolution and survey the protein coding portion of the human genome . The integrated approach is faster and more cost efficient than the sequencing various candidate genes with the traditional Sanger sequencing techniques since the resulting loci generated from linkage are typically too large . Thus whole exome sequencing using the next generation technologies provides a new and transformational approach for identifying causative mutations in Mendelian disorders , –. Here, we apply this two step approach, face the challenges, and eventually uncover a novel mutation causing hereditary hearing loss in a family. This study provides some comprehensive insights which would be valuable in certain scenarios and will help minimize certain limitations in using the new whole exome sequencing technologies.
Materials and Methods
This study was approved by the Ankara University Medical School Ethics Committee (Turkey), and by the University of Miami Institutional Review Board (USA). All participants provided written informed consent prior to enrollment. Written informed consent was obtained from the next of kin on the behalf of the minors/children participants involved in this study. A family with three affected children who were diagnosed with sensorineural hearing loss via standard audiometry was recruited (Figures 1A and 1B). A thorough clinical evaluation including an ophthalmological exam and high resolution CT scans of the temporal bone in affected family members were normal. EKGs, liver and kidney function tests, serum electrolytes, urinalysis, CBC, and leukocyte subgroups were all within normal limits in affected subjects. DNA was extracted from peripheral leukocytes of each member of the family via a phenol chloroform method. Obtained samples were prescreened for mutations in GJB2 (MIM 121011) via Sanger sequencing of both exons and for the m.1555A>G mutation in MTRNR1 (MIM 561000). Heterozygous p.M163V amino acid change was found in GJB2 in all three affected and three unaffected siblings and in father but no other GJB2 mutations were present in the affected members.
Figure 1. Pedigree with haplotypes, audiograms and two identified variants.
(A) The pedigree and the third longest autozygous region on chromosome 19 that co-segregates with the phenotype. (B) Audiograms of affected members in the family. (C) Electropherograms showing the wild type, homozygous and heterozygous forms of the variant p.T443M in ZNF57. (D) Electropherograms showing the wild type, homozygous and heterozygous form of the variant p.H170N in GIPC3.doi:10.1371/journal.pone.0032000.g001
Genome-wide SNP genotyping was performed in six members of the family (III-1, III-2, IV-3, IV-5, IV-6, and IV-7) using Affymetrix 6.0 arrays. Genotypes were transferred into Excel files and sorted according to genomic positions along with all 40 previously identified autosomal recessive nonsyndromic deafness genes. The co-segregation of the flanking genotypes for each gene was visually evaluated.
In order to identify the responsible variant, genomic DNA of IV-7 was evaluated by whole exome sequencing. The Agilent Human SureSelect 50 MB kit was used to extract the target regions from genomic libraries for exome sequencing. The sample was multiplexed with two other samples in a single lane of an Illumina HiSeq 2000 flow cell allowing for the sequencing of 101 bp paired-end reads. Raw data were analyzed using v1.7 of the Illumina CASAVA pipeline to extract the reads.
The reads were aligned with the human genome reference sequence (hg19 build), using the Mapping and Assembly with Quality (MAQ) software v0.7.1 . Pairs of reads with identical outer coordinates were removed to improve the overall accuracy of variant calling. Variants (SNPs and indels) were called with MAQ. SNPs with a read coverage ≥8× and a Phred-like consensus quality of ≥20 were considered in the initial analysis. SNPs were annotated with SeattleSeq (http://gvs.gs.washington.edu/SeattleSeqAnnotation/) version 6.13 into functional categories such as missense, nonsense, splice sites, coding, non-coding, UTRs. DNA variants were filtered against dbSNP132  and phase 3 of the 1000 Genomes project. PolyPhen2 predictions were generated for non-synonymous SNPs . Indels with read coverage ≥8× covered by reads from both strands were predicted. Indels were annotated with SeattleSeq with respect to relative position of a gene (eg coding sequence, downstream or intronic) and their affects (eg frame-shift, amino-acid insertion or amino-acid deletion). After the initial analysis SNPs and indels were re-analyzed with read coverage ≥2× and ≥4× for comparison of the obtained variants.
The Agilent Human SureSelect 50 MB whole exome capture and subsequent sequencing was evaluated by calculating the fraction of the target covered and the average read depth of the target. The MAQ alignments formed the basis of the depth and coverage calculations (Figures 2 A and 2B). On-target and off-target coverage were computed to create wig files that are uploaded into the UCSC Genome Browser  for visual assessment of depth and coverage and further annotate regions of interest (Figures 2C and S1).
Figure 2. Coverage of autozygous regions with whole exome sequencing.
(A) The exome coverage of the five longest autozygous regions. The plot shows the fraction of on-target coverage (Y-axis) and the read depth (X-axis) for the following specified regions. (B) Average coverage at minimum 8× and GC content of five autozygous regions. (C) Coverage of exon 3 in GIPC3. Red arrow indicates mutation point.doi:10.1371/journal.pone.0032000.g002
Three-dimensional models were built using MODELER  from the Accelrys Discovery Studio package. BLAST sequence similarity searches  were used to identify suitable structural templates. This was done by using the query sequence and searching against the database of sequences from the Protein Data Bank (PDB)  using BLAST. The structural templates were selected by choosing the sequence of known structure with the highest percent identity with the query sequence. Software modules from the Accelrys Discovery Studio were used to inspect the molecules for quality control and to create illustrations.
Results of genome-wide SNP genotyping showed that none of the known deafness genes co-segregated with the phenotype. In the family, five autozygous segments longer than 2 Mb were present on chromosomes 8, 15, 16, 19 and 21 (Table 1). These five autozygous regions include 382 annotated genes.
Table 1. Five autozygous regions detected with Affymetrix 6.0 arrays in the family.doi:10.1371/journal.pone.0032000.t001
The exome sequencing experiment of one affected individual (IV-7) achieved the expected number of reads (87,586,240) and target coverage plus average read depth. Eighty seven million reads were generated which constitutes 8.6 gigabases of raw sequence. More than 95% of the reads mapped to the reference genome. Comparable with other labs , –, , , when measured at a minimum depth of 8×, 82% of the target region was covered with an average depth of 68× (Figure 2A). Likewise, when measured at 1× and 20× coverage, nearly 95% and 69% of the intended target was covered with an average depth of 68× and 66× respectively. In terms of variant calls, the MAQ predicted 99,374 SNPs and 5,420 indels.
The five autozygous regions from genome wide genotyping data on chromosomes 8, 15, 16, 19 and 21 were investigated using the results of the exome sequencing. For the five autozygous regions much variation on the read depth was observed. Using a minimum depth of 8 as a filter, the chromosome 8 region has high average read depth of 101× with 92% coverage compared to the chromosome 19 region which has low average read depth of 15× with 38% coverage (Figure 2A). There was a strong negative correlation (r = −0.96) between the percentage of gunanine cytosine (GC) bases and the read depth or coverage of autozygous regions (Figure 2B). We focused on exonic and flanking intronic variants within these five autozygous regions. Table 2 shows the four novel homozygous missense, nonsense, splice site and frame shift variants (not reported in dbSNP132) in the five autozygous regions when we used a filter of minimum 8× read depth. Sanger sequencing confirmed one novel missense variant in the second longest autozygous region on chr8 (76476256A>T) in HNF4G (MIM 605966) and one novel missense variant in the third longest autozygous region on chr19 (2917947C>T) in ZNF57 (no MIM number available) (nucelotide numbers are according to Hg19). We then recruited additional family members who were not typed with Affymetrix 6.0 chips to evaluate co-segregation of these variants. The novel variant c.1263A>T (p.Q421H) in HNF4G did not co-segregate with the phenotype in the entire family but variant c.1328C>T (p.T443M) in ZNF57 did (Figure 1A and C). For the variant in ZNF57 PolyPhen2 classification was possibly damaging with a score of 0.938, MutPred predicted that T443M amino acid substitution caused a gain of catalytic residue at V439 (p = 0.0472) and predicted the g score (probability of deleterious mutation) of 0.497 , . Four coding exons and intron-exon boundaries of ZNF57 were Sanger sequenced and no other nucleotide change was found. The indentified nucleotide change was not found in 335 Turkish controls via Sanger sequencing.
Table 2. Novel missense, nonsense, splice site, and frameshift variants in top five autozygous regions.doi:10.1371/journal.pone.0032000.t002
ZNF57 is a recently discovered human zinc finger gene which has not been implicated in hearing loss. The ZNF57 protein product comprises 555 amino-acids with a KRAB-A domain at the amino-terminus and 13 tandemly arranged C2H2 zinc fingers at the carboxyl-terminus. Over expression of ZNF57 was shown to inhibit the transcriptional activities of NFAT and p21 demonstrating that ZNF57 is likely to function as a negative transcriptional regulator in NFAT-p21 signaling pathway . The variant p.T443M is located in the linker between zinc fingers 10 and 11. The wild type linker in ZNF57 (has the sequence TQEQL) and the canonical zinc finger linker sequence is TGEKP. Both linkers comprise five residues and have a conserved threonine at the first position. Threonine at this position is highly conserved and attains a ConSeq conservation score of 8 in a scale of 1 to 9 (where 9 is most conserved) . This conserved threonine is changed to methionine in the variant form of ZNF57 p.T443M. The functional unit for the zinc finger protein ZNF57 is unknown. Whilst zinc fingers are known to bind DNA, zinc fingers also interact directly with proteins  and RNA  and many have more than one role and form both protein-DNA and protein-protein interactions –.
The full-length ZNF57 sequence was BLASTed against the sequences of the PDB. A designed zinc finger peptide with six zinc fingers known as Aart  had the highest percent sequence identity with the query. The query sequence ZNF57 (with only the four zinc fingers 9, 10, 11 and 12) was aligned with the structural template Aart (PDB Code: 2I13 – chain B) using the Accelrys global alignment program. The alignment comprises 144 topologically equivalent position sharing 47.2% sequence identity (ZNF57:362–505 and 2I13:152–295). Indels were absent in the alignment. Protein structural models for ZNF57 (the wild type and p.T443M) were built (Figure 3). Threonine 443 of the wild type is the last residue at the C-terminal end of the α-helix and likely contributes to the α-helix cap (Figure 3). The methionine side chain is longer, more flexible and is unbranched compared with threonine (Figure 3). Mutation of threonine to methionine is likely to affect DNA binding capability indirectly in several different ways , . Threonine is capable of being phosphorylated whereas methionine is not. Phosphorylation and dephosphorylation is implicated in the regulation of zinc finger protein binding and function and the conserved threonine in the linker region is a prime candidate for this type of regulation. Threonine is on the surface and accessible to possible phosphorylation events . Others suggest that the DNA-induced helix capping in the conserved linker sequence is a determinant of binding affinity in C2H2 zinc fingers . In evolution, threonine is one of the most frequently observed amino acids at this position in the zinc finger domain topology . In mutational studies of the linkers between two contiguous zinc fingers, mutation of threonine to alanine had deleterious effects on DNA binding . Similarly, mutating threonine to leucine in the linker was shown to reduce DNA binding .
Figure 3. Molecular modeling of p.T443M in ZNF57.
The zinc finger domain comprises two β-strands (blue) and one α-helix (red); the turns (green) and the loops (light gray) are shown. Amino-acid residue 443 is pink in the wild type (A) and yellow in the mutant (B).doi:10.1371/journal.pone.0032000.g003
While this work was ongoing a missense mutation in Gipc3 was reported to be associated with age-related sensorineural hearing loss in the mouse, and two missense variants in GIPC3 (MIM 608792) in two small families with sensorineural hearing loss . The autozygous region on chromosome 19 in our family also includes GIPC3, in which no novel variant had passed our filters. We then re-analyzed the exome sequencing data reducing the filter for read depth to ≥4× instead of ≥8×; two additional variants in autozygous regions were detected (Table 2) and only the variant c.508C>A (p.H170N) in exon 3 of GIPC3 was confirmed by Sanger sequencing (Figure 1D). Read depth for this variant was 5 and the exon containing this variant was poorly covered (Figure 2C). PolyPhen2 classification for this variant was probably damaging (score 1.0) and MutPred predicted the g score (probability of deleterious mutation) as 0.850. A ConSeq conservation score of 9 is obtained for H170 showing that this residue is highly conserved. The mutation was absent in 335 healthy ethnicity-matched controls.
The GIPC3 sequences (wild type and p.H170N) were each aligned with the PDZ domain in GIPC2 (PDB code: 3GGE - chain B). The alignment (GIPC3:108–199 and 3GGE:3–95) comprises 92 positions sharing 26.6% sequence identity with no indels. The structural template (3GGE) and the models each have two α-helices and six β-strands. Protein structural models for GIPC3 (the wild type and p.H170N) were built (Figures 4A and 4B). The mutated form of GIPC3 was compared with the wild type, structural differences were observed. In the mutated form of the model, the substrate molecular recognition pocket was larger and the associated charge distribution was reduced (Figures 4C and 4D) compared with the wild type. In the wild type H170 side chain is pointing away from the core and the resulting side chain is solvent accessible whilst the asparagine side chain 170 in the mutated form of GIPC3 points inwards towards the hydrophobic core and forms a tight network of H-bonds. The asparagine side chain forms two side chain H-bonds with two main chain atoms (ASP 128 NH: ASN 170 OD1 and ASN 170 OD2-HD22:THR 127 O) which renders the side chain solvent inaccessible. In addition, N170 forms two main chain to main chain H bonds (ALA 174: ASN 170 O and VAL 173 N:ASN 170 O). Residue 170 is the first residue of α-helix 2. In the wild type the side chain solvent accessibility for residue H170 is greater than 10%. H170 does not form side chain to side chain H-bonds within the PDZ domain (Figures 4E and 4F). Whilst the accepted amino acid residue substitution profile is variable at position 170 across the PDZ superfamily , , histidine for the GIPC family members at this position is invariant. This position 170 coincides with a key ligand binding pocket ,  and the mutation from histidine to asparagine is predicted to alter the ligand binding site with a change of substrate specificity resulting in an adverse alteration in the protein function.
Figure 4. Diagrams of structural models for GIPC3.
The ribbon diagrams for the 3D models of the PDZ domain in GIPC3. The wild type (A) and the mutated p.H170N (B) forms are shown. The key for structural features follow; the α-helices (red); the β-strands (blue); the β-turns (green); the loops (light gray); the side-chain for H170 in the wild type (pink) and the side-chain for N170 highlights the mutation (yellow). The surface diagrams show the surface topology and the interpolated charge distribution of the PDZ domain of GIPC3. Both the wild type (C) and the mutated p.H170N protein (D) are provided. The p.H170N mutated form shows one of the substrate molecular recognition pockets as being deeper with a larger volume compared to the wild type. The p.H170N mutated form shows the interpolated charge distribution as being reduced compared with the wild type. Structural and local environment for position 170 with the H-bond patterns are shown. The GIPC3 wild type (E) shows an absence of side-chain to main-chain H-bonds with the H170 side-chain; whilst the asparagine side-chain (yellow) in the mutated protein (F) forms side-chain to main-chain H-bonds.doi:10.1371/journal.pone.0032000.g004
After finding potential regions through autozygosity mapping we identified two novel rare variants to cause hearing loss in this study (GIPC3 p.H170N and ZNF57 p.T443M). Both variants were good candidates based on genetic and in silico data. It is important to restate that the first analysis was done using minimum read depth of 8× and the variant in GIPC3 was not detected until a less stringent filtering was performed. If GIPC3 was not known as a cause of deafness we could have concluded that the variant in ZNF57 was causative. Human GIPC3 gene encodes a 312 amino acid protein which localizes to the sensory hair cells in the inner ear . In addition to the central PDZ domain, there are two conserved domains as GIPC homologous domain 1 (GH1 domain) and GH2 domain. The GH2 domain of GIPC1 interacts directly with the actin-based molecular motor myosin VI, in which mutations cause hearing loss in humans and mice –. After two missense mutations reported in one Indian and one Dutch family in GIPC3 , very recently six more missense and a nonsense mutations were described in seven Pakistani families  (Figure 5). Following the previously reported p.R189C mutation in a Pakistani family, the p.H170N mutation in the Turkish family described in this study is the second mutation outside of one of the two GH domains. We should note that the mutation ZNF57 p.T443M might not be a red herring. We can speculate that this mutation also may contribute to hearing loss or more likely it might contribute to the susceptibility for another phenotype which was not considered here and that these two variants (ZNF57 p.T443M and GIPC3 p.H170N) are found to co-segregate. Whilst the ZNF57 variant (NC_000019.9:2917947C>T, NM_173480.2:c.1328C>T, NP_775751.1:p.Thr443Met) is absent in dbSNP version 132, we note that in a recent update of the dbSNP (version 135), this variant is present as rs142727006 with an allele frequency of 0.003. The GIPC3 variant (NC_000019.9:3586908) is absent from both the dbSNP (version 135) and the NHLBI Sequencing Project/Exome Variant Server Database. Our study clearly demonstrates some of the challenges faced using the high throughput exome sequencing technologies to find causative mutations in Mendelian disease. Table 3 shows the differences between analyzing the data using coverage filters of 2×, 4× and 8×. The ultimate goal of this combined approach is to identify disease causing mutations accurately and economically. To minimize the number of false negative results, an exome sequencing experiment requires adequate and uniform sequencing depth across the target regions. We demonstrate that despite attaining the expected quality metrics for the number of reads, the amount of DNA generated, the target coverage and the average depth across the intended target, a marked unevenness of capture of one region compared with other regions was evident (Figure 2). The targeted regions were not captured with uniformity (Figure S1) and we present this as an issue which needs to be evaluated with care when using exome sequencing as a tool for Mendelian disease gene discovery. Several factors can influence uniformity and unbiased capture and these include biases in the GC content as we clearly illustrate in this study (Figure 2). This study has provided some comprehensive insights and will be informative for scientists who plan to use exome sequencing technology. In certain scenarios this information may influence analysis or experimental design to reduce some of the limitations in surveying exome sequences for Mendelian disease gene discovery.
Figure 5. The two-dimensional structure of GIPC3 and the localization of identified mutations.doi:10.1371/journal.pone.0032000.g005
Coverage and read depth of five autozygous regions.
Conceived and designed the experiments: HA MT. Performed the experiments: AS. Analyzed the data: YJKE MT. Contributed reagents/materials/analysis tools: AS YJKE MT. Wrote the paper: AS YJKE MT.
- 1. Friedman TB, Griffith AJ (2003) Human nonsyndromic sensorineural deafness. Annual review of genomics and human genetics 4: 341–402.
- 2. Morton CC, Nance WE (2006) Newborn hearing screening–a silent revolution. The New England journal of medicine 354: 2151–2164.
- 3. Becker J, Semler O, Gilissen C, Li Y, Bolz HJ, et al. (2011) Exome sequencing identifies truncating mutations in human SERPINF1 in autosomal-recessive osteogenesis imperfecta. American journal of human genetics 88: 362–371.
- 4. Ostergaard P, Simpson MA, Brice G, Mansour S, Connell FC, et al. (2011) Rapid identification of mutations in GJC2 in primary lymphoedema using whole exome sequencing combined with linkage analysis with delineation of the phenotype. J Med Genet 48: 251–255.
- 5. Sirmaci A, Walsh T, Akay H, Spiliopoulos M, Sakalar YB, et al. (2010) MASP1 mutations in patients with facial, umbilical, coccygeal, and auditory findings of Carnevale, Malpuech, OSA, and Michels syndromes. American journal of human genetics 87: 679–686.
- 6. Walsh T, Shahin H, Elkan-Miller T, Lee MK, Thornton AM, et al. (2010) Whole exome sequencing and homozygosity mapping identify mutation in the cell polarity protein GPSM2 as the cause of nonsyndromic hearing loss DFNB82. American journal of human genetics 87: 90–94.
- 7. Stitziel NO, Kiezun A, Sunyaev S (2011) Computational and statistical approaches to analyzing variants identified by exome sequencing. Genome biology 12: 227.
- 8. Bamshad MJ, Ng SB, Bigham AW, Tabor HK, Emond MJ, et al. (2011) Exome sequencing as a tool for Mendelian disease gene discovery. Nature reviews Genetics 12: 745–755.
- 9. Montenegro G, Powell E, Huang J, Speziani F, Edwards YJ, et al. (2011) Exome sequencing allows for rapid gene identification in a Charcot-Marie-Tooth family. Annals of neurology 69: 464–470.
- 10. Ng SB, Bigham AW, Buckingham KJ, Hannibal MC, McMillin MJ, et al. (2010) Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nature genetics 42: 790–793.
- 11. Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, et al. (2010) Exome sequencing identifies the cause of a mendelian disorder. Nature genetics 42: 30–35.
- 12. Singleton AB (2011) Exome sequencing: a transformative technology. Lancet neurology 10: 942–946.
- 13. Li H, Ruan J, Durbin R (2008) Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome research 18: 1851–1858.
- 14. Sayers EW, Barrett T, Benson DA, Bolton E, Bryant SH, et al. (2010) Database resources of the National Center for Biotechnology Information. Nucleic acids research 38: D5–16.
- 15. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, et al. (2010) A method and server for predicting damaging missense mutations. Nature methods 7: 248–249.
- 16. Sanborn JZ, Benz SC, Craft B, Szeto C, Kober KM, et al. (2011) The UCSC Cancer Genomics Browser: update 2011. Nucleic acids research 39: D951–959.
- 17. Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, et al. (2000) Comparative protein structure modeling of genes and genomes. Annual review of biophysics and biomolecular structure 29: 291–325.
- 18. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research 25: 3389–3402.
- 19. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, et al. (2000) The Protein Data Bank. Nucleic acids research 28: 235–242.
- 20. Asan NF, Xu Y, Jiang H, Tyler-Smith C, Xue Y, et al. (2011) Comprehensive comparison of three commercial human whole-exome capture platforms. Genome biology 12: R95.
- 21. Zuchner S, Dallman J, Wen R, Beecham G, Naj A, et al. (2011) Whole-exome sequencing links a variant in DHDDS to retinitis pigmentosa. American journal of human genetics 88: 201–206.
- 22. Li B, Krishnan VG, Mort ME, Xin F, Kamati KK, et al. (2009) Automated inference of molecular mechanisms of disease from amino acid substitutions. Bioinformatics 25: 2744–2750.
- 23. Wang Y, Zhou J, Ye X, Wan Y, Li Y, et al. (2010) ZNF424, a novel human KRAB/C2H2 zinc finger protein, suppresses NFAT and p21 pathway. BMB reports 43: 212–218.
- 24. Ashkenazy H, Erez E, Martz E, Pupko T, Ben-Tal N (2010) ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res 38: W529–533.
- 25. Zhang W, Xu C, Bian C, Tempel W, Crombet L, et al. (2011) Crystal structure of the Cys2His2-type zinc finger domain of human DPF2. Biochemical and biophysical research communications 413: 58–61.
- 26. Wolfe SA, Nekludova L, Pabo CO (2000) DNA recognition by Cys2His2 zinc finger proteins. Annual review of biophysics and biomolecular structure 29: 183–212.
- 27. Lee JS, Galvin KM, Shi Y (1993) Evidence for physical interaction between the zinc-finger transcription factors YY1 and Sp1. Proceedings of the National Academy of Sciences of the United States of America 90: 6145–6149.
- 28. Perkins ND, Agranoff AB, Pascal E, Nabel GJ (1994) An interaction between the DNA-binding domains of RelA(p65) and Sp1 mediates human immunodeficiency virus gene activation. Molecular and cellular biology 14: 6570–6583.
- 29. Merika M, Orkin SH (1995) Functional synergy and physical interactions of the erythroid transcription factor GATA-1 with the Kruppel family proteins Sp1 and EKLF. Molecular and cellular biology 15: 2437–2447.
- 30. Segal DJ, Crotty JW, Bhakta MS, Barbas CF 3rd, Horton NC (2006) Structure of Aart, a designed six-finger zinc finger peptide, bound to DNA. Journal of molecular biology 363: 405–421.
- 31. Jantz D, Berg JM (2004) Reduction in DNA-binding affinity of Cys2His2 zinc finger proteins by linker phosphorylation. Proceedings of the National Academy of Sciences of the United States of America 101: 7589–7593.
- 32. Laity JH, Dyson HJ, Wright PE (2000) DNA-induced alpha-helix capping in conserved linker sequences is a determinant of binding affinity in Cys(2)-His(2) zinc fingers. Journal of molecular biology 295: 719–727.
- 33. Thukral SK, Morrison ML, Young ET (1991) Alanine scanning site-directed mutagenesis of the zinc fingers of transcription factor ADR1: residues that contact DNA and that transactivate. Proceedings of the National Academy of Sciences of the United States of America 88: 9188–9192.
- 34. Choo Y, Klug A (1993) A role in DNA binding for the linker sequences of the first three zinc fingers of TFIIIA. Nucleic acids research 21: 3341–3346.
- 35. Charizopoulou N, Lelli A, Schraders M, Ray K, Hildebrand MS, et al. (2011) Gipc3 mutations associated with audiogenic seizures and sensorineural hearing loss in mouse and human. Nature communications 2: 201.
- 36. Doyle DA, Lee A, Lewis J, Kim E, Sheng M, et al. (1996) Crystal structures of a complexed and peptide-free membrane protein-binding domain: molecular basis of peptide recognition by PDZ. Cell 85: 1067–1076.
- 37. Songyang Z, Fanning AS, Fu C, Xu J, Marfatia SM, et al. (1997) Recognition of unique carboxyl-terminal motifs by distinct PDZ domains. Science 275: 73–77.
- 38. Ahmed ZM, Morell RJ, Riazuddin S, Gropman A, Shaukat S, et al. (2003) Mutations of MYO6 are associated with recessive deafness, DFNB37. Am J Hum Genet 72: 1315–1322.
- 39. Avraham KB, Hasson T, Steel KP, Kingsley DM, Russell LB, et al. (1995) The mouse Snell's waltzer deafness gene encodes an unconventional myosin required for structural integrity of inner ear hair cells. Nat Genet 11: 369–375.
- 40. Reed BC, Cefalu C, Bellaire BH, Cardelli JA, Louis T, et al. (2005) GLUT1CBP(TIP2/GIPC1) interactions with GLUT1 and myosin VI: evidence supporting an adapter function for GLUT1CBP. Mol Biol Cell 16: 4183–4201.
- 41. Rehman AU, Gul K, Morell RJ, Lee K, Ahmed ZM, et al. (2011) Mutations of GIPC3 cause nonsyndromic hearing loss DFNB72 but not DFNB81 that also maps to chromosome 19p. Human genetics.