High-throughput functional analysis of human cancer-associated mutations the utilize of frightful editors

  • Money
  • April 12, 2022
  • No Comment

Summary

Comprehensive phenotypic characterization of the varied mutations learned in cancer tissues is one in every of the supreme challenges in cancer genomics. On this scrutinize, we evaluated the functional outcomes of 29,060 cancer-connected transition mutations that outcome in protein variants on the survival and proliferation of non-tumorigenic lung cells the utilize of cytosine and adenine frightful editors and single files RNA (sgRNA) libraries. By monitoring frightful enhancing efficiencies and outcomes the utilize of surrogate plan sequences paired with sgRNA-encoding sequences on the lentiviral provide effect, we identified sgRNAs that prompted a single predominant protein variant per sgRNA, enabling linking these mutations to the mobile phenotypes brought about by frightful enhancing. The capabilities of the excellent majority of the protein variants (28,458 variants, 98%) had been classified as neutral or most likely neutral; easiest 18 (0.06%) and 157 (0.5%) variants brought on outgrowing and most likely outgrowing phenotypes, respectively. We rely on that our plan will also be prolonged to extra variants of unknown significance and other tumor sorts.

Here’s a preview of subscription snarl

Accumulate admission to alternate choices

Subscribe to Nature+

Accumulate on the spot online fetch admission to to your entire Nature household of 50+ journals

Subscribe to Journal

Accumulate fat journal fetch admission to for 1 365 days

$99.00

easiest $8.25 per inform

All prices are NET prices.
VAT will most likely be added later within the checkout.
Tax calculation will most likely be finalised all over checkout.

Make a selection article

Accumulate time restricted or fat article fetch admission to on ReadCube.

$32.00

All prices are NET prices.

Info availability

We beget submitted the deep sequencing files from this scrutinize to the National Heart of Biotechnology Info’s Sequence Learn Archive below accession amount PRJNA667758. We beget provided the datasets aged on this scrutinize as Supplementary Tables 2–4 and deepcrispr.data/BEvariants.

Code availability

The custom Python scripts aged for the era of the MAGeCK enter file the utilize of UMIs are on hand on GitHub (https://github.com/oreolic/CancerLibrary).

References

  1. McLendon, R. et al. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455, 1061–1068 (2008).

    CAS 

    Google Pupil 

  2. Hudson, T. J. et al. International community of cancer genome initiatives. Nature 464, 993–998 (2010).

    CAS 
    PubMed 

    Google Pupil 

  3. Campbell, P. J. et al. Pan-cancer diagnosis of entire genomes. Nature 578, 82–93 (2020).

    Google Pupil 

  4. Bailey, M. H. et al. Comprehensive characterization of cancer driver genes and mutations. Cell 173, 371–385 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  5. Sondka, Z. et al. The COSMIC Most cancers Gene Census: describing genetic dysfunction valid thru all human cancers. Nat. Rev. Most cancers 18, 696–705 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  6. Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2,658 cancer entire genomes. Nature 578, 102–111 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  7. Stratton, M. R., Campbell, P. J. & Futreal, P. A. The cancer genome. Nature 458, 719 (2009).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  8. Giacomelli, A. O. et al. Mutational processes shape the panorama of TP53 mutations in human cancer. Nat. Genet. 50, 1381–1387 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  9. Kotler, E. et al. A scientific p53 mutation library hyperlinks differential functional impact to cancer mutation sample and evolutionary conservation. Mol. Cell 71, 178–190 (2018).

    CAS 
    PubMed 

    Google Pupil 

  10. Majithia, A. R. et al. Prospective functional classification of all that you might perchance be imagine missense variants in PPARG. Nat. Genet. 48, 1570–1575 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  11. Brenan, L. et al. Phenotypic characterization of a entire position of MAPK1/ERK2 missense mutants. Cell Bag. 17, 1171–1183 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  12. Ahler, E. et al. A combined plan unearths a regulatory mechanism coupling Src’s kinase process, localization, and phosphotransferase-self reliant capabilities. Mol. Cell 74, 393–408 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  13. Matreyek, Okay. A. et al. Multiplex review of protein variant abundance by hugely parallel sequencing. Nat. Genet. 50, 874–882 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  14. Starita, L. M. et al. A multiplex homology-directed DNA repair assay unearths the impact of extra than 1,000 BRCA1 missense substitution variants on protein characteristic. Am. J. Hum. Genet. 103, 498–508 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  15. Chiasson, M. A. et al. Multiplexed dimension of variant abundance and process unearths VKOR topology, active position and human variant impact. eLife 9, e58026 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  16. Kim, H. & Kim, J. S. A files to genome engineering with programmable nucleases. Nat. Rev. Genet. 15, 321–334 (2014).

    CAS 
    PubMed 

    Google Pupil 

  17. Findlay, G. M. et al. Correct classification of BRCA1 variants with saturation genome enhancing. Nature 562, 217–222 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  18. Kandoth, C. et al. Mutational panorama and significance valid thru 12 significant cancer sorts. Nature 502, 333–339 (2013).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  19. Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A. & Liu, D. R. Programmable enhancing of a plan frightful in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  20. Nishida, Okay. et al. Centered nucleotide enhancing the utilize of hybrid prokaryotic and vertebrate adaptive immune programs. Science 353, aaf8729 (2016).

  21. Gaudelli, N. M. et al. Programmable frightful enhancing of A•T to G•C in genomic DNA without DNA cleavage. Nature 551, 464–471 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  22. Kim, H. S. et al. Systematic identification of molecular subtype-selective vulnerabilities in non-little-cell lung cancer. Cell 155, 552–566 (2013).

    CAS 
    PubMed 

    Google Pupil 

  23. Bamford, S. et al. The COSMIC (Catalogue of Somatic Mutations in Most cancers) database and location. Br. J. Most cancers 91, 355–358 (2004).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  24. Hanna, R. E. et al. Massively parallel review of human variants with frightful editor shows. Cell 184, 1064–1080 (2021).

    CAS 
    PubMed 

    Google Pupil 

  25. Kuscu, C. et al. CRISPR-STOP: gene silencing thru frightful-enhancing-prompted nonsense mutations. Nat. Ideas 14, 710–712 (2017).

    CAS 
    PubMed 

    Google Pupil 

  26. Koblan, L. W. et al. Bettering cytidine and adenine frightful editors by expression optimization and ancestral reconstruction. Nat. Biotechnol. 36, 843–846 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  27. Song, M. et al. Sequence-voice prediction of the efficiencies of adenine and cytosine frightful editors. Nat. Biotechnol. 38, 1037–1043 (2020).

    CAS 
    PubMed 

    Google Pupil 

  28. Kim, H. Okay. et al. In vivo high-throughput profiling of CRISPR–Cpf1 process. Nat. Ideas 14, 153–159 (2017).

    CAS 
    PubMed 

    Google Pupil 

  29. Kim, H. Okay. et al. Deep finding out improves prediction of CRISPR–Cpf1 files RNA process. Nat. Biotechnol. 36, 239–241 (2018).

    CAS 
    PubMed 

    Google Pupil 

  30. Kim, H. Okay. et al. SpCas9 process prediction by DeepSpCas9, a deep finding out-primarily primarily based mannequin with high generalization performance. Sci. Adv. 5, eaax9249 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  31. Kim, H. Okay. et al. High-throughput diagnosis of the activities of xCas9, SpCas9-NG and SpCas9 at matched and mismatched plan sequences in human cells. Nat. Biomed. Eng. 4, 111–124 (2020).

    CAS 
    PubMed 

    Google Pupil 

  32. Kim, N. et al. Prediction of the sequence-voice cleavage process of Cas9 variants. Nat. Biotechnol. 38, 1328–1336 (2020).

    CAS 
    PubMed 

    Google Pupil 

  33. Kim, H. Okay. et al. Predicting the effectivity of prime enhancing files RNAs in human cells. Nat. Biotechnol. 39, 198–206 (2021).

    CAS 
    PubMed 

    Google Pupil 

  34. Hill, A. J. et al. On the attach of CRISPR-primarily primarily based single-cell molecular shows. Nat. Ideas 15, 271–274 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  35. Michlits, G. et al. CRISPR-UMI: single-cell lineage tracing of pooled CRISPR–Cas9 shows. Nat. Ideas 14, 1191–1197 (2017).

    CAS 
    PubMed 

    Google Pupil 

  36. Schmierer, B. et al. CRISPR/Cas9 screening the utilize of wierd molecular identifiers. Mol. Syst. Biol. 13, 945 (2017).

    PubMed 
    PubMed Central 

    Google Pupil 

  37. Arbab, M. et al. Determinants of frightful enhancing outcomes from plan library diagnosis and machine finding out. Cell 182, 463–480 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  38. Doench, J. G. et al. Optimized sgRNA attach to maximise process and lower off-plan outcomes of CRISPR–Cas9. Nat. Biotechnol. 34, 184–191 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  39. Li, W. et al. MAGeCK enables noteworthy identification of the biggest genes from genome-scale CRISPR/Cas9 knockout shows. Genome Biol. 15, 554 (2014).

    PubMed 
    PubMed Central 

    Google Pupil 

  40. Ghandi, M. et al. Subsequent-era characterization of the Most cancers Cell Line Encyclopedia. Nature 569, 503–508 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  41. Carter, H. et al. Most cancers-voice high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. Most cancers Res. 69, 6660–6667 (2009).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  42. Ng, P. C. & Henikoff, S. Predicting deleterious amino acid substitutions. Genome Res. 11, 863–874 (2001).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  43. Adzhubei, I. A. et al. One plan and server for predicting adverse missense mutations. Nat. Ideas 7, 248–249 (2010).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  44. Miosge, L. A. et al. Comparability of predicted and accurate penalties of missense mutations. Proc. Natl Acad. Sci. USA 112, E5189–E5198 (2015).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  45. Solar, S. et al. An prolonged position of yeast-primarily primarily based functional assays accurately identifies human illness mutations. Genome Res. 26, 670–680 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  46. Chen, H. et al. Comprehensive review of computational algorithms in predicting cancer driver mutations. Genome Biol. 21, 43 (2020).

    PubMed 
    PubMed Central 

    Google Pupil 

  47. Markusic, D., Oude-Elferink, R., Das, A. T., Berkhout, B. & Seppen, J. Comparability of single regulated lentiviral vectors with rtTA expression pushed by an autoregulatory loop or a constitutive promoter. Nucleic Acids Res. 33, e63 (2005).

    PubMed 
    PubMed Central 

    Google Pupil 

  48. Yi, S. A. et al. HPV-mediated nuclear export of HP1γ drives cervical tumorigenesis by downregulation of p53. Cell Death Differ. 27, 2537–2551 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  49. Eekels, J. J. M. et al. A competitive cell improve assay for the detection of refined outcomes of gene transduction on cell proliferation. Gene Ther. 19, 1058–1064 (2012).

    CAS 
    PubMed 

    Google Pupil 

  50. Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the following era. Cell 144, 646–674 (2011).

    CAS 

    Google Pupil 

  51. Hanahan, D. & Weinberg, R. A. The hallmarks of cancer. Cell 100, 57–70 (2000).

    CAS 
    PubMed 

    Google Pupil 

  52. Sequist, L. V. et al. Genotypic and histological evolution of lung cancers buying resistance to EGFR inhibitors. Sci. Transl. Med. 3, 75ra26 (2011).

    PubMed 
    PubMed Central 

    Google Pupil 

  53. Ganesan, P. et al. Epidermal improve ingredient receptor P753S mutation in cutaneous squamous cell carcinoma conscious about cetuximab-primarily primarily based therapy. J. Clin. Oncol. 34, e34–e37 (2016).

    PubMed 

    Google Pupil 

  54. Stabile, L. P. et al. Mixed focusing on of the estrogen receptor and the epidermal improve ingredient receptor in non-little cell lung cancer reveals enhanced antiproliferative outcomes. Most cancers Res. 65, 1459–1470 (2005).

    CAS 
    PubMed 

    Google Pupil 

  55. Landrum, M. J. et al. ClinVar: improvements to having access to files. Nucleic Acids Res. 48, D835–D844 (2020).

    CAS 
    PubMed 

    Google Pupil 

  56. Chen, Y. et al. PHLDA1, one other PHLDA household protein that inhibits Akt. Most cancers Sci. 109, 3532–3542 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  57. Nagai, M. A. Pleckstrin homology-admire arena, household A, member 1 (PHLDA1) and cancer. Biomed. Bag. 4, 275–281 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  58. Botti, E. et al. Developmental ingredient IRF6 shows tumor suppressor process in squamous cell carcinomas. Proc. Natl Acad. Sci. USA 108, 13710–13715 (2011).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  59. Jobling, R. et al. Monozygotic twins with variable expression of Van der Woude syndrome. Am. J. Med. Genet. A 155A, 2008–2010 (2011).

    PubMed 

    Google Pupil 

  60. Stupack, D. G. Caspase-8 as a therapeutic plan in cancer. Most cancers Lett. 332, 133–140 (2013).

    CAS 
    PubMed 

    Google Pupil 

  61. Jia, D. et al. Crebbp loss drives little cell lung cancer and increases sensitivity to HDAC inhibition. Most cancers Discov. 8, 1422–1437 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  62. Pasqualucci, L. et al. Inactivating mutations of acetyltransferase genes in B-cell lymphoma. Nature 471, 189–195 (2011).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  63. Cuella-Martin, R. et al. Helpful interrogation of DNA damage response variants with frightful enhancing shows. Cell 184, 1081–1097 (2021).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  64. Sánchez-Rivera, F. J. et al. Snide enhancing sensor libraries for prime-throughput engineering and functional diagnosis of cancer-associated single nucleotide variants. Nat. Biotechnol. https://doi.org/10.1038/s41587-021-01172-3 (2022).

  65. Kim, Y. B. et al. Increasing the genome-focusing on scope and precision of frightful enhancing with engineered Cas9-cytidine deaminase fusions. Nat. Biotechnol. 35, 371–376 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  66. Ran, F. A. et al. In vivo genome enhancing the utilize of Staphylococcus aureus Cas9. Nature 520, 186–191 (2015).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  67. Li, X. et al. Snide enhancing with a Cpf1–cytidine deaminase fusion. Nat. Biotechnol. 36, 324–327 (2018).

    CAS 
    PubMed 

    Google Pupil 

  68. Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a category 2 CRISPR–Cas system. Cell 163, 759–771 (2015).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  69. Nishimasu, H. et al. Engineered CRISPR–Cas9 nuclease with expanded focusing on space. Science 361, 1259–1262 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  70. Hu, J. H. et al. Developed Cas9 variants with big PAM compatibility and high DNA specificity. Nature 556, 57–63 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  71. Anders, C., Bargsten, Okay. & Jinek, M. Structural plasticity of PAM recognition by engineered variants of the RNA-guided endonuclease Cas9. Mol. Cell 61, 895–902 (2016).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  72. Kleinstiver, B. P. et al. Engineered CRISPR–Cas9 nucleases with altered PAM specificities. Nature 523, 481–485 (2015).

    PubMed 
    PubMed Central 

    Google Pupil 

  73. Walton, R. T., Christie, Okay. A., Whittaker, M. N. & Kleinstiver, B. P. Unconstrained genome focusing on with attain-PAMless engineered CRISPR–Cas9 variants. Science 368, 290–296 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  74. Zhou, C. et al. Off-plan RNA mutation prompted by DNA frightful enhancing and its elimination by mutagenesis. Nature 571, 275–278 (2019).

    CAS 
    PubMed 

    Google Pupil 

  75. Thuronyi, B. W. et al. Exact evolution of frightful editors with expanded plan compatibility and improved process. Nat. Biotechnol. 37, 1070–1079 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  76. Richter, M. F. et al. Phage-assisted evolution of an adenine frightful editor with improved Cas arena compatibility and process. Nat. Biotechnol. 38, 883–891 (2020).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  77. Gaudelli, N. M. et al. Directed evolution of adenine frightful editors with increased process and therapeutic utility. Nat. Biotechnol. 38, 892–900 (2020).

    CAS 
    PubMed 

    Google Pupil 

  78. Kurt, I. C. et al. CRISPR C-to-G frightful editors for inducing focused DNA transversions in human cells. Nat. Biotechnol. 39, 41–46 (2021).

    CAS 
    PubMed 

    Google Pupil 

  79. Zhao, D. et al. Glycosylase frightful editors permit C-to-A and C-to-G frightful changes. Nat. Biotechnol. 39, 35–40 (2021).

    CAS 
    PubMed 

    Google Pupil 

  80. Hanson, G. & Coller, J. Codon optimality, bias and usage in translation and mRNA decay. Nat. Rev. Mol. Cell Biol. 19, 20–30 (2018).

    CAS 
    PubMed 

    Google Pupil 

  81. Sanjana, N. E., Shalem, O. & Zhang, F. Improved vectors and genome-extensive libraries for CRISPR screening. Nat. Ideas 11, 783–784 (2014).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  82. Meier, J. A., Zhang, F. & Sanjana, N. E. GUIDES: sgRNA attach for loss-of-characteristic shows. Nat. Ideas 14, 831–832 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  83. Ramirez, R. D. et al. Immortalization of human bronchial epithelial cells within the absence of viral oncoproteins. Most cancers Res. 64, 9027–9034 (2004).

    CAS 
    PubMed 

    Google Pupil 

  84. Ellis, B. L., Potts, P. R. & Porteus, M. H. Rising increased titer lentivirus with caffeine. Hum. Gene Ther. 22, 93–100 (2011).

    CAS 
    PubMed 

    Google Pupil 

  85. Dang, Y. et al. Optimizing sgRNA construction to enhance CRISPR–Cas9 knockout effectivity. Genome Biol. 16, 280 (2015).

    PubMed 
    PubMed Central 

    Google Pupil 

  86. Shalem, O. et al. Genome-scale CRISPR–Cas9 knockout screening in human cells. Science 343, 84–87 (2014).

    CAS 

    Google Pupil 

  87. Billon, P. et al. CRISPR-mediated frightful enhancing enables atmosphere pleasant disruption of eukaryotic genes thru induction of STOP codons. Mol. Cell 67, 1068–1079 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  88. Behan, F. M. et al. Prioritization of cancer therapeutic targets the utilize of CRISPR–Cas9 shows. Nature 568, 511–516 (2019).

    CAS 
    PubMed 

    Google Pupil 

  89. Hart, T., Brown, Okay. R., Sircoulomb, F., Rottapel, R. & Moffat, J. Measuring error charges in genomic perturbation shows: gold requirements for human functional genomics. Mol. Syst. Biol. 10, 733 (2014).

    PubMed 
    PubMed Central 

    Google Pupil 

  90. Martincorena, I. et al. Standard patterns of varied in cancer and somatic tissues. Cell 171, 1029–1041 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  91. Kandasamy, Okay. et al. NetPath: a public helpful resource of curated signal transduction pathways. Genome Biol. 11, R3 (2010).

    PubMed 
    PubMed Central 

    Google Pupil 

  92. Wang, G. & Fersht, A. R. Mechanism of initiation of aggregation of p53 printed by Φ-price diagnosis. Proc. Natl Acad. Sci. USA 112, 2437-2442 (2015).

  93. Zhao, D. et al. Combinatorial CRISPR–Cas9 metabolic shows instruct significant redox get rid of watch over factors dependent on the KEAP1–NRF2 regulatory axis. Mol. Cell 69, 699–708 (2018).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  94. Clement, Okay. et al. CRISPResso2 provides appropriate and rapidly genome enhancing sequence diagnosis. Nat. Biotechnol. 37, 224–226 (2019).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  95. Smith, T., Heger, A. & Sudbery, I. UMI-instruments: modeling sequencing errors in odd molecular identifiers to enhance quantification accuracy. Genome Res. 27, 491–499 (2017).

    CAS 
    PubMed 
    PubMed Central 

    Google Pupil 

  96. Zhu, S. et al. Info RNAs with embedded barcodes improve CRISPR-pooled shows. Genome Biol. 20, 20 (2019).

    PubMed 
    PubMed Central 

    Google Pupil 

  97. Xu, P. et al. Genome-extensive interrogation of gene capabilities thru frightful editor shows empowered by barcoded sgRNAs. Nat. Biotechnol. 39, 1403–1413 (2021).

Accumulate references

Acknowledgements

We thank J. W. Choi for aiding with computational diagnosis. This work used to be supported, partly, by the National Learn Foundation of Korea (grants 2017R1A2B3004198 (H.H.Okay.), 2017M3A9B4062403 (H.H.Okay.) and 2018R1A5A2025079 (H.H.Okay)); the Brain Korea 21 Plus Project (Yonsei College College of Remedy); the Yonsei Signature Learn Cluster Program of 2021-22-0014 (H.H.Okay.); a grant of the MD-PhD/Scientific Scientist Practising Program (S.L.) thru the Korea Neatly being Commerce Vogue Institute (KHIDI), funded by the Ministry of Neatly being & Welfare, Republic of Korea; Lung Most cancers SPORE P50 (CA070907; J.D.M.); and the Korean Neatly being Skills R&D Project, Ministry of Neatly being and Welfare, Republic of Korea (grant HI21C1314 (H.H.Okay.)).

Author files

Author notes

  1. These authors contributed equally: Younggwang Kim, Seungho Lee.

Affiliations

  1. Department of Pharmacology, Yonsei College College of Remedy, Seoul, Republic of Korea

    Younggwang Kim, Seungho Lee, Soohyuk Cho, Jinman Park, Dongwoo Chae & Hyongbum Henry Kim

  2. Graduate College of Scientific Science, Brain Korea 21 Plus Project for Scientific Sciences, Yonsei College College of Remedy, Seoul, Republic of Korea

    Younggwang Kim, Soohyuk Cho, Jinman Park & Hyongbum Henry Kim

  3. Department of Utilized Statistics, Yonsei College, Seoul, Republic of Korea

    Taeyoung Park

  4. Hamon Heart for Therapeutic Oncology Learn, College of Texas Southwestern Scientific Heart, Dallas, TX, USA

    John D. Minna

  5. Severance Biomedical Science Institute, Yonsei College College of Remedy, Seoul, Republic of Korea

    Hyongbum Henry Kim

  6. Heart for Nanomedicine, Institute for Traditional Science (IBS), Seoul, Republic of Korea

    Hyongbum Henry Kim

  7. Yonsei-IBS Institute, Yonsei College, Seoul, Republic of Korea

    Hyongbum Henry Kim

  8. Institute for Immunology and Immunological Ailments, Yonsei College College of Remedy, Seoul, Republic of Korea

    Hyongbum Henry Kim

Contributions

Y.Okay., S.L. and H.H.Okay. conceived and designed the scrutinize. Y.Okay. and S.L. performed a entire lot of the experiments. J.P. seriously contributed to computational diagnosis. S.C. seriously assisted within the moist experiments. Y.Okay. and S.L. analyzed the files per comments of H.H.Okay. J.D.M. generated and provided HBEC30KT-shTP53 cells (P cells). D.C. and T.P. contributed to the mathematical diagnosis (Supplementary Point to 2). Y.Okay. and H.H.Okay. wrote the manuscript with enter from all authors.

Corresponding author

Correspondence to
Hyongbum Henry Kim.

Ethics declarations

Competing interests

Yonsei College has filed a patent utility per this work, by which Y.Okay., S.L. and H.H.Okay. are listed as inventors. J.D.M. receives licensing prices from the National Institutes of Neatly being and the College of Texas Southwestern Scientific Heart for distributing human cell lines. Your entire other authors repeat no competing interests.

Gape analysis

Gape analysis files

Nature Biotechnology thanks the nameless reviewers for their contribution to the gawk analysis of this work.

Extra files

Publisher’s instruct Springer Nature stays neutral with reference to jurisdictional claims in published maps and institutional affiliations.

Prolonged files

Prolonged Info Fig. 1 Exon transcript profiles of P cells.

a, Expression of TP53 mRNA in P cells and HBEC30KT cells. FPKM, fragments per kilobase of transcript per million. Boxplots are represented for n = 3 biologically self reliant samples as follows: heart line of box indicating the median, box limits indicating the upper and lower quartile; whiskers point to the 1.5 cases interquartile vary. b, Gene position enrichment diagnosis (GSEA) of exon transcript profiles of HBEC30KT, P cells, and HCC4017, a lung cancer cell line. The single sample GSEA rating (ssGSEA rating) represents the level to which the genes in a particular gene position are up- or down-regulated interior the sample. RNA expression files had been retrieved from Kim et al34.

Prolonged Info Fig. 2 Skills of libraries C and A.

a, The system of deciding on sgRNA-plan pairs for the era of libraries C and A. SNVs, single nucleotide variants; sgRNA, single files RNA. b, Skills of lentiviral libraries of sgRNA-encoding and plan sequence pairs with odd molecular identifiers (UMIs). Oligonucleotides containing a 20-nt files sequence, and the corresponding plan sequence had been synthesized and cloned into the pLenti-gRNA-puro vector to produce plasmid library 1. The plasmids had been then digested with BsmBI restriction enzyme and ligated with fragments containing the sgRNA scaffold sequences and UMIs to produce plasmid library 2. Lentiviral libraries generated from plasmid library 2 had been then transduced into cells expressing cytosine frightful editor (CBE) or adenine frightful editor (ABE) in a doxycycline-inducible system.

Prolonged Info Fig. 3 Snide enhancing efficiencies and indel frequencies at built-in plan sequences.

Snide enhancing efficiencies measured at each space within the indicated space for plan nucleotide Cs (a) or As (b) in built-in surrogate plan sequences. Place 1 is the 5’ kill of the plan sequence and space 20 is straight away upstream of the NGG PAM. The numbers of analyzed plan sequences (n) are as follows: n = 5,865 (space −4), 5,393 (space −3), 5,782 (space -2), 5,815 (space -1), 5,292 (space 1), 5,614 (space 2), 5,697, 6,394, 10,586, 9,382, 8,837, 5,421, 6,130, 5,339, 5,541, 5,796, 5,058, 5,723, 5,955, 5,348, 5,779, 5,437, 4,884, 5,502 (space 20) for ABE (a); n = 19,475 (space -4), 20,753 (space -3), 20,110 (space -2), 19,425 (space -1), 19,984 (space 1), 20,004 (space 2), 17,873, 24,870, 35,421, 33,186, 32,807, 19,895, 19,195, 20,227, 19,549, 18,986, 20,367, 18,793, 18,361, 20,478, 19,605, 20,975, 21,542, 22,952 (space 20) for CBE (b). Boxplots are represented as follows: heart line of box indicating the median, box limits indicating the upper and lower quartile; whiskers point to the 10th and 90th percentiles. Outliers are shown the utilize of dots. c, Indel frequencies measured 10 days after the transduction of sgRNA plan pairs. The different of analyzed plan sequence is indicated on the kill of every dataset. (n = 62,000 (Library C, Replicate 1), 77,201 (Library C, Replicate 2), 21,617 (Library A, Replicate 1) and 20,913 (Library A, Replicate 2). Boxplots are represented as follows: heart white dot of box indicating the median, box limits indicating the upper and lower quartile; the distributions of indel frequencies are represented with kernel densities. d, Nonsynonymous frightful enhancing efficiencies on the built-in plan sequences of synonymous get rid of watch over sgRNAs and other sgRNAs within the given datasets. The different of synonymous and other sgRNAs are as follows; 431 and 21,055 (Library A, Replicate 1), 413 and 20,372 (Library A, Replicate 2), 2,272 and 59,390 (Library C, Replicate 1), and a pair of,795 and 73,691 (Library C, replicate 2), respectively. Boxplots are represented as follows: heart line of box indicating the median, box limits indicating the upper and lower quartile; whiskers point to the 1.5 cases interquartile vary.

Prolonged Info Fig. 4 Efficiency of high-throughput reports.

a, Distribution of median normalized log fold changes (LFCs) of 338 sgRNAs focusing on the biggest genes relying on the nonsynonymous frightful enhancing efficiencies sure on the built-in plan sequences in library C2. NT, nontargeting sgRNAs. The different of sgRNAs n = 359 (NT), 5 (60%). Boxplots are represented as follows: heart line of box indicating the median, box limits indicating the upper and lower quartile; whiskers point to the 1.5 cases interquartile vary. (in contrast with NT, student’s t-check; NS, now now not considerable, *P = 1.5 × 10−4, P = 2.2 × 10−32). b,c, Receiver working attribute-space below the curve (ROC-AUC) diagnosis of LFCs for sgRNAs predicted to induce halt codons in well-liked the biggest genes versus nontargeting sgRNAs in library C2 (b) and library C (c) at rising thresholds of nonsynonymous frightful enhancing efficiencies. AUC values are indicated in parentheses. d, ROC-AUC diagnosis of LFCs for sgRNAs predicted to induce halt codons in well-liked the biggest genes versus nontargeting controls at rising thresholds of the different of UMIs in each sgRNA in library C. An space below curve for each UMI cutoff is shown within the parenthesis. e, Correlations between median LFCs of UMIs for sgRNAs and LFCs of UMI CPM (counts per million) for the identical sgRNAs in library C2. Crimson dots expose sgRNAs predicted to induce nonsense mutations in selected well-liked the biggest genes. The different of sgRNAs n = 3,229 (merged), 2,913 (other sgRNAs, blue dots), 217 (sgRNAs focusing on the biggest genes, purple dots), 99 (nontargeting sgRNAs, dark dots). Pearson correlation coefficients (r) are shown.

Prolonged Info Fig. 5 Invent of little libraries and reproducibility of frightful enhancing efficiencies the utilize of these libraries.

a-b, Invent of little libraries C1, C2, and A1 (a) and C3 and A2 (b). UMIs, odd molecular identifiers. c, Correlations between nonsynonymous frightful enhancing efficiencies on the built-in plan sequences of natural replicates. The coloration of every dot used to be resolute by the different of neighboring dots (that is, dots interior a distance that is three cases the radius of the dot). The frightful enhancing efficiencies had been sure ten days after the initial transduction of every library into P-C or P-A cells. Handiest sgRNAs with extra than 100 raw learn counts in each replicate had been included. Pearson correlation coefficients (r) are shown. The different of sgRNAs n = 3,181 (library C1), 3,063 (library C2), and 1,520 (library A1).

Prolonged Info Fig. 6 The different of protein variants generated by an sgRNA.

a, The proportion of sgRNAs that induce a predominant protein variant. The numbers of sgRNAs are indicated in parentheses. b, The different of considerable (frequency > 10%) protein variants generated by sgRNAs that induce extra than one protein variants without a predominant protein variant. The numbers of protein variants are indicated in parentheses.

Prolonged Info Fig. 7 Affiliation between computationally predicted capabilities of variants and measured capabilities of variants.

a, The ratings from driver detection algorithms (CTAT-cancer and CHASM) for 4,143 protein variants. The different of variants n = 15 (depleting), 39 (most likely depleting), 864 (perchance depleting), 2,141 (neutral), 1,056 (perchance outgrowing), 25 (most likely outgrowing), and 3 (outgrowing). b, The ratings from algorithms that predict the functional outcomes of variants (SIFT and PolyPhen-2) for 3,899 protein variants. The different of variants n = 12 (depleting), 38 (most likely depleting), 807 (perchance depleting), 2,009 (neutral), 1,008 (perchance outgrowing), 22 (most likely outgrowing), and 3 (outgrowing). c,d, Distribution of SIFT ratings (c) and PolyPhen-2 ratings (d) for missense variants in well-liked the biggest genes per the LFC in library C. The different of variants n = 10 (

Prolonged Info Fig. 8 Allele frequency monitoring after transduction of sgRNA-encoding sequences into P-C or P-A cells.

sgRNA-encoding lentivirus used to be transduced into P-C and P-A cells at day 0 and doxycycline used to be added to induce expression of CBE and ABE, respectively, and maintained except day 10, after which doxycycline used to be eliminated. The functional classification outcomes obtained from the high-throughput experiments and these from these person experiments are shown in purple and green, respectively, on the kill of every graph. The mean values of two self reliant samples are indicated.

Prolonged Info Fig. 9 The outcomes of competitive proliferation assays.

a, An example for scoot cytometry gating method aged within the competitive proliferation assays. b, Mean relative enrichment values ± abnormal deviation of three replicates. Pupil’s t check used to be performed below the null hypothesis that the proportions of sgRNA-transduced and nontargeting sgRNA-transduced cells will most likely be the identical. Two nontargeting sgRNAs had been aged because the get rid of watch over and the mean values of relative enrichment had been aged because the get rid of watch over.

Prolonged Info Fig. 10 Notable gene groups associated with outgrowing/most likely outgrowing and depleting/most likely depleting sgRNAs and variants.

a, (Left panel) The part of functionally classified sgRNAs (prime) focusing on cancer gene census (CGC)5 genes and predominant protein variants (backside) encoded by CGC genes within the outgrowing and most likely outgrowing groups. Outcomes from all libraries excluding library eC had been combined. P-values from two-sided Fisher’s accurate check are shown. The different of sgRNAs or variants both focusing on or encoded by CGC genes amongst all sgRNAs or variants in each team are shown on the x-axes. (Horny panel) Detailed distribution of sgRNAs predicted to introduce mutations in CGC genes (prime) and variants generated in CGC genes (backside). The different of sgRNAs or variants equivalent to each gene is specified in parentheses. b, The part of functionally classified sgRNAs (left) focusing on Depmap well-liked the biggest genes (CEGs) and protein variants (lawful) encoded by CEGs within the depleting and most likely depleting groups. Outcomes from all libraries excluding library eC had been combined. P-values from two-sided Fisher’s accurate check are shown. The numbers of sgRNAs or variants both focusing on or encoded by CEG genes amongst all sgRNAs or variants in each team are shown on the x-axes.

Supplementary files

Supplementary Info

Supplementary Notes 1 and a pair of and Supplementary Figs. 1–7

Supplementary Desk

Supplementary Desk 1. Composition of sgRNA-encoding libraries C, A, C1, C2, C3, A1, A2, dA and eC. Barcode sequences aged for sorting, sgRNA sequences and plan sequences, at the side of neighboring sequences (5′-neighboring sequence (4 bp) + plan sequence (20 bp + 3-bp PAM = 23 bp) + 3′-neighboring sequence (3 bp) = 30 bp of genomic DNA sequence). Info about intended mutations and DeepCBE or DeepABE effectivity ratings are moreover included (provided as a separate Excel file). Supplementary Desk 2. The outcomes of MAGeCK analyses. RPM of four replicateUMI, LFCs, median LFCs (mLFCs), clear or detrimental MAGeCK RRA P values and LFCs of UMI CPM are shown for each sgRNA (provided as a separate Excel file). Supplementary Desk 3. Helpful classifications of sgRNAs and protein variants. a, Helpful classification of sgRNAs per the proliferation and survival (sheet 1). b, Helpful classification of sgRNAs in library eC (sheet 2). c, Snide enhancing outcomes and allele frequencies on the built-in plan sequences (dependency on EGF signaling) (sheet 3). d, Doable classification of sgRNAs with low frightful enhancing efficiencies (sheet 4) (provided as a separate Excel file). Supplementary Desk 4. Outcomes of allele frequency monitoring after provide of an person sgRNA for 20 selected sgRNAs. After lentiviral transduction of the required person sgRNA, protein variant frequencies had been calculated from DNA sequence diagnosis. Endogenous DNA sequence variants encoding the identical amino acid change had been combined into one protein variant (provided as a separate Excel file). Supplementary Desk 5. Oligonucleotides aged on this scrutinize (provided in a separate Excel file).

About this text

Cite this text

Kim, Y., Lee, S., Cho, S. et al. High-throughput functional analysis of human cancer-associated mutations the utilize of frightful editors.
Nat Biotechnol (2022). https://doi.org/10.1038/s41587-022-01276-4

Accumulate citation

  • Bought:

  • Authorized:

  • Revealed:

  • DOI: https://doi.org/10.1038/s41587-022-01276-4

Related post

Leave a Reply

Your email address will not be published.