echemi logo
Product
  • Product
  • Supplier
  • Inquiry
    Home > Biochemistry News > Biotechnology News > PGG.SV of the Human Genome Structural Variation Database and Computational Analysis Platform

    PGG.SV of the Human Genome Structural Variation Database and Computational Analysis Platform

    • Last Update: 2022-10-25
    • Source: Internet
    • Author: User
    Search more information of high quality chemicals, good prices and reliable suppliers, visit www.echemi.com
      

    On October 16, 2022, the internationally renowned academic journal Nucleic Acids Research published online the PGG.
    SV of the human genome structural variation database jointly developed by Professor Xu Shuhua's team from the School of Life Sciences/Institute of Human Phenomics of Fudan University, Zhang Guoqing, researcher from the Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, and researcher Fan Shaohua from the School of Life Sciences, Fudan University PGGSV/), titled "PGG.
    SV: A whole-genome-sequencing-based structural variant resource and data analysis platform"
    .
    By collecting whole genome sequencing data of the global population, the database focuses on the mining and integration of genomic structural variation data, providing a comprehensive platform
    for data acquisition, information query and online analysis for the study of human genome structural variation.

    Genomic structural variants (SVs) mainly include DNA deletion, insertion, fragment duplication and other variant types of large fragments on the genome, and a large number of studies have shown that SV is related to a variety of complex genetic diseases such as cancer, autism, and neurodevelopmental disorders, and has received continuous attention
    in the field of medicine and genetics in recent years.
    With the advancement and popularization of genome sequencing technology, a large number of structural variants have been continuously discovered and studied, and some structural variants with strong pathogenicity have gradually been verified
    .
    The research team aims to construct a representative and diverse dataset of genomic structural variation in healthy people, on the one hand, to provide reliable control samples for the study of structural variation in patients with genetic diseases, and on the other hand, to annotate and predict the function of variation, which will also effectively narrow the screening scope of pathogenic mutations and provide effective guidance and assistance
    to researchers in related fields.

    Due to the significant differences and diversity of structural variants among different regions and ethnic groups, and the existing databases and public datasets each adopt different analysis processes, there has been a lack of a structural variation resource and analysis platform with population samples and next-generation sequencing data, especially the coverage of East Asian population samples is seriously insufficient
    .
    The research team integrated large-scale sequencing data, including 6,048 whole genome sequencing data from 177 representative regions and ethnic groups around the world, especially the in-depth analysis of China's rich ethnic diversity characteristics, covering 50 ethnic minorities
    in China for the first time.
    As of publication, the database contains a total of 584,277 structural variants, which will continue to be added
    in the future.
    In addition, PGG.
    SV includes the third-generation long-reads sequencing data for the first time, which has greater advantages in the detection of structural variations, especially in the detection and determination of insertion sequences, and its effect is significantly better than second-generation sequencing technology
    .
    Previous databases of large-scale structural variants were built
    on next-generation sequencing or gene chip data.
    The research team generated and collected 1,030 third-generation sequencing genomes, and for the first time used the combination of third-generation sequencing and second-generation sequencing to construct a structural variation database, which greatly improved the quantity and quality
    of structural variation detection results.


    Figure 1 PGG.
    Schematic diagram of SV data processing flow

    In terms of database function, PGG.
    SV provides a concise and friendly query function, providing accurate display of the genomic location of different population structural variations, as well as statistical information
    such as frequency differences between various ethnic groups around the world.
    Taking advantage of the previous accumulation of the research group, PGG.
    SV linked with PGG.
    SNV and other databases previously developed by Professor Xu Shuhua's team, and combined the detailed results of single nucleotide variation (SNV) with structural variation with the help of linkage imbalance and genomic spatial location information to enhance the analysis function
    of data diversity 。 In addition, PGG.
    SV provides rich clinical effect analysis and predictive analysis capabilities, providing prediction and enrichment analysis of genes and regulatory elements associated with structural variants, as well as tools for retrieving relevant structural variants by specific diseases and phenotypes, for users with
    clinical research needs.

    Finally, PGG.
    SV also supports rich online analysis and visualization capabilities
    .
    On the one hand, the research team provides a comparison and annotation of the structural variation results submitted by the user, so that the user can understand the difference between his own target sample and the control sample provided by the database; On the other hand, the research team also provides structural variation visualization capabilities, which can retrieve user-submitted DNA sequences on the human genome, display the genomic location of related variants, and provide detailed visualization
    of changes in the spatial structure of variations.


    Figure 2 PGG.
    Schematic diagram of the SV interface

    In general, PGG.
    SV provides a high-quality data resource on population genomic structural variation, greatly improves the detection and display of human genome structural variation information based on next-generation sequencing data, especially for the first time, it comprehensively covers the diversity of structural variation in East Asian populations and Chinese populations, and provides annotations
    of related genes and potential clinical effects.
    In addition, the platform provides a variety of online analysis capabilities, including case-control studies, as well as visualization tools
    for structural variation in the human genome.

    Yimin Wang, Ling Zunchao, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, and Jiao Gong, Institute of Human Phenome, Fudan University, are the co-first authors
    of this paper.
    Professor Xu Shuhua, researcher Zhang Guoqing and researcher Fan Shaohua are the co-corresponding authors
    of this paper.
    The research work has been supported
    by the Basic Science Center of the National Foundation of China, the National Natural Science Foundation of China, the Pilot Project of the Chinese Academy of Sciences, the Newton Fund of the Royal Society, and the Shanghai Municipal Major Special Project of the Human Phenotype.

    Original link: https://academic.
    oup.
    com/nar/advance-article/doi/10.
    1093/nar/gkac905/6761741?login=false

    This article is an English version of an article which is originally in the Chinese language on echemi.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to service@echemi.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

    Related Articles

    Contact Us

    The source of this page with content of products and services is from Internet, which doesn't represent ECHEMI's opinion. If you have any queries, please write to service@echemi.com. It will be replied within 5 days.

    Moreover, if you find any instances of plagiarism from the page, please send email to service@echemi.com with relevant evidence.