-
Categories
-
Pharmaceutical Intermediates
-
Active Pharmaceutical Ingredients
-
Food Additives
- Industrial Coatings
- Agrochemicals
- Dyes and Pigments
- Surfactant
- Flavors and Fragrances
- Chemical Reagents
- Catalyst and Auxiliary
- Natural Products
- Inorganic Chemistry
-
Organic Chemistry
-
Biochemical Engineering
- Analytical Chemistry
-
Cosmetic Ingredient
- Water Treatment Chemical
-
Pharmaceutical Intermediates
Promotion
ECHEMI Mall
Wholesale
Weekly Price
Exhibition
News
-
Trade Service
On October 16, 2022, the internationally renowned academic journal Nucleic Acids Research published online the PGG.
SV of the human genome structural variation database jointly developed by Professor Xu Shuhua's team from the School of Life Sciences/Institute of Human Phenomics of Fudan University, Zhang Guoqing, researcher from the Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, and researcher Fan Shaohua from the School of Life Sciences, Fudan University PGGSV/), titled "PGG.
SV: A whole-genome-sequencing-based structural variant resource and data analysis platform"
.
By collecting whole genome sequencing data of the global population, the database focuses on the mining and integration of genomic structural variation data, providing a comprehensive platform
for data acquisition, information query and online analysis for the study of human genome structural variation.
Genomic structural variants (SVs) mainly include DNA deletion, insertion, fragment duplication and other variant types of large fragments on the genome, and a large number of studies have shown that SV is related to a variety of complex genetic diseases such as cancer, autism, and neurodevelopmental disorders, and has received continuous attention
in the field of medicine and genetics in recent years.
With the advancement and popularization of genome sequencing technology, a large number of structural variants have been continuously discovered and studied, and some structural variants with strong pathogenicity have gradually been verified
.
The research team aims to construct a representative and diverse dataset of genomic structural variation in healthy people, on the one hand, to provide reliable control samples for the study of structural variation in patients with genetic diseases, and on the other hand, to annotate and predict the function of variation, which will also effectively narrow the screening scope of pathogenic mutations and provide effective guidance and assistance
to researchers in related fields.
Due to the significant differences and diversity of structural variants among different regions and ethnic groups, and the existing databases and public datasets each adopt different analysis processes, there has been a lack of a structural variation resource and analysis platform with population samples and next-generation sequencing data, especially the coverage of East Asian population samples is seriously insufficient
.
The research team integrated large-scale sequencing data, including 6,048 whole genome sequencing data from 177 representative regions and ethnic groups around the world, especially the in-depth analysis of China's rich ethnic diversity characteristics, covering 50 ethnic minorities
in China for the first time.
As of publication, the database contains a total of 584,277 structural variants, which will continue to be added
in the future.
In addition, PGG.
SV includes the third-generation long-reads sequencing data for the first time, which has greater advantages in the detection of structural variations, especially in the detection and determination of insertion sequences, and its effect is significantly better than second-generation sequencing technology
.
Previous databases of large-scale structural variants were built
on next-generation sequencing or gene chip data.
The research team generated and collected 1,030 third-generation sequencing genomes, and for the first time used the combination of third-generation sequencing and second-generation sequencing to construct a structural variation database, which greatly improved the quantity and quality
of structural variation detection results.
Figure 1 PGG.
Schematic diagram of SV data processing flow
In terms of database function, PGG.
SV provides a concise and friendly query function, providing accurate display of the genomic location of different population structural variations, as well as statistical information
such as frequency differences between various ethnic groups around the world.
Taking advantage of the previous accumulation of the research group, PGG.
SV linked with PGG.
SNV and other databases previously developed by Professor Xu Shuhua's team, and combined the detailed results of single nucleotide variation (SNV) with structural variation with the help of linkage imbalance and genomic spatial location information to enhance the analysis function
of data diversity 。 In addition, PGG.
SV provides rich clinical effect analysis and predictive analysis capabilities, providing prediction and enrichment analysis of genes and regulatory elements associated with structural variants, as well as tools for retrieving relevant structural variants by specific diseases and phenotypes, for users with
clinical research needs.
Finally, PGG.
SV also supports rich online analysis and visualization capabilities
.
On the one hand, the research team provides a comparison and annotation of the structural variation results submitted by the user, so that the user can understand the difference between his own target sample and the control sample provided by the database; On the other hand, the research team also provides structural variation visualization capabilities, which can retrieve user-submitted DNA sequences on the human genome, display the genomic location of related variants, and provide detailed visualization
of changes in the spatial structure of variations.
Figure 2 PGG.
Schematic diagram of the SV interface
In general, PGG.
SV provides a high-quality data resource on population genomic structural variation, greatly improves the detection and display of human genome structural variation information based on next-generation sequencing data, especially for the first time, it comprehensively covers the diversity of structural variation in East Asian populations and Chinese populations, and provides annotations
of related genes and potential clinical effects.
In addition, the platform provides a variety of online analysis capabilities, including case-control studies, as well as visualization tools
for structural variation in the human genome.
Yimin Wang, Ling Zunchao, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, and Jiao Gong, Institute of Human Phenome, Fudan University, are the co-first authors
of this paper.
Professor Xu Shuhua, researcher Zhang Guoqing and researcher Fan Shaohua are the co-corresponding authors
of this paper.
The research work has been supported
by the Basic Science Center of the National Foundation of China, the National Natural Science Foundation of China, the Pilot Project of the Chinese Academy of Sciences, the Newton Fund of the Royal Society, and the Shanghai Municipal Major Special Project of the Human Phenotype.
Original link: https://academic.
oup.
com/nar/advance-article/doi/10.
1093/nar/gkac905/6761741?login=false