Viruses pose a huge threat to human health, as shown by the Spanish flu of 1918, AIDS, Ebola, SARS, and the new coronavirus.
It is estimated that as many as 3 × 10^5 viruses can cause infectious diseases in humans, but unfortunately only a small number of them are known.
With the help of new technologies such as high-throughput sequencing, thousands of novel viruses have been discovered, and the number of such discoveries is increasing exponentially.
However, interpreting the resulting sequence data, for example assembling the reads into longer sequences, remains a challenge.
On January 26, 2022, Artem Babaian of Canada published the article "Petabase-scale sequence alignment catalyses viral discovery" in Nature. It describes Serratus, a cloud computing platform that can perform sequence alignment at the PB level (1 PB = 1024 TB), and reports the identification of more than 10^5 novel RNA viruses.
Public databases such as SRA (Sequence Read Archive) hold petabyte-level volumes of sequence data, and these data are freely available.
The researchers loaded copies of these data onto the cloud platform Serratus (free and open source, https://serratus.io) and used it to analyze short-read sequence datasets at scale, at an average cost of as little as 1 cent or less per dataset.
To identify libraries containing virus-related sequences, the researchers screened 3,837,755 public RNA-seq, metagenome, metatranscriptome, and metavirome datasets. They first aligned these datasets against all coronavirus and vertebrate virus sequences, and then against all RNA-dependent RNA polymerase (RdRP) sequences. This identified 15,016 known sOTUs (species-like operational taxonomic units) and 131,957 unknown sOTUs.
It is estimated that there are about 10^8 to 10^12 types of viruses, so the viruses identified in this study account for at most about 0.1% of the estimated number.
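The 0.1% figure can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes that the "number of viruses" being compared is the count of unknown sOTUs quoted above, and the constant and function names are made up for this example rather than taken from Serratus.

```python
# Counts quoted in the article (sOTU = species-like operational taxonomic unit).
KNOWN_SOTUS = 15_016
NOVEL_SOTUS = 131_957

# Estimated range of virus types quoted in the article.
ESTIMATE_LOW = 10**8
ESTIMATE_HIGH = 10**12


def percent(part: float, whole: float) -> float:
    """Return part / whole expressed as a percentage."""
    return 100.0 * part / whole


# Assumption: the novel sOTU count is what the article compares to the estimate.
total_sotus = KNOWN_SOTUS + NOVEL_SOTUS
novel_share = percent(NOVEL_SOTUS, total_sotus)
coverage_upper = percent(NOVEL_SOTUS, ESTIMATE_LOW)   # against the low estimate
coverage_lower = percent(NOVEL_SOTUS, ESTIMATE_HIGH)  # against the high estimate

print(f"total sOTUs found: {total_sotus}")
print(f"novel share of sOTUs found: {novel_share:.1f}%")
print(f"share of estimated diversity: {coverage_lower:.7f}% to {coverage_upper:.2f}%")
```

Measured against the low end of the estimate (10^8), the novel sOTUs come to roughly 0.13%, which matches the article's rounded 0.1%. Against the high end (10^12) the share is several orders of magnitude smaller.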
In view of the coronavirus epidemic of the past two years, the researchers used Serratus to mine existing datasets for coronaviruses. They found 70 sOTUs, of which 44 had already been reported, 17 contain only a partial RdRP, and 9 are novel coronaviruses.
Hepatitis viruses cause more deaths each year than HIV, tuberculosis, or malaria.
Among them, hepatitis D virus was considered to be the only type of delta virus until 2018.
In short, against the background of the coronavirus epidemic, this study applied cloud computing, a popular Internet concept of recent years, to sequence alignment, and discovered more than 100,000 new RNA viruses by mining public databases.
Expanding our understanding of the viral world helps us predict and prevent future viral pandemics.
Original link: https://doi.org/10.1038/s41586-021-04332-2