用于分析冠状病毒基因组中广谱抗病毒药物诱导的过渡突变和错误率的引物ID下一代测序

Shuntai  Zhou; Collin S. Hill; Michael U. Clark; Timothy P. Sheahan; Ralph  Baric; Ronald  Swanstrom

doi:10.21769/BioProtoc.3938

Improve Research Reproducibility A Bio-protocol resource

提交稿件
订阅
登录
/
注册
- 个人主页
- 编辑个人信息
- 修改密码
- 退出
CN
- EN - English
- CN - 中文

Peer-reviewed

Primer ID Next-Generation Sequencing for the Analysis of a Broad Spectrum Antiviral Induced Transition Mutations and Errors Rates in a Coronavirus Genome

用于分析冠状病毒基因组中广谱抗病毒药物诱导的过渡突变和错误率的引物ID下一代测序

SZ Shuntai Zhou

CH Collin S. Hill

MC Michael U. Clark

TS Timothy P. Sheahan

RB Ralph Baric

RS Ronald Swanstrom email

发布: 2021年03月05日第11卷第5期 DOI: 10.21769/BioProtoc.3938 浏览次数: 7471

评审: Alka MehraSuresh PantheeKirsten A. Copren

PDF

Q&A

引用

Cited by

参见作者原研究论文

The authors used this protocol in:

Cover of Science Translational Medicine, featuring study using the protocol.

Apr 2020

Bio-protocol welcomes Protocols in Bioinformatics and Computational Biology

实验方案合集

Cell Imaging - A Special Collection for Cell Bio 2023

相关实验方案

抗病毒物质和抗体的杀灭病毒活性及中和活性检测

Chie Aoki-Utsubo [...] Hak Hotta

2018年05月20日 13866 阅读

在两细胞系中分析鞘磷脂酶对于风疹病毒感染力的影响

Noriyuki Otsuki [...] Makoto Takeda

2018年09月05日 6092 阅读

登革病毒滴度测定改良聚焦形成法

Maharah Binte Abdul Mahid [...] Kitti Wing Ki Chan

2024年10月20日 2411 阅读

Abstract

Next generations sequencing (NGS) has become an important tool in biomedical research. The Primer ID approach combined with the MiSeq platform overcomes the limitation of PCR errors and reveals the true sampling depth of population sequencing, making it an ideal tool to study mutagenic effects of potential broad-spectrum antivirals on RNA viruses. In this report we describe a protocol using Primer ID sequencing to study the mutations induced by antivirals in a coronavirus genome from an in vitro cell culture model and an in vivo mouse model. Viral RNA or total lung tissue RNA is tagged with Primer ID-containing cDNA primers during the initial reverse transcription step, followed by two rounds of PCR to amplify viral sequences and incorporate sequencing adaptors. Purified and pooled libraries are sequenced using the MiSeq platform. Sequencing data are processed using the template consensus sequence (TCS) web-app. The Primer ID approach provides an accurate sequencing protocol to measure mutation error rates in viral RNA genomes and host mRNA. Sequencing results suggested that β-D-N4-hydroxycytidine (NHC) greatly increased the transition substitution rate but not the transversion substitution rate in the viral RNA genomes, and cytosine (C) to uridine (U) was found as the most frequently seen mutation.

Keywords: Coronavirus (冠状病毒)

Antivirals (抗病毒药)

Next generation sequencing (新一代测序)

Primer ID (引物 ID)

Mutation (突变)

Background

Next generation sequencing (NGS) has been extensively used in biomedical research for the last decade. When applying NGS to study intra-host viral populations for RNA viruses, modifications in library prep and sequencing protocols need to be considered. The virus titers (or viral loads) vary greatly from specimen to specimen. While conventional NGS platforms require 1-500 ng of DNA (or RNA) in a sequencing run, in most cases a clinical sample will have less than 100 fg of viral RNA, requiring that the viral RNA first be converted to cDNA using reverse transcriptase, followed by one or two rounds of PCR amplification to generate enough material for sequencing. However, the extensive cycles of PCR amplification will cause nucleotide mis-incorporation, recombination and amplification bias. Furthermore, the NGS platforms have relatively high error rates adding additional uncertainty to the resulting sequences (Liu et al., 2012).

We have developed the Primer ID NGS approach to overcome the errors and bias from the conventional NGS approaches (Jabara et al., 2011; Zhou et al., 2015). During the initial cDNA synthesis step, we use cDNA primers with an embedded 11-base degenerate block of nucleotides (the Primer ID) to tag each viral RNA template with a unique ID, a version of the approach of adding a unique molecular identifier (UMI). The Primer ID is carried on throughout the downstream amplifications and sequencing. After sequencing, all raw sequences with the same ID are collapsed to make a template consensus sequence (TCS). Each TCS is linked to an original template that was queried in the initial cDNA synthesis step, and the number of TCS shows the sequencing sampling depth of the viral population. By the construction of consensus sequence from multiple raw sequencing reads for each viral template, we can greatly reduce the PCR and sequencing errors. We have shown that the Primer ID approach can greatly reduce the sequencing error to 1 in 10,000 nucleotides (100-fold reduction from the raw sequencing reads) and also reveal the true sampling depth of the viral population, making it possible to accurately measure the substitution rates and diversity in the viral genomes in a population (Figure 1) (Zhou et al., 2015). We have further developed a multiplexing Primer ID approach, allowing the sequencing of multiple regions of the viral genome in a single cDNA synthesis/PCR (Figure 2) (Dennis et al., 2018).

Figure 1. Primer ID Approach. A. The structure of the Primer ID primer and its binding to the viral RNA template. B. An example of creating the template consensus sequence from raw sequence reads.

Figure 2. Library prep steps for the Multiplexed Primer ID sequencing. Biological sequence region in blue color, Primer ID region in yellow color, forward spacer region in orange, MiSeq barcode sequence region in purple.

This approach is ideal for studying intra-host RNA virus populations due to its ability to accurately characterize individual viral genomes. We have used this approach to determine the genetic structures of intra-host viral populations in HIV (Zhou et al., 2016; Lee et al., 2017; Dennis et al., 2018; Abrahams et al., 2019; Adewumi et al., 2020; Council et al., 2020), to study drug resistance mutations in HIV and HCV (Clutter et al., 2017; Zhou et al., 2018; Keys et al., 2015), and to detect nucleotide mutation rates in single-stranded DNA molecule after heating (Lewis et al., 2016).

In this report we describe a protocol using the Primer ID MiSeq sequencing to detect the mutagenic effect of EIDD-1931(β-D-N4-hydroxycytidine, NHC) and EIDD-2801 (an orally bioavailable prodrug of EIDD-1931 and is also recognized as MK-4482 or Molnupiravir) in the Middle East respiratory syndrome coronavirus (MERS-CoV) genome from in vitro cell culture and in vivo mouse models. In the cell culture sequencing experiment, we used a multiplexed Primer ID protocol to sequence multiple regions in the MERS-CoV ORF1b in a single library prep and sequencing reaction, while in the sequencing of total lung RNA from MERS-CoV infected mice, we used a multiplexed Primer ID protocol targeting both MERS-CoV genomes and two selected mouse mRNAs in a single library prep and sequencing reaction, in which we can directly compared the mutagenic effect induced by EIDD-2801 on MERS-CoV genomes and host mRNA. This protocol is not restricted to MERS-CoV and EIDD-1931 study only, but it can be used to detect viral RNA mutation rate in general.

Materials and Reagents

Pipette tips
Collection tubes (QIAGEN, catalog number: 19201 , stored at room temperature)
Qubit Assay Tubes (ThermoFisher Scientific, Invitrogen, catalog number: Q32856 , stored at room temperature)
SuperScript III One-Step RT-PCR System (ThermoFisher Scientific, Invitrogen, catalog number: 12574026 , stored at -20 °C)
Primer ID primers (Integrated DNA Technologies) at 100 µM in Tris-HCl buffer stored at -20 °C
SuperScript III Reverse Transcriptase (ThermoFisher Scientific, Invitrogen, catalog number: 18080085 , stored at -20 °C)
RNaseOUT Recombinant Ribonuclease Inhibitor (ThermoFisher Scientific, Invitrogen, catalog number: 10777019 , stored at -20 °C)
Ribonuclease H (ThermoFisher Scientific, Invitrogen, catalog number: 18021071 , stored at -20 °C)
KAPA2G Robust PCR kits (Hotstart PCR kits with dNTPs) (Roche, catalog number: KK5516 , stored at -20 °C)
KAPA2G HiFi HotStart (with dNTPs) (Roche, catalog number: KK2502 , stored at -20 °C)
RNAClean XP (Beckman Coulter, catalog number: A63987 , stored at 4 °C)
AMPure XP (Beckman Coulter, catalog number: A63881 , stored at 4 °C)
Ethanol 200 Proof (Decon Laboratories, catalog number: 2716 )
DNase/RNase-free water
MinElute Gel Extraction kit (QIAGEN, catalog number: 28606 , stored at 4 °C for MinElute columns, room temperature for other reagents)
Qubit dsDNA BR Assay Kit (ThermoFisher Scientific, Invitrogen, catalog number: Q32853 , stored at 4 °C for standard 1 and standard 2, room temperature for other reagents)
Experion DNA 12K analysis kit (Bio-Rad, catalog number: 7007108 , stored at room temperature for the chips, 4 °C for other reagents)
MiSeq Reagent Kit v3 (600-cycle) (Illumina, catalog number: MS-102-3003 , stored at -20 °C for box 1 and 4 °C for box 2)
Sodium hydroxide (Sigma-Aldrich, catalog number: 221465 , stored at room temperature)
Tris hydrochloride (Sigma-Aldrich, catalog number: 10812846001 , stored at room temperature)
PhiX Control V3 (Illumina, catalog number: FC-110-3001 , stored at -20 °C)
10 mM Tris-HCl with 0.1% TWEEN 20 (Teknova, catalog number: T7724 , stored at room temperature)

Equipment

Pipettes
Magnetic rack
Thermal cycler (ThermoFisher Scientific, Applied Biosystems, catalog number: 4384638 )
Centrifuge (Eppendorf, model: 5424 )
DynaMag-2 Magnet (ThermoFisher Scientific, Invitrogen, catalog number: 12321D )
Qubit 2.0 Fluorometer (ThermoFisher Scientific, Invitrogen, catalog number: Q32866 )
Experion Electrophoresis Station (Bio-Rad, catalog number: 7007010 )
MiSeq System (Illumina, catalog number: SY-410-1003 )

Software

Primer-BLAST (NCBI, https://www.ncbi.nlm.nih.gov/tools/primer-blast/) (Ye et al., 2012)
bcl2fastq pipeline (v.2.20.0) (Illumina, https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html)
TCS web app (UNC, https://tcs-dr-dept-tcs.cloudapps.unc.edu/)
TCS pipeline (v.1.3.8) (UNC, https://github.com/SwanstromLab/PID)
RUBY package viral_seq (v.1.0.8) (UNC, https://github.com/ViralSeq/viral_seq)

Procedure

English

中文翻译

文章信息

版权信息

如何引用

Readers should cite both the Bio-protocol article and the original research article where this protocol was used:

Zhou, S., Hill, C. S., Clark, M. U., Sheahan, T. P., Baric, R. and Swanstrom, R. (2021). Primer ID Next-Generation Sequencing for the Analysis of a Broad Spectrum Antiviral Induced Transition Mutations and Errors Rates in a Coronavirus Genome . Bio-protocol 11(5): e3938. DOI: 10.21769/BioProtoc.3938.
Sheahan, T. P., Sims, A. C., Zhou, S., Graham, R. L., Pruijssers, A. J., Agostini, M. L., Leist, S. R., Schafer, A., Dinnon, K. H., 3rd, Stevens, L. J., Chappell, J. D., Lu, X., Hughes, T. M., George, A. S., Hill, C. S., Montgomery, S. A., Brown, A. J., Bluemling, G. R., Natchus, M. G., Saindane, M., Kolykhalov, A. A., Painter, G., Harcourt, J., Tamin, A., Thornburg, N. J., Swanstrom, R., Denison, M. R., and Baric, R. S. (2020). An orally bioavailable broad-spectrum antiviral inhibits SARS-CoV-2 in human airway epithelial cell cultures and multiple coronaviruses in mice. Sci Transl Med 12(541): eabb5883.

Download Citation in RIS Format

分类

您对这篇实验方法有问题吗？

在此处发布您的问题，我们将邀请本文作者来回答。同时，我们会将您的问题发布到Bio-protocol Exchange，以便寻求社区成员的帮助。

发布问题

0 Q&A

提交稿件