发布: 2021年03月05日第11卷第5期 DOI: 10.21769/BioProtoc.3938 浏览次数: 5956
评审: Alka MehraSuresh PantheeKirsten A. Copren
Abstract
Next generations sequencing (NGS) has become an important tool in biomedical research. The Primer ID approach combined with the MiSeq platform overcomes the limitation of PCR errors and reveals the true sampling depth of population sequencing, making it an ideal tool to study mutagenic effects of potential broad-spectrum antivirals on RNA viruses. In this report we describe a protocol using Primer ID sequencing to study the mutations induced by antivirals in a coronavirus genome from an in vitro cell culture model and an in vivo mouse model. Viral RNA or total lung tissue RNA is tagged with Primer ID-containing cDNA primers during the initial reverse transcription step, followed by two rounds of PCR to amplify viral sequences and incorporate sequencing adaptors. Purified and pooled libraries are sequenced using the MiSeq platform. Sequencing data are processed using the template consensus sequence (TCS) web-app. The Primer ID approach provides an accurate sequencing protocol to measure mutation error rates in viral RNA genomes and host mRNA. Sequencing results suggested that β-D-N4-hydroxycytidine (NHC) greatly increased the transition substitution rate but not the transversion substitution rate in the viral RNA genomes, and cytosine (C) to uridine (U) was found as the most frequently seen mutation.
Keywords: Coronavirus (冠状病毒)Background
Next generation sequencing (NGS) has been extensively used in biomedical research for the last decade. When applying NGS to study intra-host viral populations for RNA viruses, modifications in library prep and sequencing protocols need to be considered. The virus titers (or viral loads) vary greatly from specimen to specimen. While conventional NGS platforms require 1-500 ng of DNA (or RNA) in a sequencing run, in most cases a clinical sample will have less than 100 fg of viral RNA, requiring that the viral RNA first be converted to cDNA using reverse transcriptase, followed by one or two rounds of PCR amplification to generate enough material for sequencing. However, the extensive cycles of PCR amplification will cause nucleotide mis-incorporation, recombination and amplification bias. Furthermore, the NGS platforms have relatively high error rates adding additional uncertainty to the resulting sequences (Liu et al., 2012).
We have developed the Primer ID NGS approach to overcome the errors and bias from the conventional NGS approaches (Jabara et al., 2011; Zhou et al., 2015). During the initial cDNA synthesis step, we use cDNA primers with an embedded 11-base degenerate block of nucleotides (the Primer ID) to tag each viral RNA template with a unique ID, a version of the approach of adding a unique molecular identifier (UMI). The Primer ID is carried on throughout the downstream amplifications and sequencing. After sequencing, all raw sequences with the same ID are collapsed to make a template consensus sequence (TCS). Each TCS is linked to an original template that was queried in the initial cDNA synthesis step, and the number of TCS shows the sequencing sampling depth of the viral population. By the construction of consensus sequence from multiple raw sequencing reads for each viral template, we can greatly reduce the PCR and sequencing errors. We have shown that the Primer ID approach can greatly reduce the sequencing error to 1 in 10,000 nucleotides (100-fold reduction from the raw sequencing reads) and also reveal the true sampling depth of the viral population, making it possible to accurately measure the substitution rates and diversity in the viral genomes in a population (Figure 1) (Zhou et al., 2015). We have further developed a multiplexing Primer ID approach, allowing the sequencing of multiple regions of the viral genome in a single cDNA synthesis/PCR (Figure 2) (Dennis et al., 2018).
Figure 1. Primer ID Approach. A. The structure of the Primer ID primer and its binding to the viral RNA template. B. An example of creating the template consensus sequence from raw sequence reads.
Figure 2. Library prep steps for the Multiplexed Primer ID sequencing. Biological sequence region in blue color, Primer ID region in yellow color, forward spacer region in orange, MiSeq barcode sequence region in purple.
This approach is ideal for studying intra-host RNA virus populations due to its ability to accurately characterize individual viral genomes. We have used this approach to determine the genetic structures of intra-host viral populations in HIV (Zhou et al., 2016; Lee et al., 2017; Dennis et al., 2018; Abrahams et al., 2019; Adewumi et al., 2020; Council et al., 2020), to study drug resistance mutations in HIV and HCV (Clutter et al., 2017; Zhou et al., 2018; Keys et al., 2015), and to detect nucleotide mutation rates in single-stranded DNA molecule after heating (Lewis et al., 2016).
In this report we describe a protocol using the Primer ID MiSeq sequencing to detect the mutagenic effect of EIDD-1931(β-D-N4-hydroxycytidine, NHC) and EIDD-2801 (an orally bioavailable prodrug of EIDD-1931 and is also recognized as MK-4482 or Molnupiravir) in the Middle East respiratory syndrome coronavirus (MERS-CoV) genome from in vitro cell culture and in vivo mouse models. In the cell culture sequencing experiment, we used a multiplexed Primer ID protocol to sequence multiple regions in the MERS-CoV ORF1b in a single library prep and sequencing reaction, while in the sequencing of total lung RNA from MERS-CoV infected mice, we used a multiplexed Primer ID protocol targeting both MERS-CoV genomes and two selected mouse mRNAs in a single library prep and sequencing reaction, in which we can directly compared the mutagenic effect induced by EIDD-2801 on MERS-CoV genomes and host mRNA. This protocol is not restricted to MERS-CoV and EIDD-1931 study only, but it can be used to detect viral RNA mutation rate in general.
Materials and Reagents
Pipette tips
Collection tubes (QIAGEN, catalog number: 19201 , stored at room temperature)
Qubit Assay Tubes (ThermoFisher Scientific, Invitrogen, catalog number: Q32856 , stored at room temperature)
SuperScript III One-Step RT-PCR System (ThermoFisher Scientific, Invitrogen, catalog number: 12574026 , stored at -20 °C)
Primer ID primers (Integrated DNA Technologies) at 100 µM in Tris-HCl buffer stored at -20 °C
SuperScript III Reverse Transcriptase (ThermoFisher Scientific, Invitrogen, catalog number: 18080085 , stored at -20 °C)
RNaseOUT Recombinant Ribonuclease Inhibitor (ThermoFisher Scientific, Invitrogen, catalog number: 10777019 , stored at -20 °C)
Ribonuclease H (ThermoFisher Scientific, Invitrogen, catalog number: 18021071 , stored at -20 °C)
KAPA2G Robust PCR kits (Hotstart PCR kits with dNTPs) (Roche, catalog number: KK5516 , stored at -20 °C)
KAPA2G HiFi HotStart (with dNTPs) (Roche, catalog number: KK2502 , stored at -20 °C)
RNAClean XP (Beckman Coulter, catalog number: A63987 , stored at 4 °C)
AMPure XP (Beckman Coulter, catalog number: A63881 , stored at 4 °C)
Ethanol 200 Proof (Decon Laboratories, catalog number: 2716 )
DNase/RNase-free water
MinElute Gel Extraction kit (QIAGEN, catalog number: 28606 , stored at 4 °C for MinElute columns, room temperature for other reagents)
Qubit dsDNA BR Assay Kit (ThermoFisher Scientific, Invitrogen, catalog number: Q32853 , stored at 4 °C for standard 1 and standard 2, room temperature for other reagents)
Experion DNA 12K analysis kit (Bio-Rad, catalog number: 7007108 , stored at room temperature for the chips, 4 °C for other reagents)
MiSeq Reagent Kit v3 (600-cycle) (Illumina, catalog number: MS-102-3003 , stored at -20 °C for box 1 and 4 °C for box 2)
Sodium hydroxide (Sigma-Aldrich, catalog number: 221465 , stored at room temperature)
Tris hydrochloride (Sigma-Aldrich, catalog number: 10812846001 , stored at room temperature)
PhiX Control V3 (Illumina, catalog number: FC-110-3001 , stored at -20 °C)
10 mM Tris-HCl with 0.1% TWEEN 20 (Teknova, catalog number: T7724 , stored at room temperature)
Equipment
Pipettes
Magnetic rack
Thermal cycler (ThermoFisher Scientific, Applied Biosystems, catalog number: 4384638 )
Centrifuge (Eppendorf, model: 5424 )
DynaMag-2 Magnet (ThermoFisher Scientific, Invitrogen, catalog number: 12321D )
Qubit 2.0 Fluorometer (ThermoFisher Scientific, Invitrogen, catalog number: Q32866 )
Experion Electrophoresis Station (Bio-Rad, catalog number: 7007010 )
MiSeq System (Illumina, catalog number: SY-410-1003 )
Software
Primer-BLAST (NCBI, https://www.ncbi.nlm.nih.gov/tools/primer-blast/) (Ye et al., 2012)
bcl2fastq pipeline (v.2.20.0) (Illumina, https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html)
TCS web app (UNC, https://tcs-dr-dept-tcs.cloudapps.unc.edu/)
TCS pipeline (v.1.3.8) (UNC, https://github.com/SwanstromLab/PID)
RUBY package viral_seq (v.1.0.8) (UNC, https://github.com/ViralSeq/viral_seq)
Procedure
文章信息
版权信息
© 2021 The Authors; exclusive licensee Bio-protocol LLC.
如何引用
Readers should cite both the Bio-protocol article and the original research article where this protocol was used:
分类
微生物学 > 抗微生物试验 > 抗病毒试验
分子生物学 > RNA
系统生物学 > 转录组学
您对这篇实验方法有问题吗?
在此处发布您的问题,我们将邀请本文作者来回答。同时,我们会将您的问题发布到Bio-protocol Exchange,以便寻求社区成员的帮助。
提问指南
+ 问题描述
写下详细的问题描述,包括所有有助于他人回答您问题的信息(例如实验过程、条件和相关图像等)。
Share
Bluesky
X
Copy link