2.4. Analysis of simulated and observed coefficient of variants (CVs)

DL Donglai Liu
HZ Haiwei Zhou
TX Teng Xu
QY Qiwen Yang
XM Xi Mo
DS Dawei Shi
JA Jingwen Ai
JZ Jingjia Zhang
YT Yue Tao
DW Donghua Wen
YT Yigang Tong
LR Lili Ren
WZ Wen Zhang
SX Shumei Xie
WC Weijun Chen
WX Wanli Xing
JZ Jinyin Zhao
YW Yilan Wu
XM Xianfa Meng
CO Chuan Ouyang
ZJ Zhi Jiang
ZL Zhikun Liang
HT Haiqin Tan
YF Yuan Fang
NQ Nan Qin
YG Yuanlin Guan
WG Wei Gai
SX Sihong Xu
WW Wenjuan Wu
WZ Wenhong Zhang
CZ Chuntao Zhang
YW Youchun Wang
request Request a Protocol
ask Ask a question
Favorite

Fastq data from the 10 repeated replicates of each sample were merged, randomly re-sampled according to the original data sizes (total number of reads), and analyzed by the CLARK-based pipeline. The CVs were calculated based on the read numbers mapped to each microbe within each re-sampled replicate. This above process was repeated 10 times to obtain a total of 10 simulated CVs for each microbe. The average of these simulated CVs represented the CV derived from variations in data size. A linear regression model was used to evaluate the contribution of these CVs to the observed overall CVs. In addition, we used a linear mixed model to further evaluate whether the sequencing platform, library method, and class of microorganism affected the observed CV. The formula of the linear mixed model was defined as:

Cv_observed ∼ Cv_datasize + Library Prep + Microbial Class + Platform + (1+Cv_observed|Center) where Center was a random effect, and the read depth CV (Cv_datasize), library preparation method (Library Prep), microbial class, and sequencing platform (Platform) were fixed effects.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

post Post a Question
0 Q&A