Read name

ED Eric M. Davis
YS Yu Sun
YL Yanling Liu
PK Pandurang Kolekar
YS Ying Shao
KS Karol Szlachta
HM Heather L. Mulder
DR Dongren Ren
SR Stephen V. Rice
ZW Zhaoming Wang
JN Joy Nakitandwe
AG Alexander M. Gout
BS Bridget Shaner
SH Salina Hall
LR Leslie L. Robison
SP Stanley Pounds
JK Jeffery M. Klco
JE John Easton
XM Xiaotu Ma
request Request a Protocol
ask Ask a question
Favorite

The raw sequencing reads have names formatted as follows: <instrument>:<run number>:<flowcell ID>:<lane>:<tile>:<x-pos>:<y-pos> (https://help.basespace.illumina.com/articles/descriptive/fastq-files/; last accessed February 11, 2020). For example, the first record in dataset ERR3790565 has a read name of A00363:103:H3CMMDRXX:1:1101:21124:1000, which indicates that the sequencer ID is A00363, and this dataset was generated on its 103rd run, on a flow cell with ID H3CMMDRXX. This read was generated in lane 1, on tile 1101, with x position 21,124 and y position 1000. Our algorithm parses the read name to obtain information on sequencer, flow cell, and tiles according to this format.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

post Post a Question
0 Q&A