For each virus, the GM calculated a score (the logarithm of the probability of being viral host) for all prokaryotic genomes available in the dataset. For example, in the test dataset, each of the 60,105 prokaryotic genomes would be assigned a score (the logarithm of the probability of being viral host) by the GM. The prokaryotic species with the highest score was considered as the predicted host of the virus. The predicted host was compared to the actual host at different taxonomic levels. If the predicted host belonged to the same taxonomic unit such as genus with the actual host, the prediction was considered as correct at the level. The accuracy of virus host prediction at a certain taxonomic level was defined as the ratio of correctly predicted host at this taxonomic level.

Note: The content above has been extracted from a research article, so it may not display correctly.

Please log in to submit your questions online.
Your question will be posted on the Bio-101 website. We will send your questions to the authors of this protocol and Bio-protocol community members who are experienced with this method. you will be informed using the email address associated with your Bio-protocol account.

We use cookies on this site to enhance your user experience. By using our website, you are agreeing to allow the storage of cookies on your computer.