A survey of public honey bee sequence data was performed to detect infections by Lake Sinai Virus (LSV). The Sequence Read Archive of the National Center for Biotechnology Information (NCBI) was queried to identify accessions of RNA sequence data derived from honey bee. These were filtered as described below and then up to 50 million reads or read pairs were downloaded and searched against a reference database of conserved LSV sequence. Accessions with matches above a specified threshold were downloaded in their entirety and assembled into longer contiguous sequences (contigs). The result contigs were searched against each open reading frame (ORF) of the reference LSV genome present in the NCBI database (accession NC_032433.1) and matching regions from each contig. These ORF sequences were aligned with additional sequences identified in NCBI databases through the BLAST web service. These alignments provide the basis for computing phylogenetic trees, rates of nucleotide substitution, codon usage bias, and other evolutionary parameters.
|Title||Metagenomic detection and reconstruction of Lake Sinai Virus from honey bee sequence data|
|Authors||Robert Scott Cornman|
|Product Type||Data Release|
|Record Source||USGS Digital Object Identifier Catalog|
|USGS Organization||Fort Collins Science Center|