Databases

Downloading Required ViroMatch Databases

As outlined in Input File Types, you will need to supply required reference genome databases in order to run the ViroMatch pipeline. These resources have already been indexed and are ready for processing, but you will need to download all of the files first.

We provide file access and transfer through Globus Connenct.

What is Globus?

Globus Connect enables your system to use the Globus file transfer and sharing service. It makes it simple to create a Globus endpoint on practically any system, from a personal laptop to a national supercomputer. Globus Connect is free to install and use for users at non-profit research and education institutions. Globus Connect Versions (Source: Globus Connect)

Globus Connect Personal is designed for use by a single user on a personal machine and is free for users at non-profit research and education institutions. Once Globus Connect Personal is installed and you are logged in, you may click on the link below to download the ViroMatch databases.

Download ViroMatch Databases Using Globus

The main sub-directories for download are listed below, organized by database type. For detailed information on each sub-directory’s files, see the corresponding Input File Type.

Sub-Directory Input File Type
adaptor/ Adaptor File
host/ Host File
ncbi/nr/ NCBI nr Files
ncbi/nt/ NCBI nt Files
taxonomy/ Taxid File
viral-only/nuc/ Viralfna File
viral-only/trans_nuc/ Viralfaa File

You will need all of the underlying database files in these sub-directories in order to run the pipeline. Be aware the databases are quite large in cumulative size (~860 GB).