As outlined in Input File Types, you will need to supply the required reference genome databases in order to run the ViroMatch pipeline. These resources have already been indexed and are ready for processing, but you will need to download all of the files first.
We provide file access and transfer through Globus Connect.
What is Globus?
Globus Connect enables your system to use the Globus file transfer and sharing service. It makes it simple to create a Globus endpoint on practically any system, from a personal laptop to a national supercomputer. Globus Connect is free to install and use for users at non-profit research and education institutions.
Globus Connect Versions (Source: Globus Connect)
Globus Connect Personal is designed for use by a single user on a personal machine and is free for users at non-profit research and education institutions. Once Globus Connect Personal is installed and you are logged in, you may click on the link below to download the ViroMatch databases.
Download ViroMatch Databases Using Globus
The main sub-directories for download are listed below, organized by database type. For detailed information on each sub-directory's files, see the corresponding Input File Type.
Sub-Directory | Input File Type |
---|---|
adaptor/ | Adaptor File |
host/ | Host File |
ncbi/nr/ | NCBI nr Files |
ncbi/nt/ | NCBI nt Files |
taxonomy/ | Taxid File |
viral-only/nuc/ | Viralfna File |
viral-only/trans_nuc/ | Viralfaa File |
You will need all of the underlying database files in these sub-directories in order to run the pipeline. Be aware that the databases are quite large, about 860 GB in total, so make sure the destination has enough free disk space before starting the transfer.
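Because the transfer is large, it can be worth confirming afterwards that every sub-directory arrived. The following is a minimal sketch, not part of ViroMatch itself: it assumes the downloaded sub-directories keep the layout shown in the table under a single local root directory, and the function names `missing_subdirs` and `total_size_gb` are hypothetical helpers introduced here for illustration. It only checks that each sub-directory exists and is non-empty; it does not validate individual files.

```python
from pathlib import Path

# Sub-directories listed in the table above (relative to the download root).
EXPECTED_SUBDIRS = [
    "adaptor",
    "host",
    "ncbi/nr",
    "ncbi/nt",
    "taxonomy",
    "viral-only/nuc",
    "viral-only/trans_nuc",
]


def missing_subdirs(root):
    """Return the expected sub-directories that are absent or empty under root."""
    root = Path(root)
    missing = []
    for sub in EXPECTED_SUBDIRS:
        path = root / sub
        # A sub-directory counts as missing if it does not exist or holds no entries.
        if not path.is_dir() or not any(path.iterdir()):
            missing.append(sub)
    return missing


def total_size_gb(root):
    """Sum the sizes of all regular files under root, in gigabytes (10**9 bytes)."""
    total_bytes = sum(p.stat().st_size for p in Path(root).rglob("*") if p.is_file())
    return total_bytes / 1e9
```

Calling `missing_subdirs` on your download directory should return an empty list once the transfer is complete, and `total_size_gb` gives a rough figure to compare against the approximate total quoted above.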