Updated 2015-05-29

The following files are provided:

620.mhT2D.RefGeneCatalog.faa.gz - protein sequences of predicted open reading frames (ORFs)
620.mhT2D.RefGeneCatalog.fna.gz - nucleotide sequences of predicted ORFs
620.mhT2D.RefGeneCatalog.padded.1 - padded** nucleotide sequences, part 1
620.mhT2D.RefGeneCatalog.padded.2 - padded** nucleotide sequences, part 2
620.mhT2D.RefGeneCatalog.padded.1.coord - start and stop positions of proteins in the padded database, part 1
620.mhT2D.RefGeneCatalog.padded.2.coord - start and stop positions of proteins in the padded database, part 2
620.mhT2D.RefGeneCatalog.eggnog3.annotations - annotations of predicted ORFs to the eggNOG v3 database (protein - COG)*
620.mhT2D.RefGeneCatalog.kegg62.annotations - annotations of predicted ORFs to the KEGG v62 database (protein - KO - module - pathway)*

*  Not all genes have an annotation at every level, and some have no annotation at all.

** "Padded" means that the gene sequences were, where possible, extended by up to 100 base pairs on either side of the start and stop positions of the predicted ORFs. These sequences were extracted from the assembled contigs on which the ORFs were predicted.
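The padding rule described above can be illustrated with a minimal Python sketch. It is not the code used to build the catalog. The function name `pad_orf`, the 0-based half-open coordinate convention, and the forward-strand-only handling are assumptions made for illustration. The layout of the .coord files is not described here, so the sketch does not attempt to read them.

```python
PAD = 100  # maximum extension on each side, in base pairs (from the note above)


def pad_orf(contig, start, stop, pad=PAD):
    """Extend an ORF by up to `pad` bp on each side, clipped to the contig ends.

    Assumption: `start` and `stop` are 0-based, half-open [start, stop)
    positions on the forward strand of `contig`.

    Returns (padded_sequence, orf_start_in_padded, orf_stop_in_padded).
    """
    if not (0 <= start < stop <= len(contig)):
        raise ValueError("ORF coordinates fall outside the contig")
    left = max(0, start - pad)             # cannot extend past the contig start
    right = min(len(contig), stop + pad)   # cannot extend past the contig end
    padded = contig[left:right]
    # Positions of the ORF relative to the padded sequence.
    return padded, start - left, stop - left


# Hypothetical contig: 30 bp upstream, a 9 bp ORF, 250 bp downstream.
contig = "A" * 30 + "ATGAAATAG" + "C" * 250
padded, orf_start, orf_stop = pad_orf(contig, 30, 39)
```

In this example only 30 bp are available upstream, so the left padding is shorter than 100 bp. This is the "where possible" case, and it is why start and stop positions within the padded sequence need to be recorded separately.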