This directory contains the following data from 124 samples.

A. Contigs from the 124 samples and the non-redundant gene set derived from them. Functional annotation and taxonomic classification information are also included.
   1. contigs    : .seq.fa.gz
   2. genes      : BGI_GeneSet20090523.fa.gz
   3. annotation : BGI_GeneSet20090523_annotation.gz
   4. taxonomic  : BGI_GeneSet20090523_taxonomic.gz

B. Protein set containing 319812 proteins from 89 frequent reference microbial genomes.
   1. frequent_microbe_proteins.fasta.gz
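
The sequence files listed here are gzip-compressed, so they can be read without unpacking them first. The following is a minimal Python sketch, assuming the .fa.gz and .fasta.gz files are ordinary FASTA (a ">" header line followed by one or more sequence lines). The function name read_fasta_gz is hypothetical and is not part of this data set. The layout of the annotation and taxonomic files is not described here, so the sketch does not try to parse them.

```python
import gzip
from typing import Iterator, Tuple


def read_fasta_gz(path: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from a gzip-compressed FASTA file.

    Assumes standard FASTA layout: each record starts with a '>' line,
    followed by sequence lines that may be wrapped.
    """
    header = None
    chunks = []
    # "rt" opens the compressed stream in text mode, line by line.
    with gzip.open(path, "rt") as handle:
        for raw in handle:
            line = raw.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new header closes the previous record, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header = line[1:]
                chunks = []
            else:
                chunks.append(line)
    # Emit the final record once the file is exhausted.
    if header is not None:
        yield header, "".join(chunks)


if __name__ == "__main__":
    # File name taken from the listing above; run from this directory.
    count = sum(1 for _ in read_fasta_gz("frequent_microbe_proteins.fasta.gz"))
    print("records:", count)
```

Run against frequent_microbe_proteins.fasta.gz, the record count can be compared with the 319812 proteins stated above. The same function should work for BGI_GeneSet20090523.fa.gz and the contig files if they follow the same FASTA layout.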