Supplementary data for the enterotypes article

Enterotypes

Our knowledge of species and functional composition of the human gut microbiome is rapidly increasing, but it is still based on very few cohorts and little is known about variation across the world. By combining 22 newly sequenced faecal metagenomes of individuals from four countries with previously published data sets, here we identify three robust clusters (referred to as enterotypes hereafter) that are not nation or continent specific. We also confirmed the enterotypes in two published, larger cohorts, indicating that intestinal microbiota variation is generally stratified, not continuous. This indicates further the existence of a limited number of well-balanced host–microbial symbiotic states that might respond differently to diet and drug intake. The enterotypes are mostly driven by species composition, but abundant molecular functions are not necessarily provided by abundant species, highlighting the importance of a functional analysis to understand microbial communities. Although individual host properties such as body mass index, age, or gender cannot explain the observed enterotypes, data-driven marker genes or functional modules can be identified for each of these host properties. For example, twelve genes significantly correlate with age and three functional modules with the body mass index, hinting at a diagnostic potential of microbial markers.

See the enterotype article in Nature.

Keywords: enterotypes, nature, metagenomics

Human gut metagenome data from the enterotype article

Posted on Apr 20, 2011

Enterotypes were identified by the MetaHIT consortium by analyzing 22 European human gut metagenomes along with 13 Japanase and 4 American human gut metagenomes. They were further supported by identifying them in Illumina based metagenomes from 85 Danish individuals as well as 16S pyrosequencing data from 154 American individuals.

You can download all the data associated with this article in the downloads section.

To reproduce the analysis performed in the article, raw metagenome sequences must be processed by the SmashCommunity metagenomic annotation and analysis pipeline as explained in the supplementary material of the article.

Keywords: MetaHIT, SmashCommunity