Descripción del proyecto
METAGENOMICS HAS REVOLUTIONIZED THE WAY WE LOOK AT THE MICROBIAL WORLD, OVER THE LAST DECADE, BIODIVERSITY MEASUREMENTS HAVE EXCEEDED MOST GENEROUS ESTIMATES OF THE SO-CALLED MICROBIAL DARK MATTER, HAVING ACHIEVED NO PLATEAU, THE AMOUNT OF GENETIC MATERIAL SAMPLED BY SHOTGUN STUDIES IS EQUALLY OVERWHELMING, NOT ONLY BECAUSE OF ITS SIZE BUT ALSO THE FUNCTIONAL VARIABILITY IT REPRESENTS, OUR PRELIMINARY ANALYSES INDICATE THAT BETWEEN 20% AND 40% OF THE OBSERVED SEQUENCES ARE NEW SCIENCE, AND AN ADDITIONAL 25% COULD BE CONSIDERED HIGHLY DIVERGENT TO KNOWN GENES, DESPITE THE EXPECTED HIGH RATE OF SEQUENCING AND ASSEMBLING ERRORS, THIS LARGE FRACTION OF NOVEL DATA IS STILL RECOGNISED TO CARRY CRUCIAL INFORMATION TO UNDERSTAND THE ECOLOGY OF MICROBIAL COMMUNITIES, THEIR RELATIONSHIPS WITH HOSTS AND ENVIRONMENTS, AND THE POTENTIAL FOR DISCOVERING NEW BIOMARKERS, NOVEL SEQUENCES, HOWEVER, ARE SYSTEMATICALLY NEGLECTED IN CURRENT METAGENOMIC STUDIES DUE TO THE LACK OF PROPER BIOINFORMATIC METHODS AND DEDICATED RESOURCES, WE ALSO HYPOTHESIZE THAT FUNCTIONAL INFORMATION FOR THE UNKNOWN METAGENOMIC SEQUENCES COULD BE ACQUIRED BY STUDYING THEIR CORRELATION WITH ECOLOGICAL DATA (I,E, PHYSICOCHEMICAL PARAMETERS OF THEIR ASSOCIATED HABITAT), CO-EXPRESSION PATTERNS WITH KNOWN GENES, AND EVOLUTIONARY ANALYSIS, HERE, WE WILL ADDRESS THE ANALYSIS OF ENVIRONMENTAL METAGENOMICS DATA FROM A GLOBAL PERSPECTIVE, WITH THE FOLLOWING SPECIFIC OBJECTIVES: I) BUILDING A COMPUTATIONAL FRAMEWORK TO IDENTIFY NOVEL GENE FAMILIES OUT OF METAGENOMICS AND METATRANSCRIPTOMICS DATA; II) CATALOGUING METAGENOMICS SEQUENCES BY THEIR GENE FAMILY AFFILIATION, FUNCTIONAL ANNOTATION AVAILABLE, PHYLOGENETIC PROFILE AND ECOLOGICAL DISTRIBUTION; III) PREDICTING FUNCTIONAL ROLES OF HYPOTHETICAL GENE FAMILIES BASED ON THE COLLECTED ECOLOGICAL AND GENOMIC CONTEXT DATA, AND IV) SEARCHING FOR POTENTIAL NEW BIOMARKERS IN SOIL AND OCEAN METAGENOMIC SAMPLES, METAGENOMICA\BIOINFORMATICA\ANOTACION FUNCIONAL