Innovating Works

RNADOMAIN

Financiado
Computational genomics of long noncoding RNA domains across metazoans
From junk DNA to genomic dark matter, the road to understanding RNAs that do not encode for proteins has been full of surprises. Compared to 19,000 protein-coding genes, recent estimates point that our genome contains between 25,0... From junk DNA to genomic dark matter, the road to understanding RNAs that do not encode for proteins has been full of surprises. Compared to 19,000 protein-coding genes, recent estimates point that our genome contains between 25,000 and 100,000 long noncoding RNA (lncRNA) genes. Far from being inert, some lncRNAs are involved in development and disease, particularly, cancer. It has also been shown that the function of a lncRNA can be associated with its localisation in subcellular compartments. Nevertheless, to experimentally characterize and validate interesting lncRNAs is an arduous task. Computational approaches based on machine learning could be designed to complement and scale-up such efforts. Based on recent experimental discoveries, it has been proposed that lncRNAs are separable into functional domains, and that these domains are intimately related to transposable elements and repeats. Nevertheless, how functions are encoded in primary RNA sequence is a fundamental unsolved problem. I propose to develop the first high-throughput computational approach to map lncRNA domains across metazoan genomes. Domains will be first identified according to statistical evidence supported by current biological insights. Putative domains will be queried against state-of-the-art databases on lncRNA function, localisation, and disease. Machine learning algorithms will then be employed to predict new functional domains, and new mechanistic insights will be offered for promising candidates. Lastly, the obtained maps will be stored and disseminated in a database, that will be regularly updated and readily accessible for the research community. This will be a foundational resource to finally shed light on the role of lncRNAs, their regulation and involvement in disease. ver más
30/06/2023
197K€
Perfil tecnológico estimado
Duración del proyecto: 39 meses Fecha Inicio: 2020-03-02
Fecha Fin: 2023-06-30

Línea de financiación: concedida

El organismo H2020 notifico la concesión del proyecto el día 2023-06-30
Línea de financiación objetivo El proyecto se financió a través de la siguiente ayuda:
Presupuesto El presupuesto total del proyecto asciende a 197K€
Líder del proyecto
UNIVERSITY COLLEGE DUBLIN NATIONAL UNIVERSITY... No se ha especificado una descripción o un objeto social para esta compañía.
Perfil tecnológico TRL 4-5