Hola,
¿eres nuevo aquí?

Regístrate gratis y conecta tu empresa con financiación pública, partners y proyectos.

Tengo cuenta

Regístrate

Ver video

IMAGINE

Financiado

Cerrado

IMAGINE Informing Multi modal lAnguage Generation wIth world kNowledgE

Deep neural networks have caused lasting change in the fields of natural language processing and computer vision. More recently, much effort has been directed towards devising machine learning models that bridge the gap between vi... ver más

23/04/2022

UvA

232K€

Presupuesto del proyecto: 232K€

Líder del proyecto

UNIVERSITEIT VAN AMSTERDAM No se ha especificado una descripción o un objeto social para esta compañía.

TRL 4-5

Fecha límite participación Sin fecha límite de participación.

Ver 3 Participantes

Financiación concedida El organismo H2020 notifico la concesión del proyecto el día 2022-04-23

MSCA-IF-2018: Individual Fellowships

I+D

Cerrada hace 6 años

0% 100%

Información adicional privada

No hay información privada compartida para este proyecto. Habla con el coordinador.

3 Participantes

UvA

232.39K€ | Lider

NEW YORK UNIVERSITY

NYU

144.61K€ | Participante

Conecta tu I+D

¿Tienes un proyecto y buscas un partner? Gracias a nuestro motor inteligente podemos recomendarte los mejores socios y ponerte en contacto con ellos. Te lo explicamos en este video

Proyectos interesantes

PCIN-2015-251 REPRESENTACION CONTINUA MULTI-MODAL Y MULTI-IDIOMA PARA LA C... 88K€ Cerrado

PCIN-2015-226 PROCESAMIENTO MULTI-MODAL DE EXPRESIONES ESPACIALES Y TEMPOR... 75K€ Cerrado

PID2020-113903RB-I00 KIT-IA: KNOWLEDGE-DRIVEN TECHNIQUES FOR INTELLIGENT APPLICAT... 51K€ Cerrado

MAGIC Multimodal Agents Grounded via Interactive Communication 158K€ Cerrado

GraViLa Graphs without Labels: Multimodal Structure Learning without... 1M€ Cerrado

CALCULUS Commonsense and Anticipation enriched Learning of Continuous... 2M€ Cerrado

Duración del proyecto: 36 meses Fecha Inicio: 2019-03-27
Fecha Fin: 2022-04-23

Líder del proyecto

UNIVERSITEIT VAN AMSTERDAM No se ha especificado una descripción o un objeto social para esta compañía.

TRL 4-5

Presupuesto del proyecto 232K€

Fecha límite de participación Sin fecha límite de participación.

Descripción del proyecto Deep neural networks have caused lasting change in the fields of natural language processing and computer vision. More recently, much effort has been directed towards devising machine learning models that bridge the gap between vision and language (V&L). In IMAGINE, I propose to lead this even further and to integrate world knowledge into natural language generation models of V&L. Such knowledge is easily taken for granted and is necessary to perform even simple human-like reasoning tasks. For example, in order to properly answer the question What are the children doing? about an image which shows parents with children playing in a park, a model should be able to (a) tell children from parents (e.g. children are considerably shorter), and infer that (b) because they are in a park, laughing, and with other children, they are very likely playing. Much of this knowledge is presently available in large-scale machine-friendly multi-modal knowledge bases (KBs) and I will leverage these to improve multiple natural language generation (NLG) tasks that require human-like reasoning abilities. I will investigate (i) methods to learn representations for KBs that incorporate text and images, as well as (ii) methods to incorporate these KB representations to improve multiple NLG tasks that reason upon V&L. In (i) I will research how to train a model that learns KB representations (e.g. learning that children are young adults and likely do not work) jointly with the component that understands the image content (e.g. identifies people, animals, objects and events in an image). In (ii) I will investigate how to jointly train NLG models for multiple tasks together with the KB entity linking, so that these models benefit from one another by sharing parameters (e.g. a model that answers questions about an image benefits from the training data of a model that describes the contents of an image), and also benefit from the world knowledge representations in the KB.

Conecta tu I+D

Entra hoy

¿Olvidé mi contraseña?

Financiación

Empresas

CTIs/Universidades

Proyectos

Investigadores