Image captioning is the process of mapping a visual scene to a short textual description. Automating this process is vital for many computer applications, including information retrieval from visual data, computerized assistance t...
ver más
¿Tienes un proyecto y buscas un partner? Gracias a nuestro motor inteligente podemos recomendarte los mejores socios y ponerte en contacto con ellos. Te lo explicamos en este video
Proyectos interesantes
PCIN-2013-047
VISUAL SENSE, ETIQUETADO DE INFORMACION VISUAL CON DESCRIPCI...
130K€
Cerrado
PID2021-125051OB-I00
RECOLECCION DE DATOS VISUALES: PERMITIENDO LA VISION POR COM...
117K€
Cerrado
TIN2015-70924-C2-2-R
CONTEXTUALIZACION DE CONTENIDOS EN EL RECONOCIMIENTO DE IMAG...
86K€
Cerrado
PID2019-104174GB-I00
TRANSFERENCIA DE CONOCIEMIENTO PARA REPRESENTACIONES EN REDE...
78K€
Cerrado
EIN2019-103240
DOCTORADO CONJUNTO EN TRATAMIENTO DE IMAGENES Y VISION ARTIF...
14K€
Cerrado
Información proyecto ROCAP
Duración del proyecto: 22 meses
Fecha Inicio: 2022-08-12
Fecha Fin: 2024-06-30
Líder del proyecto
UNIVERSITEIT UTRECHT
No se ha especificado una descripción o un objeto social para esta compañía.
TRL
4-5
Presupuesto del proyecto
150K€
Fecha límite de participación
Sin fecha límite de participación.
Descripción del proyecto
Image captioning is the process of mapping a visual scene to a short textual description. Automating this process is vital for many computer applications, including information retrieval from visual data, computerized assistance to visually impaired people, and automatic tour guiding. State-of-the-art captioning systems are limited by their heavy reliance on visual contents. As a result, generated captions are often purely descriptive and miss important information that is needed in order to understand the image. This PoC project develops a captioning tool that will be useful for knowledge-intensive areas like Geography, Radiology or Art History, where captions need to include information that cannot be extracted from images alone. It builds on results of the ROCKY ERC AdG project, whose innovative captioning system integrates external knowledge into the captioning process. This allowed the ROCKY project to employ standard methods of image captioning, with a deep convolutional neural network (CNN) for image understanding and a Transformer network for language generation. Thanks to the external knowledge integration, the ROCKY captioning prototype gets substantially closer to human-generated captions than standard captioning systems that do not take external knowledge into account. This PoC project will use this result by implementing a knowledge-aware captioning system that is scalable for practical purposes. The project examines the feasibility of the ROCKY captioning method for Medical Imaging and Art History and implements it for one of these domain as a use case. The project will engage with experts in these domains, specify a practical captioning system, implement it as an open-source tool and test it in realistic situations. The anticipated value of this effort is in the development of a general method that would allow one open-source platform to be multi-purpose, thereby cost-effectively adjustable to needs of different domains.