Descripción del proyecto
DataGEMS is a data discovery platform with Generalized Exploratory, Management, and Search capabilities. DataGEMS is built on the principles of data FAIRness, openness and re-use. It aims to seamlessly integrate data sharing, discovery and analysis into a system that addresses the whole data lifecycle, i.e., sharing, storing, managing, discovering, analyzing and reusing (data and/or metadata), bridging the gap between the data provider and the data consumer. DataGEMS is a next-generation data discovery and management ecosystem that engulfs different types of data (structured, unstructured, real-time and historical) and enables users to (a) enrich data through powerful data profiling mechanisms (b) seamlessly discover and analyze data across and within datasets using user-intuitive discovery and analysis mechanisms, such as using natural language and patterns, and (c) effectively explore and combine data with the help of stepwise guidance mechanisms during dataset discovery and analysis. The effective and efficient functioning of these mechansims will be powered by a data and model management layer that decouples data management at the low level from the data analytics at the higher level. DataGEMS is informed by and will be initially tested and deployed to promote data FAIRness and benefit diverse user communities and types of users on core domains: education, meteorology, and language data infrastructures.