
Title: "From Databases to Data Lakes"
Abstract: The enthusiasm surrounding data, and Big Data in particular, is raising new challenges for scientific and technological research. Exploiting data poses many difficulties. Its scope now extends beyond traditional structures such as databases and data warehouses to Data Lakes, and even to MDM (Modern Data Management). Moreover, most data is unstructured, which calls for new approaches to explore it. Existing processes must be rethought, and adapting them raises new scientific obstacles as soon as they are applied in a Big Data setting.
Data Lakes are today an emerging concept: how should an organization structure itself around data in order to rethink its innovation cycles? The concept also opens new avenues of investigation and promises real challenges that will allow information technologies to evolve toward new perspectives in the professional world, and it calls for new skills that future users will need to build up.
What is it really about?
Omar BOUSSAID is a full Professor of Computer Science at the Institute of Communication of Lyon 2 University in France. His main work is in the field of Business Intelligence (BI), and more specifically the storage and exploitation of complex data. His current research focuses on the evolution of BI in the Big Data environment. The research themes on which he leads research activities and supervises PhD theses include the management of massive data around Data Lakes, the design of distributed warehouses including NoSQL, Cloud BI, and semantic analysis through Text OLAP, Social OLAP and Graph OLAP. His work has been published in more than 150 articles in international journals and conferences. He is a member of the Program Committees of several international journals and conferences, and he has evaluated several research projects. He is a founding member of the EDA conference and a member of its Steering Committee. He is also a co-founder of the Maghrebian conference on Advanced Decision-Making Information Systems (ASD) and a member of its Steering Committee. Furthermore, he is the director of the Master's program in Business Intelligence & Big Data.
Omar.boussaid@univ-lyon2.fr ; http://eric.univ-lyon2.fr/~boussaid/