
A data lake, named by analogy with a natural lake, is a centralized repository that stores data of any format and structure. Beyond learning from data, however, we face the problem of storing and managing it in a cost-effective and reliable manner. Big data technologies currently offer multiple solutions and tools for the semantic analysis of heterogeneous data and for making such data accessible and reusable. The sheer volume of miscellaneous data nonetheless makes the generation of new knowledge a complex analytical process.

The realm of big data has opened new avenues for knowledge acquisition, but it has also brought major challenges, including data interoperability and effective data management.
