Cloud Data Lake
A cloud data lake is a repository of data in raw form, allowing multiple potential uses of data from a single load. It contains all types of data from unstructured machine generated IoT data, data from human interactions through emails, twitter feeds, videos, audios, semi structured data like JSON, XML, in addition to the structured data. Modern data lakes, when managed right, work as a great platform to easily store, load, integrate, and analyze data.
Key technology players like Microsoft and Amazon offer data lake platforms by utilizing their existing storage services. Azure Data Lake Storage (ADLS) Gen2 uses Azure Blob storage and ADLS Gen1 to provide a platform combining rich features of low cost storage tiers from the Blob storage and hierarchical file system (HFS) from the ADLS Gen1. Similarly, AWS uses the Amazon S3 storage along with some of its other services to define its data lake architecture.
Why consider cloud data lake?