Data Lake
A data lake is a centralized repository that stores structured and unstructured data at any scale.
It typically has at least three layers (zones):
Raw
Staging
Consumption
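The three zones above can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not a reference implementation: it assumes the raw zone holds data exactly as received, the staging zone holds cleaned and typed records, and the consumption zone holds aggregates ready for analytics. Local directories stand in for object storage, and every name here (`init_lake`, `ingest_raw`, `stage`, `publish`, `run_demo`, the `orders.csv` sample) is hypothetical.

```python
import csv
import json
import tempfile
from pathlib import Path

# Zone names follow the list above; the directory layout is an assumption.
ZONES = ("raw", "staging", "consumption")


def init_lake(root: Path) -> dict[str, Path]:
    """Create one directory per zone and return a zone -> path mapping."""
    paths = {zone: root / zone for zone in ZONES}
    for path in paths.values():
        path.mkdir(parents=True, exist_ok=True)
    return paths


def ingest_raw(paths: dict[str, Path], name: str, text: str) -> Path:
    """Land source data in the raw zone without modifying it."""
    target = paths["raw"] / name
    target.write_text(text, encoding="utf-8")
    return target


def stage(paths: dict[str, Path], raw_file: Path) -> Path:
    """Parse the raw CSV, skip rows with no amount, write typed JSON lines."""
    target = paths["staging"] / (raw_file.stem + ".jsonl")
    with raw_file.open(newline="", encoding="utf-8") as src, \
            target.open("w", encoding="utf-8") as dst:
        for row in csv.DictReader(src):
            if not row["amount"]:
                continue  # incomplete record stays only in the raw zone
            record = {"customer": row["customer"], "amount": float(row["amount"])}
            dst.write(json.dumps(record) + "\n")
    return target


def publish(paths: dict[str, Path], staged_file: Path) -> dict[str, float]:
    """Aggregate staged records into per-customer totals for consumers."""
    totals: dict[str, float] = {}
    with staged_file.open(encoding="utf-8") as src:
        for line in src:
            record = json.loads(line)
            totals[record["customer"]] = totals.get(record["customer"], 0.0) + record["amount"]
    target = paths["consumption"] / "totals_by_customer.json"
    target.write_text(json.dumps(totals, sort_keys=True), encoding="utf-8")
    return totals


def run_demo() -> dict[str, float]:
    """Push a tiny sample through raw -> staging -> consumption."""
    sample = "customer,amount\nann,10.5\nbob,\nann,4.5\nbob,3\n"
    with tempfile.TemporaryDirectory() as tmp:
        paths = init_lake(Path(tmp))
        raw_file = ingest_raw(paths, "orders.csv", sample)
        staged_file = stage(paths, raw_file)
        return publish(paths, staged_file)


if __name__ == "__main__":
    print(run_demo())
```

Keeping the raw copy untouched means the staging and consumption outputs can be rebuilt if the cleaning rules change later.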
For analytics, the data is queried with tools such as:
Google BigQuery
Apache Spark
Amazon Athena