# Data Lake

[What is a Data Lake?](https://aws.amazon.com/what-is/data-lake/)

## Overview

<figure><img src="https://static-xf1.vietnix.vn/wp-content/uploads/2021/06/data-lake-la-gi.webp" alt="" width="563"><figcaption><p>What is a data lake?</p></figcaption></figure>

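* A centralized repository that stores structured and unstructured data at any scale.
* Typically organized into at least three layers (zones):
  * Raw: data landed as received from the source, without transformation.
  * Staging: data that has been cleaned, validated, and normalized.
  * Consumption: curated data that is ready for analytics and reporting.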
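
The three zones can be illustrated with a minimal sketch that uses local directories in place of object storage. Everything here is an assumption made for illustration: the helper names (`init_lake`, `ingest_raw`, `promote_to_staging`, `publish_to_consumption`), the `orders.jsonl` file, and the `user` / `amount` fields are hypothetical, and a real lake would typically sit on a store such as Amazon S3 rather than a temp folder.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, List

# Zone names taken from the list above; the directory layout is an assumption.
ZONES = ("raw", "staging", "consumption")


def init_lake(root: Path) -> Dict[str, Path]:
    """Create one directory per zone and return a name -> path mapping."""
    paths = {zone: root / zone for zone in ZONES}
    for path in paths.values():
        path.mkdir(parents=True, exist_ok=True)
    return paths


def ingest_raw(zones: Dict[str, Path], name: str, lines: List[str]) -> Path:
    """Land data in the raw zone exactly as received, with no validation."""
    target = zones["raw"] / name
    target.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return target


def promote_to_staging(zones: Dict[str, Path], name: str) -> Path:
    """Parse raw lines, skip malformed ones, and normalize field types."""
    records = []
    raw_text = (zones["raw"] / name).read_text(encoding="utf-8")
    for line in raw_text.splitlines():
        try:
            rec = json.loads(line)
            records.append({"user": str(rec["user"]), "amount": float(rec["amount"])})
        except (json.JSONDecodeError, KeyError, TypeError, ValueError):
            continue  # hypothetical policy: drop rows that fail validation
    target = zones["staging"] / name
    body = "\n".join(json.dumps(r) for r in records) + "\n"
    target.write_text(body, encoding="utf-8")
    return target


def publish_to_consumption(zones: Dict[str, Path], name: str) -> Dict[str, float]:
    """Aggregate staged records into per-user totals for analytics."""
    totals: Dict[str, float] = {}
    staged = (zones["staging"] / name).read_text(encoding="utf-8")
    for line in staged.splitlines():
        if not line:
            continue
        rec = json.loads(line)
        totals[rec["user"]] = totals.get(rec["user"], 0.0) + rec["amount"]
    target = zones["consumption"] / "totals_by_user.json"
    target.write_text(json.dumps(totals, sort_keys=True), encoding="utf-8")
    return totals


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        zones = init_lake(Path(tmp))
        ingest_raw(zones, "orders.jsonl", [
            '{"user": "a", "amount": 10}',
            "not valid json",
            '{"user": "b", "amount": "2.5"}',
        ])
        promote_to_staging(zones, "orders.jsonl")
        print(publish_to_consumption(zones, "orders.jsonl"))
```

The raw file is never modified after landing, so the staging and consumption outputs can be rebuilt from it if the validation or aggregation rules change.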

### Use cases

* Analytics: querying the data with analytics tools such as
  * Google BigQuery
  * Apache Spark
  * Amazon Athena

## Trivia
