Pre-built data Lake (Hadoop)
Products
- Pre-built data Lake (Hadoop)
A repository for large quantities and verities of data, both Structured and Unstructured. It take advantage of commodity cluster computing techniques for massively scalable, low cost storage of data files in any format..
Data Lake Features:
Data Lake can serve as a staging area for the Data warehouse, the location for more carefully “treated” data for reporting and analysis in batch mode.
The Data Lake accepts input from various sources and can preserve both the original data fidelity and the lineage of data transformations.
Data models emerge with usage over time rather than being imposed up front.
Data Scientists use the Data Lake for discovery and Ideation.
Data generalists / programmers can tap the stream data for real-time analytics.