Data Lake์™€ Data Warehouse

  • Data Lake: ์›๋ณธ(raw) ๋ฐ์ดํ„ฐ๋ฅผ ์ €์žฅํ•˜๋Š” ๊ณต๊ฐ„. ํ˜•์‹, ๊ตฌ์กฐ, ๊ฐ€๊ณต ์—ฌ๋ถ€์— ๊ด€๊ณ„์—†์ด ๋ฐ์ดํ„ฐ๋ฅผ ๊ทธ๋Œ€๋กœ ๋ณด๊ด€. โ†’ ์˜ˆ: AWS S3, GCS, Azure Data Lake

  • Data Warehouse: ๊ฐ€๊ณต/์ •์ œ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„ ๊ฐ€๋Šฅํ•œ ๊ตฌ์กฐ๋กœ ์ €์žฅํ•˜๋Š” ๊ณต๊ฐ„. ์Šคํ‚ค๋งˆ ๊ธฐ๋ฐ˜, ์ฟผ๋ฆฌ ๋ฐ ๋ฆฌํฌํŒ…์— ์ตœ์ ํ™”. โ†’ ์˜ˆ: BigQuery, Redshift, Snowflake

ํ•ญ๋ชฉ
Data Lake
Data Warehouse

์ €์žฅ ํ˜•ํƒœ

๋น„์ •ํ˜•/๋ฐ˜์ •ํ˜•/์ •ํ˜• ๋ชจ๋‘

์ •ํ˜• (Schema-on-write)

๋ชฉ์ 

์ €์žฅ ๋ฐ ๋ฐฑ์—… ์ค‘์‹ฌ

๋ถ„์„ ๋ฐ ์‹œ๊ฐํ™” ์ค‘์‹ฌ

ํ™•์žฅ์„ฑ

๋งค์šฐ ๋†’์Œ (object storage ๊ธฐ๋ฐ˜)

์ƒ๋Œ€์ ์œผ๋กœ ๋‚ฎ์Œ

๋น„์šฉ

์ €๋ ด

์ƒ๋Œ€์ ์œผ๋กœ ๋น„์Œˆ

Last updated