Its been fun working on humongous data with an abstraction created by the realm of different projects under
hadoop.
PIG,
HIVE being the query language which provides even better idiots layers on top it. Aspects of keeping a cluster running, with managing data sizes with various compression initially driven by LZO (
elephant-bird) drifting towards block compression. Making ones life easier by using scheduler like
Azkaban. Humongous data and heavy loading, thats the way to go
No comments:
Post a Comment