|
Post by Admin on Mar 15, 2014 19:21:12 GMT
Ease of programming:- It is trivial to achieve parallel execution of simple, “EMBRASSINGLY parallel” data analysis tasks.
Complex tasks comprised of multiple interrelated data transformations are explicitly encoded as data flow sequences, making them easy to write, understand and maintain. Pig takes care of …. •Schema and type checking. •Translating into efficient physical data flow. •(I.E sequence of one or more map reduce jobs) •Exploiting data reduction opportunities. •(EG early partial aggregation via a combiner) •Executing the system-level data flow. •(I.E running the map reduce jobs) •Tracking progress, errors, ETC.
|
|