Pig provides a scripting language to execute MapReduce jobs as an alternative to writing Java code. Pig’s scripting language is called Pig Latin.
Who doesn’t love the Pig?! We love Apache Pig for data processing (to be more precise) it’s easy to learn, it works with all kinds of data and it plays well with Python, Java and other popular languages. With Pig’s ability to run on Hadoop it’s obviously well built for high-scale data science.
Whether you’re just getting started with Pig or you’ve already written a variety of Pig scripts, this compact reference guide gathers in one place many of the tools you’ll need to make the most of your data using Pig. Contents are organized as follows:
Data Types
Diagnostic Operators
Relational Operators
Syntax Tips
How to Use UDFs
Mathematical Functions
Eval Functions
String Functions
DateTime Functions
Bag and Tuple Functions
Load/Store Functions
SQL -> Pig Translations
Sources, Downloads & Additional Resources
Who doesn’t love the Pig?! We love Apache Pig for data processing (to be more precise) it’s easy to learn, it works with all kinds of data and it plays well with Python, Java and other popular languages. With Pig’s ability to run on Hadoop it’s obviously well built for high-scale data science.
Whether you’re just getting started with Pig or you’ve already written a variety of Pig scripts, this compact reference guide gathers in one place many of the tools you’ll need to make the most of your data using Pig. Contents are organized as follows:
Data Types
Diagnostic Operators
Relational Operators
Syntax Tips
How to Use UDFs
Mathematical Functions
Eval Functions
String Functions
DateTime Functions
Bag and Tuple Functions
Load/Store Functions
SQL -> Pig Translations
Sources, Downloads & Additional Resources