Introduction to Pig

Take our free course

Pig Course

Introduction to Pig

with Warren Pettit

Audience:
Pig Beginners

Time to complete:
4 Hours

Available in:
English

Pig was initially developed at Yahoo! to allow people using Hadoop® to focus more on analyzing large data sets and spend less time having to write mapper and reducer programs. Like actual pigs, who eat almost anything, the Pig programming language is designed to handle any kind of data—hence the name!

Pig is made up of two components: the first is the language itself, which is called PigLatin, and the second is a runtime environment where PigLatin programs are executed.

This course begins with an overview of Pig. It explains the data structures supported by Pig and how to access data using the LOAD operator. The next lesson covers the Pig relational operators. This is followed by the Pig evaluation functions, as well as math and string functions.

Big Data University has been chosen by IBM as one of the issuers of badges as part of the IBM Open Badge program. Share your achievements through LinkedIn, Facebook, Twitter, and more!

Big Data University leverages the services of Pearson VUE Acclaim to assist in the administration of the IBM Open Badge program.  By enrolling into this course, you agree to Big Data University sharing your details with Pearson VUE Acclaim for the strict use of issuing your badge upon completion of the badge criteria.

The labs for this course were recently updated and tested using BigInsights Quick Start 4.1

 

Course Syllabus

  • Pig Basics
  • Pig Relational Operators
  • Pig Evaluation Operators and Other Functions

General Information

  • This course is free.
  • It is self-paced.
  • It can be taken at any time.
  • It can be taken as many times as you wish.
  • Labs can be performed on the Cloud, or using a 64-bit system. If using a 64-bit system, you can install the required software (Linux-only), or use the supplied VMWare image. More details are provided in the section "Labs setup".
  • Students passing the course (by passing the final exam) will have immediate access to printing their online certificate of achievement. Your name in the certificate will appear exactly as entered in your profile in BigDataUniversity.com.
  • If you did not pass the course, you can take it again at any time.

Recommended skills prior to taking this course

  • Basic Linux Operating System knowledge
  • Basic programming skills

Grading Scheme

  • The minimum passing mark for the course is 60%, where the final test is worth 100% of the course mark.
  • You have 3 attempts to take the test.