The Resource Programming Pig, Alan Gates and Daniel Dai

Programming Pig, Alan Gates and Daniel Dai

Label
Programming Pig
Title
Programming Pig
Statement of responsibility
Alan Gates and Daniel Dai
Creator
Contributor
Author
Subject
Genre
Language
eng
Summary
For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the Apache Pig scripting platform. With Pig, you can batch-process data without having to create a full-fledged application, making it easy to experiment with new datasets. Updated with use cases and programming examples, this second edition is the ideal learning tool for new and experienced users alike. You'll find comprehensive coverage on key features such as the Pig Latin scripting language and the Grunt shell. When you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig. Delve into Pig's data model, including scalar and complex data typesWrite Pig Latin scripts to sort, group, join, project, and filter your dataUse Grunt to work with the Hadoop Distributed File System (HDFS)Build complex data processing pipelines with Pig's macros and modularity featuresEmbed Pig Latin in Python for iterative processing and other advanced tasksUse Pig with Apache Tez to build high-performance batch and interactive data processing applicationsCreate your own load and store functions to handle data formats and storage mechanisms
Cataloging source
TEFOD
Dewey number
005.13/3
Illustrations
illustrations
Index
index present
LC call number
QA76.73.P54
Literary form
non fiction
Nature of contents
  • dictionaries
  • handbooks
Label
Programming Pig, Alan Gates and Daniel Dai
Publication
Copyright
Note
  • "Dataflow scripting with Hadoop"--Cover
  • Includes index
Antecedent source
unknown
http://library.link/vocab/branchCode
  • net
Carrier category
online resource
Carrier category code
cr
Carrier MARC source
rdacarrier
Color
multicolored
Content category
text
Content type code
txt
Content type MARC source
rdacontent
Contents
1. What is Pig? -- 2. Installing and running Pig -- 3. Pig's data model -- 4. Introduction to Pig Latin -- 5. Advanced Pig Latin -- 6. Developing and testing Pig Latin scripts -- 7. Making Pig fly -- 8. Embedding Pig -- 9. Writing evaluation and filter functions -- 10. Writing load and store functions -- 11. Pig on Tez -- 12. Pig and other members of the Hadoop community -- 13. Use cases and programming examples
Control code
ocn964523786
Dimensions
unknown
Edition
Second edition
Extent
1 online resource
File format
unknown
Form of item
online
Isbn
9781491937068
Media category
computer
Media MARC source
rdamedia
Media type code
c
Other physical details
illustrations
http://library.link/vocab/ext/overdrive/overdriveId
a12240b5-cab1-40c5-88f7-34ec1db5370c
Quality assurance targets
unknown
http://library.link/vocab/recordID
.b36491172
Sound
unknown sound
Specific material designation
remote
System control number
  • (OCoLC)964523786
  • safari1491937041

Library Locations

    • Deakin University Library - Geelong Waurn Ponds CampusBorrow it
      75 Pigdons Road, Waurn Ponds, Victoria, 3216, AU
      -38.195656 144.304955
Processing Feedback ...