Beginning Apache Pig: Big Data Processing Made Easy

Beginning Apache Pig: Big Data Processing Made Easy

English | 29 Dec. 2016 | ISBN: 1484223365 | 300 Pages | PDF | 4.9 MB

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.
The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.
You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance.
What You Will Learn
Use all the features of Apache Pig
Integrate Apache Pig with other tools
Extend Apache Pig
Optimize Pig Latin code
Solve different use cases for Pig Latin
Who This Book Is For
All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators


[Fast Download] Beginning Apache Pig: Big Data Processing Made Easy

Ebooks related to "Beginning Apache Pig: Big Data Processing Made Easy" :
DevOps, DBAs, and DBaaS: Managing Data Platforms to Support Continuous Integration
Professional Sitecore 8 Development A Complete Guide to Solutions and Best Practices
Web Programming with PHP and MySQL: A Practical Guide
Pro Apache Phoenix: An SQL Driver for HBase
Pro Tableau A Step-by-Step Guide
SQL Server 2014 Development Essentials
Hadoop and Big Data: Introduction to Basics of Big data analytics
MOC 6231B Enu Beta Maintaining A Sql Server 2008 R2 Databaes Trainer HandBook Volume1-2
Admin911: SQL Server 2000
Research and Development in Intelligent Systems XXVII
Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.