Results 1 to 1 of 1
  1. #1
    Hot Member
    Join Date
    Feb 2012


    Default Hadoop: The Definitive Guide (Early Release)

    Follow us on Social Media

    Hadoop: The Definitive Guide (Early Release)

    Hadoop: The Definitive Guide (Early Release) By Tom White
    Publisher: O'R[eil]ly M[e.di]a; 3 edition 2012 | 630 Pages | ISBN: 1449311520 | EPUB + PDF | 6 MB + 8 MB

    Ready to unleash the power of your massive dataset? With the latest edition of this comprehensive resource, youíll learn how to use Apache Hadoop to build and maintain reliable, scalable, distributed systems. Itís ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
    This third edition covers recent changes to Hadoop, including new material on the new MapReduce API, as well as version 2 of the MapReduce runtime (YARN) and its more flexible execution model. Youíll also find illuminating case studies that demonstrate how Hadoop is used to solve specific problems.

    Store large datasets with the Hadoop Distributed File System (HDFS), then run distributed computations with MapReduce
    Use Hadoopís data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
    Discover common pitfalls and advanced features for writing real-world MapReduce programs
    Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
    Use Pig, a high-level query language for large-scale data processing
    Analyze datasets with Hive, Hadoopís data warehousing system
    Load data from relational databases into HDFS, using Sqoop
    Take advantage of HBase, the database for structured and semi-structured data
    Use ZooKeeper, the toolkit for building distributed systems




Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts