Hadoop: The Definitive Guide

Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers...

Mô tả đầy đủ

Đã lưu trong:
Chi tiết về thư mục
Tác giả chính: White, Tom
Định dạng: Sách
Ngôn ngữ:English
Được phát hành: O'Reilly Media 2011
Những chủ đề:
Truy cập trực tuyến:http://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/25905
Các nhãn: Thêm thẻ
Không có thẻ, Là người đầu tiên thẻ bản ghi này!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
id oai:scholar.dlu.edu.vn:DLU123456789-25905
record_format dspace
spelling oai:scholar.dlu.edu.vn:DLU123456789-259052012-03-03T04:45:12Z Hadoop: The Definitive Guide White, Tom Tin học Database Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters. This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book. Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud Use Pig, a high-level query language for large-scale data processing Analyze datasets with Hive, Hadoop’s data warehousing system Take advantage of HBase, Hadoop’s database for structured and semi-structured data Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems 2011-09-20T06:54:40Z 2011-09-20T06:54:40Z 2011 Book 978-1-449-38973-4 http://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/25905 en application/pdf O'Reilly Media
institution Thư viện Trường Đại học Đà Lạt
collection Thư viện số
language English
topic Tin học
Database
spellingShingle Tin học
Database
White, Tom
Hadoop: The Definitive Guide
description Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters. This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book. Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud Use Pig, a high-level query language for large-scale data processing Analyze datasets with Hive, Hadoop’s data warehousing system Take advantage of HBase, Hadoop’s database for structured and semi-structured data Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems
format Book
author White, Tom
author_facet White, Tom
author_sort White, Tom
title Hadoop: The Definitive Guide
title_short Hadoop: The Definitive Guide
title_full Hadoop: The Definitive Guide
title_fullStr Hadoop: The Definitive Guide
title_full_unstemmed Hadoop: The Definitive Guide
title_sort hadoop: the definitive guide
publisher O'Reilly Media
publishDate 2011
url http://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/25905
_version_ 1757676463987884032