Hadoop in action

Hadoop is an open source distributed computing platform from the Apache Software Foundation. Built around the Hadoop Distributed File System (HDFS) and MapReduce (an open source implementation of Google's MapReduce), Hadoop gives users a distributed infrastructure that hides the low-level details of the system. HDFS offers high fault tolerance and high scalability, so users can deploy Hadoop on inexpensive hardware to form a distributed system.

The MapReduce distributed programming model lets users develop parallel applications without understanding the underlying details of a distributed system. Users can therefore easily pool computing resources with Hadoop to build their own distributed computing platform, and make full use of a cluster's computing and storage capacity to process massive data sets.
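To make the programming model concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps, using word counting as the example. It is plain Python, not the Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative assumptions, and on a real cluster the framework would run the shuffle step and distribute the work across machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: combine all values seen for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

The application author writes only the mapper and the reducer. Grouping by key and running the pieces in parallel is the part the model leaves to the platform.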

This book is a systematic, practical Hadoop handbook and reference. Its coverage is comprehensive: it explains the entire Hadoop technology stack, including not only the two core components, HDFS and MapReduce, but also Hadoop-related subprojects such as Hive, HBase, Mahout, Pig, ZooKeeper, Avro, and Chukwa. It is also strongly hands-on: each topic comes with many carefully designed, classic small examples that are easy to understand and easy to follow along with.

Table of contents

  • Hadoop overview
  • Hadoop installation and configuration
  • Hadoop application case analysis
  • MapReduce computing model
  • Developing MapReduce applications
  • MapReduce application examples
  • How MapReduce works
  • Hadoop I/O operations
  • HDFS
  • Hadoop management
  • Hive
  • HBase
  • Mahout
  • Pig
  • ZooKeeper
  • Avro
  • Chukwa
  • Common Hadoop plug-ins and their development

Spark big data analysis

Spark is still developing rapidly. Spark China Summit has been held, meetups take place in many cities, and the open source project keeps gaining momentum. Many companies have deployed and applied Spark at large scale. What Spark users need has shifted from initial deployment, installation, and running examples to building rich data analysis applications with Spark. Writing a practical, case-based technical book on Spark is an idea I have had for a long time. Because of a heavy workload, at first I only summarized the Spark cases I had taken part in or studied. Over time, however, I decided to abstract and simplify the common algorithms, system architectures, and application scenarios, which also serves as a summary to share.

Table of contents

  • Introduction to Spark
  • Spark development and environment configuration
  • BDAS overview
  • Lambda architecture log analysis pipeline
  • Recommendation system based on a cloud platform and user logs
  • Twitter sentiment analysis
  • Hot news analysis system
  • Building a distributed collaborative filtering recommendation system
  • Social network analysis based on Spark
  • Large-scale news topic analysis based on Spark
  • Building a distributed search engine

HBase

HBase is a NoSQL storage system designed for fast random reads and writes over large-scale data. It runs on commodity servers and scales smoothly to data sets with billions of rows and millions of columns.
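One way to picture how a table with millions of possible columns stays manageable is a sparse, row-key-sorted map in which each row stores only the columns it actually has. The toy class below is a sketch of that idea only. `ToyTable` and its methods are hypothetical names, everything lives in memory, and it is not the HBase client API.

```python
import bisect


class ToyTable:
    """In-memory stand-in for a sparse table sorted by row key (not the HBase API)."""

    def __init__(self):
        self._row_keys = []  # row keys kept sorted so range scans can use binary search
        self._rows = {}      # row key -> {column name -> value}; absent columns cost nothing

    def put(self, row_key, column, value):
        """Random write: set one cell, creating the row on first use."""
        if row_key not in self._rows:
            bisect.insort(self._row_keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key][column] = value

    def get(self, row_key, column):
        """Random read: return one cell, or None if the row or column is absent."""
        return self._rows.get(row_key, {}).get(column)

    def scan(self, start, stop):
        """Return (row key, columns) pairs with start <= row key < stop, in key order."""
        lo = bisect.bisect_left(self._row_keys, start)
        hi = bisect.bisect_left(self._row_keys, stop)
        return [(key, dict(self._rows[key])) for key in self._row_keys[lo:hi]]
```

For example, after `put("user1", "info:name", "Ann")`, a `get("user1", "info:email")` returns `None` without any storage having been reserved for that column.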

This book is an experience-based guide that teaches you how to design, build, and run big data applications with HBase. It is divided into four parts. The first two parts cover the history of distributed systems and large-scale data processing, the basic principles and patterns of HBase, and how to use its advanced features. The third part explores practical HBase techniques through real applications and code examples, along with the theory behind them. The fourth part explains how to take a prototype system to a full-fledged production system.

Table of contents

Part 1: HBase basics

  1. Introducing HBase
  2. Getting started
  3. Distributed HBase, HDFS, and MapReduce

Part 2: Advanced concepts

  1. HBase table design
  2. Extending HBase with coprocessors
  3. Alternative HBase clients

Part 3: Example applications

  1. HBase by example: OpenTSDB
  2. Querying a geographic information system on HBase

Part 4: Putting HBase to work

  1. Deploying HBase
  2. Operations

Principle, design and practice of distributed service architecture

This book uses today's popular distributed service architecture as its main thread to explain the principles, design, and practice of distributed service architecture.

The book first introduces the background and evolution of distributed service architecture, then explains in depth the design ideas and practical approaches for ensuring the consistency, high performance, and high availability of distributed services. Next, it describes the emergency response and troubleshooting process for large-scale, high-concurrency online services and presents an effective, commonly used tool set for finding and pinpointing problems. Finally, it covers in detail the containerization process and the tools for agile development and release in a distributed service architecture, which are convenient for developers working on high-concurrency service architectures.

Table of contents

  • Design principles of distributed microservices architecture
  • Thoroughly solving distributed system consistency problems
  • Capacity assessment and performance assurance for service-oriented systems
  • Building a big data logging system
  • Design and implementation of a service governance system based on call chains
  • Emergency response and troubleshooting for online Java services
  • The containerization process of services
  • Automated tools for Agile development 2

Netty in action

Netty is a Java framework for rapidly developing high-performance network applications. It encapsulates the complexity of network programming and makes the latest advances in networking and Web technology accessible to a wider range of developers than ever before. Netty is more than a collection of interfaces and classes; it also defines an architectural model and a rich set of design patterns. Until now, though, the lack of a comprehensive, systematic user guide has been a barrier to getting started with Netty, and this book aims to change that. Besides explaining the framework's components and APIs in detail, the book shows how Netty can help developers write more efficient, reusable, and maintainable code.
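One of the design patterns in question is a pipeline of small handlers, each doing one job on a message and passing the result to the next. The sketch below illustrates that pattern in plain Python. It is a simplified analogy, not Netty's Java API: `Pipeline`, `fire_read`, and the three handler functions are hypothetical names, and Netty's real pipeline also handles outbound events and runs on event loops.

```python
class Pipeline:
    """Ordered chain of handlers; each receives the previous handler's output."""

    def __init__(self, *handlers):
        self._handlers = list(handlers)

    def fire_read(self, message):
        """Pass an inbound message through every handler in order.

        A handler returns None to drop the message, which stops propagation.
        """
        for handler in self._handlers:
            message = handler(message)
            if message is None:
                return None
        return message


def decode(raw):
    """Decoder: turn raw bytes into text."""
    return raw.decode("utf-8")


def drop_blank(text):
    """Filter: strip whitespace and drop messages that end up empty."""
    text = text.strip()
    return text or None


def to_upper(text):
    """Business logic placeholder: transform the decoded message."""
    return text.upper()


pipeline = Pipeline(decode, drop_blank, to_upper)
```

Because each handler knows nothing about its neighbors, a decoder or filter can be reused in another pipeline unchanged, which is the kind of reusability the paragraph above refers to.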

Table of contents

Part 1: Netty concepts and architecture

  1. Netty: asynchronous and event-driven
  2. Your first Netty application
  3. Netty components and design
  4. Transports
  5. ByteBuf
  6. ChannelHandler and ChannelPipeline
  7. EventLoop and threading model
  8. Bootstrapping
  9. Unit testing

Part 2: Codecs

  1. The codec framework
  2. Built-in ChannelHandlers and codecs

Part 3: Network protocols

  1. WebSocket
  2. Broadcasting events with UDP

Part 4: Case studies

  1. Case studies, part 1
  2. Case studies, part 2
