Archives for 

Apache Hadoop

Hadoop Apache Flume Introduction

Apache-Flume Introduction Apache Flume –is the streaming framework in Hadoop family. The server log, twitter feeds, stock market share price movements are known for streaming data. These streaming data were earlier processed by conventional technologies or frame works. However, after Hadoop comes into existence Apache Flume is playing key role to process the streaming data. This video […]

The Future of Hadoop by Hadoop creator Doug Cutting

Hadoop Creator and Cloudera Chief Architect Doug Cutting Hadoop on Demand The current hot technology in the market is Big Data Hadoop  All Software persons are trying to move this technology because of high demand exists and they anticipates the demand for another 15 years are also promising. Hadoop the Apache open source frame work which was invented by Doug Cutting. This Stanford […]

In Map Reduce – Record reader importance

record-reader-in-map-reduce Hadoop is running on Hadoop Distributed File System (HDFS) that means it is based on Distributed computing. If one data set is passing through Hadoop system it is split as blocks. The default size is 64 MB in Hadoop system. It can be split as multiple of these default size that means 64 MB or […]

Relationship with Input Splits and HDFS Blocks

Input Splits and HDFS Blocks Hadoop is very dynamic technology which rules the technology world now. When you learning about Hadoop you have to understand the what is Hadoop. Hadoop is usually defined as the framework running on distributed data systems. The Data which is injected into Hadoop this data is divided as block of data and it stored in […]