Hadoop Ecosystem

Actually, Spark Adds Power To Hadoop In Real-Time Processing

By |2016-10-31T14:21:27+00:00January 5th, 2016|Big Data|

Since Apache Spark came to existence in 2014, it received massive recognition and developer community just loved it, all for good reasons. Apache Spark is a fast, in-memory data processing engine with elegant development APIs to allow developers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets. However, [...]

Hive Query Performance Optimization On Hadoop For Big Data

By |2016-10-31T14:21:27+00:00March 31st, 2015|Big Data|

If you have been around big data for any length and worked on Hadoop, you have seen plenty of Pig and Hive. If you are even learning Hadoop and Big Data, Pig and Hive must seem to be the things you should know in order to have some control on big data. Well, that’s the [...]

Big Data And Hadoop – Features And Core Architecture

By |2016-10-31T14:21:27+00:00February 28th, 2015|Big Data|

The term Big Data is often used to denote a storage system where different types of data in different formats can be stored for analysis and driving business decisions. Big Data is an assortment of such a huge and complex data that it becomes very tedious to capture, store, process, retrieve and analyze it with [...]