Beginning Big Data Analytics with Hadoop ! It Just Start !

12 Apr



The Apache Hadoop is a popular software framework that supports data-intensive distributed applications. It is a map-reduce implementation inspired by Google’s MapReduce and Google File System (GFS) papers. Hadoop is written in Java, and many large-scale applications and organizations (e.g. Yahoo and Facebook) use it to run large distributed computations on commodity hardware. It is designed in a way that it can scale up from a single server node to thousands of machines, each offering local computation and storage. All these features and capabilities make Hadoop the best candidate for developers dealing with big data.

Microsoft has long ago revealed its plans to partner with Hadoop provider Cloudera. This is some kind of a sign that Microsoft is ready to launch an ODBC driver for SQL Server that would support Linux. Another indication is that Microsoft is planning to develop connectors for Hadoop. It is definitely a big step forward to move into a rapidly growing space.