Home Training Tutorials

Tutorials

How to become a Professional Data Scientist?

WHAT DO DATA SCIENTIST DO? They apply advanced math and statistics to build the technical cases around the hypotheses that the business analysts build. Data scientists are tasked with building the models required to test...

What is Data Quality?

INTRODUCTION Data Quality is an essential characteristic that determines the reliability of data for making decisions. Data quality help you identify revenue opportunities, meet regulatory compliance requirements and respond to customer issues in a timely manner. We’ve all heard of the many horrors...

What is Hadoop?

INTRODUCTION Apache Hadoop is an open source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. Hadoop is an Apache top-level project being built and used by a global...

What is Artificial Intelligence?

INTRODUCTION Artificial intelligence is approximating human reasoning more and more closely all the time. Wide-scale adoption by business may be approaching, with important implications for how people live and work. AI is paving the way for...

Free Data Visualization Books

INTRODUCTION If you want to work in exciting analytics and data visualization project, then these books are the starting point for you. Data is the currency of now and potential to use it the right way,...

Fundamentals of Artificial Neural Networks

INTRODUCTION Deep Learning (DL) and Neural Network (NN) is currently driving some of the most ingenious inventions in today’s century. Their incredible ability to learn from data and environment makes them the first choice of...

What is Open Data?

INTRODUCTION Open data is data that can be freely used, shared and built-on by anyone, anywhere, for any purpose. This is the summary of the full Open Definition which the Open Knowledge Foundation created in...

Big Data Solutions

INTRODUCTION With the demand for big data technologies expanding rapidly, Apache Hadoop is at the heart of the big data revolution. It is labelled as the next generation platform for data processing because of its...

History of Big Data

INTRODUCTION Did you know that 90% of the available data has been created in the last two years and the term Big Data has been around 2005, when it was launched by O’Reilly Media in...

What is Big Data?

The term 'Big Data' seems to be popping up everywhere these days. And there seems to be as many uses of this term as there are contexts in which you find it: ‘big data’...

POPULAR POST

SQL Engines for Hadoop: Hive vs Impala vs Spark

INTRODUCCION Hive, Impala and Spark SQL all fit into the SQL-on-Hadoop category. Apache Hive and Spark are both top level Apache projects. Impala is developed...

What is Big Data?