Tutorials

What is Data Governance?

INTRODUCTION Data governance is a set of processes that ensures that important data assets are formally managed throughout the enterprise. Data governance ensures that data can be trusted and that people can be made accountable...

POPULAR POSTS

SQL Engines for Hadoop: Hive vs Impala vs Spark

INTRODUCTION Hive, Impala and Spark SQL all fit into the SQL-on-Hadoop category. Apache Hive and Spark are both top-level Apache projects. Impala is developed...

What is Big Data?