Home Blog

2018 Top 10 Business Intelligence Trends

INTRODUCTION Whether you’re a data rockstar or an IT hero or an executive building your BI empire, these 10 Business Intelligence Trends could help take your organization to the next level. 1.- How Machine Learning Will...

How to become a Professional Data Scientist?

WHAT DO DATA SCIENTIST DO? They apply advanced math and statistics to build the technical cases around the hypotheses that the business analysts build. Data scientists are tasked with building the models required to test...

What is Data Quality?

INTRODUCTION Data Quality is an essential characteristic that determines the reliability of data for making decisions. Data quality help you identify revenue opportunities, meet regulatory compliance requirements and respond to customer issues in a timely manner. We’ve all heard of the many horrors...

The 2017 Big Data Landscape

IS BIG DATA STILL A THING? Observing that since Big Data is largely “plumbing”, it has been subject to enterprise adoption cycles that are much slower than the hype cycle.  As a result, it took...

What is a Graph Database?

INTRODUCTION We live in a connected world. There are no isolated pieces of information, but rich, connected domains all around us. Only a database that embraces relationships as a core aspect of its data model...

SQL vs NoSQL: High-Level Differences

INTRODUCTION Most of you are already familiar with SQL database, and have a good knowledge on either MySQL, Oracle, or other SQL databases. In the last several years, NoSQL database is getting widely adopted to...

Do you need a Relational Databases for Big Data ?

INTRODUCTION Teradata, Greenplum, Netezza, DB2, Oracle's Exadata aren't "Big Data" databases, as defined by meaning databases that are routinely used to handle large data sets that are unstructured, rapidly changing and usually with little or...

How to Choose the Right Chart

INTRODUCTION Every chart is trying to tell a story about your data, but people often run into problems trying to tell that story. Sometimes it’s incomplete, other times it’s misleading and often it’s confusing or...

Hadoop Ecosystem Table

INTRODUCTION The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using...

What is Hadoop?

INTRODUCTION Apache Hadoop is an open source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. Hadoop is an Apache top-level project being built and used by a global...

POPULAR POST

SQL Engines for Hadoop: Hive vs Impala vs Spark

INTRODUCCION Hive, Impala and Spark SQL all fit into the SQL-on-Hadoop category. Apache Hive and Spark are both top level Apache projects. Impala is developed...

What is Big Data?