Category Archives: Data Analytics

Basic HBase Java Classes and Methods – Part 3: Table Creation

We will cover these basic steps: Instantiating a configuration object Establishing a connection to HBase Manipulating tables using an administration object Manipulating data within a table using a table instance Creating our table I am using Maven, and below is … Continue reading

Posted in Data Analytics | Tagged , | Leave a comment

Basic HBase Java Classes and Methods – Part 2: HBase Shell

For the purpose of these exercises we will be working with a basic table which as two column families.  The first column family is “personal” and will contain first_name, last_name, age, gender, martial_status.  The second column family is “professional” and … Continue reading

Posted in Data Analytics | Tagged , | Leave a comment

Basic HBase Java Classes and Methods – Part 1: Getting Started

This series of articles is to familiarize you with basic HBase Java classes and methods.  The goal of these articles is not for HBase best practices.  In fact, we will be making many compromises as we deploy on what is … Continue reading

Posted in Data Analytics | Tagged , | Leave a comment

Downgrading Apache Hadoop YARN to MapReduce v1

This post is somewhat dated material.  Several years back, when YARN was first making headways and vendors starting adopting it as part of Hadoop 2.x, there were many times where I needed to downgrade to MapReduce v1.  I had written … Continue reading

Posted in Data Analytics | Tagged | Leave a comment

Scaling data for Deep Learning

When building deep learning models, it can be very beneficial to scale your data.  Oftentimes data can have a huge range of unbounded values.  The goal of scaling is to bound these values.  Typically the activation functions of a neuron … Continue reading

Posted in Data Analytics | Tagged | Leave a comment