Category Archives: Data Analytics

Custom Pytorch Dataset Class for Timeseries Sequence Windows

Recently I was working on a time series problem. Time series can quickly add up to a lot of data, because you use previous intervals to predict future intervals. What some people do is create a very …
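
The excerpt is cut off before it shows the post's own approach, so the following is only a minimal sketch of the general idea of using previous intervals to predict future ones. It assumes windows are sliced on demand rather than stored up front. The class name `WindowDataset` and the parameters `history` and `horizon` are hypothetical, and the class is plain Python that merely follows the `__len__`/`__getitem__` shape of a map-style PyTorch dataset.

```python
class WindowDataset:
    """Lazily sliced (history, target) windows over a single series.

    Hypothetical sketch: the names and parameters are illustrative and are
    not taken from the original post.
    """

    def __init__(self, series, history, horizon=1):
        if history < 1 or horizon < 1:
            raise ValueError("history and horizon must be >= 1")
        self.series = series
        self.history = history  # number of past intervals used as input
        self.horizon = horizon  # number of future intervals to predict

    def __len__(self):
        # Count the start positions that leave room for a full window.
        return max(0, len(self.series) - self.history - self.horizon + 1)

    def __getitem__(self, idx):
        if not 0 <= idx < len(self):
            raise IndexError(idx)
        split = idx + self.history
        # Slice on demand instead of materialising every window in memory.
        return self.series[idx:split], self.series[split:split + self.horizon]


series = list(range(10))
ds = WindowDataset(series, history=3, horizon=2)
first_inputs, first_targets = ds[0]
```

Only the original series is kept in memory here; each window is built when it is requested, which is one way to avoid holding a large array of overlapping copies.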

Finding the ideal num_workers for Pytorch Dataloaders

One of the biggest bottlenecks in deep learning is loading data. Having fast drives and fast access to the data is important, especially if you are trying to saturate a GPU or multiple processors. PyTorch has DataLoaders, which help you manage …

Basic HBase Java Classes and Methods – Part 8: Disable and Delete a Table

To delete a table in HBase, it must be disabled first. This forces any data in memory to be flushed to disk. Because this is an admin operation, we must create an Admin object, similar to how we …

Basic HBase Java Classes and Methods – Part 7: Delete from a Table

Deleting data from an HBase table is very similar in overall structure to many of our previous operations. First we have our general skeleton code. As before, we use static variable declarations to make the code easier to read. …

Basic HBase Java Classes and Methods – Part 6: Scan a Table

In HBase, a Scan is similar to a SELECT in SQL. Again we return to skeleton code that is very similar to what we have seen before. I will put comments into the three areas we will be addressing: …
