About this blogThis blog is mostly about my pursuits in Data Science. Previous blog entries also dealt with storage, compute, virtualization and professional services. Currently the focus is on Data Science, including Big Data, Hadoop, Business Intelligence, Data Warehouse, Data Integration and Visualization. From time to time I will blog about other things of interest. The opinions expressed in this blog are entirely my own and should not be taken as the opinion of my employer.
Tag Archives: r
Boxplots in R can be a bit tricky (ugly actually), but here is an example below to help. Below is the binomial distribution for p = 0.3, p = 0.5 and p = 0.7 with total number of trials n … Continue reading
In the visualization below, you can see some of the terms that people are talking about on Twitter regarding HIMMS13. The data below was pulled from Twitter on 2/7/2013 at 11:50pm using the twitteR package for R. Data from approximately … Continue reading
Data Analysis is increasingly becoming more and more important in healthcare. Hospitals are using Big Data to make decisions and improve patient care. At the same time, patients are using Big Data, or more likely the published results, to make … Continue reading