About this blog
This blog is mostly about my pursuits in Data Science. Previous blog entries also dealt with storage, compute, virtualization and professional services. Currently the focus is on Data Science, including Big Data, Hadoop, Business Intelligence, Data Warehouse, Data Integration and Visualization. From time to time I will blog about other things of interest. The opinions expressed in this blog are entirely my own and should not be taken as the opinion of my employer.Blogroll
Tag Archives: setNumMapTasks()
Changing MapReduce number of Mappers in Hadoop .20.2
In earlier releases of Hadoop you could change the number of mappers by setting: setNumMapTasks() You did this using JobConf. Things in Hadoop .20.2 have migrated to using the Job class instead of JobConf. Although setNumReduceTasks() is still valid, setNumMapTasks() … Continue reading