This article is a guide to setting up a Hadoop cluster. The cluster runs on local CentOS virtual machines in VirtualBox. I use it as a local environment for development and testing. I followed many of the steps Austin Ouyang laid out in the blog post here. Hopefully, next I can document moving these …