How to One Hot Encode Sequence Data in Python - useful little write up that explains very simply what a one hot encoding actually is vs say an integer encoding (not as crazy as it sounds but for some reason all other tutorials add a lot of fluff trying to describe it). Good luck on your Machine Learning quest for knowledge! http://machinelearningmastery.com/how-to-one-hot-encode-sequence-data-in-python/