Using Python for Big Data Workloads (Part 1)
In Part 2, we will look at Python for Spark (PySpark), Machine Learning, and deep learning in depth. In this first part, we’ll go over the basics, some examples, and some tutorials to get you started.
Get the latest Python for your environment — Linux, OSX, and even Windows are supported. There’s a debate whether to finally move to Python 3.x; try it and see if it works for all your tools. Since my Hadoop installation has Python 2.7, I am going to use that for my work.
via DZone.com Feed https://dzone.com
May 13, 2017 at 07:57AM