General Training Session: Optimizing Hadoop Environments


Big Data depends on supercomputing environments capable of storing and processing massive amounts of data. Hadoop is the most prominent Big Data platform in production today, and its architecture provides tremendous capabilities. Take a deep dive into this architecture to discover how it really works, and use that knowledge to gain insight into data lifecycle management and job optimization for both MapReduce and Spark compute jobs. Improve your value to the Big Data team by learning the functions and systems of Hadoop.


Pushing data from small to large to huge summarizes Will’s 20-year career in technology. A natural educator, he shares his enthusiasm for Hadoop through clear explanations and acute insights, wrapped in humorous anecdotes from the many lessons he has learned over the years. Will currently works in the center of the Big Data vortex at Hortonworks, where he travels the globe teaching Hadoop engineers.

Open Data Science
One Broadway
Cambridge, MA 02142
