Developer Review – Machine Learning, Drew & John, O’Reilly

Introduction

It is very interesting to read a hard core computing book from authors those are not in the computing industry.  Their area of studies really make them as right choice for writing this book.  Drew Conway studies international relations, conflicts and terrorism using tools of mathematics, statistics and computer science.  John Myles White in “how humans make decisions”.  Obviously, both are Ph.D students.

Machine learning is one of the grooming area in the computing world which is actually a branch of artificial intelligence.  Based on the given data, we can capture characteristics of interest of our unknown underlying probability distribution.  This book thoroughly covers  various disciplines such as classification, ranking, regression, regularization, optimization, etc  with practical examples using “R” language.

In Detail

If you are a programmer, you may little bit astonished first time to use R.  Later, you will understand that “R” is not the language of programming, but excellent companion for people in statistical field.  So, you may not fully comfortable with “R” after reading the first chapter “Using R”.  Authors also mentioned that “R remains a relatively niche language, even among experienced programmers”.  However, It would be good if they explain R syntax and its usage crisp and short in the Appendix section.

The chapter 1 (Using R) and 2 (Data Exploration) make you start the journey slowly with many good theories on data analysis.  The visual explanation of “data as rectangle”, MxN matrix of data into single row or single column view are good learning.  Inferring data is another good point.   Wherever required,visual representations come for you to understand.  Otherwise, they comfortably explains the concepts textually.  Agile people should be patience on this.

“Chapter 3Classification : Spam filtering” make your journey at the highest speed, with lot of interesting turns.  After two chapters with academical effect, this chapter make you feel like reading fictions. They starts with an example of how can you predict a person is man or woman based on the weight and height.  They called this mechanism as “separating hyperplane” and also explained a way of taking decision called “kernel trick”.

“Chapter 4 Ranking: Priority Inbox” is another chart buster which provides you some good knowledge on “sorting with unknown ordering”.  GMail’s priority inbox and Amazon’s book recommendation are few examples.
“Chapter 5 Regression: Predicting Page Views” thoroughly explained the regression concepts and linear regression.  He frightened us with an example of smoking habits and their longevity. :)
The remaining chapters gives you the same good experience on concepts like Regularization, Optimization.  Chapter 8 takes you another level and teaches you unsupervised learning.

Conclusion

When you have enough time on the week-end and want to learn truly some interesting and futuristic concepts in computing.  Do read this book followed by working out the examples.  If you are serious developers and coding is your passion, then this book will take you to some level up and incite your innovative ideas for your products.  For academic people, this should be one of the paper in your course.  A very good book from O’Reilly  by actual field experienced authors.  You can buy this book at http://shop.oreilly.com/product/483.do or .
Mark this!
  • Digg
  • del.icio.us
  • Add to favorites
  • RSS