A Gentle Introduction to Scikit-Learn: A Python Machine Learning Library

Last Updated on August 16, 2020

If you are a Python programmer, or you are looking for a robust library to bring machine learning into a production system, then scikit-learn is a library you will want to seriously consider. In this post you will get an overview of the scikit-learn library and useful references for where you can learn more. Kick-start your project with my new book Machine Learning Mastery With Python, including step-by-step tutorials and […]

Bootstrapping Machine Learning: An Upcoming Book on Prediction APIs

Last Updated on June 7, 2016

I came across an upcoming book that might interest you. It is titled Bootstrapping Machine Learning by Louis Dorard, PhD. A 40-page sample is provided and I enjoyed it. I think the final book will be a valuable read. Louis takes the position that machine learning is commoditized to the point where, if you are an application developer, you don’t need to learn machine learning algorithms, you only need to learn machine […]

The Data Analytics Handbook: Data Analysts and Data Scientists

Last Updated on June 7, 2016

What is the difference between a Data Analyst and a Data Scientist, and what type of work do they do all day? These questions and others like them are answered in the new free ebook The Data Analytics Handbook: Data Analysts and Data Scientists. The ebook was created by Brian Liou, Tristan Tao and Elizabeth Lin. Brian and Tristan are Computer Science + Statistics grads and run the blog statsguys. Although […]

How to Get Started with Machine Learning in Python

Last Updated on August 21, 2019

The Python conference PyCon 2014 was held recently and the videos for the conference are online. I have been working my way through the interesting machine learning ones and will share a few over the coming weeks. A great talk if you are starting out in data science or machine learning in Python was given by Melanie Warrick, titled How to Get Started with Machine Learning. It’s about 25 minutes long. The abstract […]

IPython from the shell to a book with a single tool with Fernando Perez

Last Updated on August 15, 2020

If you get serious about data analysis and machine learning in Python then you will make good use of IPython notebooks. In this post we will review some takeaway points made by Fernando Perez, the creator of IPython, in a keynote presentation at SciPy 2013. The title of the talk was IPython: from the shell to a book with a single tool; the method behind the madness. Kick-start your project with my new book Machine […]

The Seductive Trap of Black-Box Machine Learning

Last Updated on April 4, 2018

For as long as I have been participating in data mining and machine learning competitions, I have thought about automating my participation. Maybe it shows that I want to solve the problem of building the tool more than I want to solve the problem at hand. When working on a dataset, I typically spend a disproportionate amount of time thinking about algorithm tuning and running tuning experiments. I am prone to performing post-competition analysis […]

Best Programming Language for Machine Learning

Last Updated on September 27, 2016

A question I get asked a lot is: What is the best programming language for machine learning? I’ve replied to this question many times now, and it’s about time to explore it further in a blog post. Ultimately, the programming language you use for machine learning should reflect your own requirements and predilections. No one can meaningfully address those concerns for you. What Languages Are Being Used Before […]

How to Layout and Manage Your Machine Learning Project

Last Updated on June 7, 2016

Project layout is critical for machine learning projects just as it is for software development projects. I think of it like language. A project layout organizes thoughts and gives you context for ideas just like knowing the names for things gives you the basis for thinking. In this post I want to highlight some considerations in the layout and management of your machine learning project. This is very much related to the goals of […]

The Data Analytics Handbook: CEOs and Managers

Last Updated on August 15, 2020

In a previous blog post we looked at the ebook of interviews with data analysts and data scientists put together by Liou, Tao and Lin. In this blog post we look at the second book in the series, titled The Data Analytics Handbook: CEOs and Managers. What are managers looking for in a Data Analyst and a Data Science position, what skills do they require and how do […]

Lessons for Machine Learning from Econometrics

Last Updated on August 15, 2020

Hal Varian is the chief economist at Google and gave a talk to the Electronic Support Group at the EECS Department at the University of California at Berkeley in November 2013. The talk was titled Machine Learning and Econometrics and focused on what lessons machine learning can take away from the field of econometrics. Hal started out by summarizing a recent paper of his titled “Big Data: New Tricks for Econometrics” (PDF) which […]
