Data Science – learn R or Python?

Hi Folks, I have a query around whether to learn R from scratch or should I leverage my basic python knowledge to extend into Data Science with scikit,numpy ,pandas? So I am bit confused … I am not shy to learn New programming language like R etc bur really need to know who edges out whom in market. Maybe i should learn R too along with Python so  your valuable opinion matters.             Also i […]

Read more

Data Science In The Cloud With DataJoy

DataJoy is an unbelievably fantastic way for a working data scientist to have their favorite tools at hand. I am a minimalist when it comes to being mobile, whether working on the road, traveling for leisure, and sometimes both. I do not like to keep files on my laptop and I do not, for the most part, like to worry about keeping updated applications on my laptop. I have tried as much as possible to push my life into the […]

Read more

Picking an Analytic Platform

Summary: Picking an analytic platform when first starting out in data science almost always means working with what we’re most comfortable.  But as organizations grow larger there is a need for standardization and for selecting one, or a few analytic tools.   Picking an analytic platform when first starting out in data science almost always means working with what we’re most comfortable.  That in turn almost always means whatever we used in college (or your certificate course) be it R, […]

Read more

Machine Learning – Anomaly Detection: “Finding a Needle in a Haystack”

After exploring formulation, classification, benchmarking, we explore another facet of Machine Learning: anomaly detection. This part is key in the IoT transformation, as it enables internet-connected AI devices to alert, adapt and respond accordingly. Once properly trained, an IoT could not only warn and prevent imminent failure, but also execute a response, adaptive to the anomaly detected. In this process, we’ll explore intrinsic hurdles that makes the anomaly detection process a non-trivial task of “finding a needle in haystack”. Opportunities abound to explore, and any univariate sequential […]

Read more

Characteristics of Good Visual Analytics and Data Discovery Tools

Visual Analytics and Data Discovery allow analysis of big data sets to find insights and valuable information. This is much more than just classical Business Intelligence (BI). See this article for more details and motivation: “Using Visual Analytics to Make Better Decisions: the Death Pill Exa…“. Let’s take a look at important characteristics to choose the right tool for your use cases. Visual Analytics Tool Comparison and Evaluation Several tools are available on the market for Visual Analytics and Data […]

Read more

What’s That Beer Style? Ask a Neighbor, or Two

Beer is delicious but it is not one thing. If you disagree with the former part of the previous sentence please keep the latter in mind[1]. Think of sports, for instance. Many would agree with the blanket statement “sports are fun” but depending on what you have in mind two people can easily have opposite reactions to being presented the opportunity to play ping-pong. Sports are not one thing, music is not one thing, and neither is beer. Presented with […]

Read more

R, Python or SAS: Which one should you learn first?

Python, R and SAS are the three most popular languages in data science. If you are new to the world of data science and aren’t experienced in either of these languages, it makes sense to be unsure of whether to learn R, SAS or Python. Don’t fret, by the time you’re done reading this article, you will know without a doubt which language is the right one for you. Overview R – R is the lingua franca of statistics. It is a […]

Read more

Naive Bayes Classification explained with Python code

Machine Learning is a vast area of Computer Science that is concerned with designing algorithms which form good models of the world around us (the data coming from the world around us). Within Machine Learning many tasks are – or can be reformulated as – classification tasks. In classification tasks we are trying to produce a model which can give the correlation between the input data  and the class  each input belongs to. This model is formed with the feature-values of the input-data. […]

Read more

What are the Big Guys Using?

Summary:  The largest companies utilizing the most data science resources are moving rapidly toward more integrated advanced analytic platforms.  The features they are demanding are evolving to promote speed, simplicity, quality, and manageability.  This has some interesting implications for open source R and Python widely taught in schools but significantly less necessary with these more sophisticated platforms.   We continue to be dazzled, and perhaps rightly so, by the advances in deep learning and question answering machines like Watson.  And […]

Read more
1 38 39 40 41 42 54