Steps for effective text data cleaning (with case study using Python)

Introduction The days when one would get data in tabulated spreadsheets are truly behind us. A moment of silence for the data residing in the spreadsheet pockets. Today, more than 80% of the data is unstructured – it is either present in data silos or scattered around the digital archives. Data is being produced as we speak – from every conversation we have on social media to every piece of content generated by news sources. In order to produce any […]

Beginners Guide to Topic Modeling in Python

Introduction The analytics industry is all about obtaining “information” from data. With the growing amount of data in recent years, most of it unstructured, it is difficult to obtain the relevant and desired information. But technology has developed some powerful methods that can be used to mine the data and fetch the information we are looking for. One such technique in the field of text mining is topic modelling. As the name suggests, it is a process to […]

30 Questions to test a data scientist on Natural Language Processing [Solution: Skilltest – NLP]

Introduction Humans are social animals, and language is our primary tool for communicating with society. But what if machines could understand our language and then act accordingly? Natural Language Processing (NLP) is the science of teaching machines how to understand the language we humans speak and write. We recently launched an NLP skill test, for which a total of 817 people registered. This skill test was designed to test your knowledge of Natural Language Processing. If you are one […]

Natural Language Processing for Beginners: Using TextBlob

Introduction Natural Language Processing (NLP) is an area of growing attention due to the increasing number of applications like chatbots, machine translation, etc. In some ways, the entire revolution of intelligent machines is based on the ability to understand and interact with humans. I have been exploring NLP for some time now. My journey started with the NLTK library in Python, which was the recommended library to get started at that time. NLTK is a perfect library for education and research, it becomes […]

Learn how to Build your own Speech-to-Text Model (using Python)

Overview Learn how to build your very own speech-to-text model using Python in this article. The ability to weave deep learning skills with NLP is a coveted one in the industry; add this to your skillset today. We will use a real-world dataset and build this speech-to-text model, so get ready to use your Python skills! Introduction “Hey Google. What’s the weather like today?” This will sound familiar to anyone who has owned a smartphone in the last decade. […]

A Complete List of Important Natural Language Processing Frameworks you should Know (NLP Infographic)

Overview Here’s a list of the most important Natural Language Processing (NLP) frameworks from the last two years that you need to know. From Google AI’s Transformer to Facebook Research’s XLM/mBERT, we chart the rise of NLP through the lens of these seismic breakthroughs. Introduction Have you heard about the latest Natural Language Processing framework that was released recently? I don’t blame you if you’re still catching up with the superb StanfordNLP library or the PyTorch-Transformers framework! There has been […]

An Intuitive Understanding of Word Embeddings: From Count Vectors to Word2Vec

Introduction Before we start, have a look at the examples below. You open Google and search for a news article on the ongoing Champions Trophy and get hundreds of search results about it in return. Nate Silver analysed millions of tweets and correctly predicted the results of 49 out of 50 states in the 2008 U.S. presidential election. You type a sentence in English into Google Translate and get an equivalent Chinese translation. So what do the above examples have […]

Tutorial on Text Classification (NLP) using ULMFiT and fastai Library in Python

Introduction Natural Language Processing (NLP) needs no introduction in today’s world. It’s one of the most important fields of study and research, and has seen a phenomenal rise in interest in the last decade. The basics of NLP are widely known and easy to grasp. But things start to get tricky when the text data becomes huge and unstructured. That’s where deep learning becomes so pivotal. Yes, I’m talking about deep learning for NLP tasks – a still relatively less […]

A Step-by-Step NLP Guide to Learn ELMo for Extracting Features from Text

Introduction I work on different Natural Language Processing (NLP) problems (the perks of being a data scientist!). Each NLP problem is a unique challenge in its own way. That’s just a reflection of how complex, beautiful and wonderful the human language is. But one thing that has always been a thorn in an NLP practitioner’s side is the inability (of machines) to understand the true meaning of a sentence. Yes, I’m talking about context. Traditional NLP techniques and frameworks were great when […]

11 Superb Data Science Videos Every Data Scientist Must Watch

Overview Presenting 11 data science videos that will enhance and expand your current skillset. We have categorized these videos into three fields – Natural Language Processing (NLP), Generative Models, and Reinforcement Learning. Learn how the concepts in these videos work and build your own data science project! Introduction I love learning and understanding data science concepts through videos. I simply do not have the time to pore over books and pages of text to understand different ideas and topics. […]
