Sentiment Analysis in Python With TextBlob

Introduction State-of-the-art technologies in NLP allow us to analyze natural languages on different layers: from simple segmentation of textual information to more sophisticated methods of sentiment categorizations. However, it does not inevitably mean that you should be highly advanced in programming to implement high-level tasks such as sentiment analysis in Python. Sentiment Analysis The algorithms of sentiment analysis mostly focus on defining opinions, attitudes, and even emoticons in a corpus of texts. The range of established sentiments significantly varies from […]

Read more

Python: Check Index of an Item in a List

Introduction Lists are useful in different ways compared to other datatypes because of how versatile they are. In this article we’ll take a look at one of the most common operations with lists – finding the index of an element. We will take a look at different scenarios of finding an element, i.e. finding the first, last, and all occurrences of an element. As well as what happens when the element we’re looking for doesn’t exist. Using the index() Function […]

Read more

How to Iterate over Rows in a Pandas DataFrame

Introduction Pandas is an immensely popular data manipulation framework for Python. In a lot of cases, you might want to iterate over data – either to print it out, or perform some operations on it. In this tutorial, we’ll take a look at how to iterate over rows in a Pandas DataFrame. If you’re new to Pandas, you can read our beginner’s tutorial. Once you’re familiar, let’s look at the three main ways to iterate over DataFrame: items() iterrows() itertuples() […]

Read more

Matplotlib Scatter Plot – Tutorial and Examples

Introduction Matplotlib is one of the most widely used data visualization libraries in Python. From simple to complex visualizations, it’s the go-to library for most. In this tutorial, we’ll take a look at how to plot a scatter plot in Matplotlib. Import Data We’ll be using the Ames Housing dataset and visualizing correlations between features from it. Let’s import Pandas and load in the dataset: import pandas as pd df = pd.read_csv(‘AmesHousing.csv’) Plot a Scatter Plot in Matplotlib Now, with […]

Read more

Python: Slice Notation on String

Introduction The term slicing in programming usually refers to obtaining a substring, sub-tuple, or sublist from a string, tuple, or list respectively. Python offers an array of straightforward ways to slice not only these three but any iterable. An iterable is, as the name suggests, any object that can be iterated over. In this article, we’ll go over everything you need to know about Slicing Strings in Python. Slicing a String in Python There are a couple of ways to […]

Read more

Python: Slice Notation on List

Introduction The term slicing in programming usually refers to obtaining a substring, sub-tuple, or sublist from a string, tuple, or list respectively. Python offers an array of straightforward ways to slice not only these three but any iterable. An iterable is, as the name suggests, any object that can be iterated over. In this article, we’ll go over everything you need to know about Slicing Lists in Python. Slicing a List in Python There are a couple of ways to […]

Read more

Change Font Size in Matplotlib

Introduction Matplotlib is one of the most widely used data visualization libraries in Python. Much of Matplotlib’s popularity comes from its customization options – you can tweak just about any element from its hierarchy of objects. In this tutorial, we’ll take a look at how to change the font size in Matplotlib. Change Font Size in Matplotlib There are a few ways you can go about changing the size of fonts in Matplotlib. You can set the fontsize argument, change […]

Read more

How I used NLP (Spacy) to screen Data Science Resumes

Resume making is very tricky. A candidate has many dilemmas, whether to state a project at length or just mention the bare minimum whether to mention many skills or just mention his/her core competency skill whether to mention many programming languages or just cite a few whether to restrict the resume to 2 pages or 1 page These dilemmas are equally hard for Data Scientists looking for a change or even for aspiring Data Scientist. Now before you wonder where […]

Read more

Stocks, Significance Testing & p-Hacking: How volatile is volatile?

October is historically the most volatile month for stocks, but is this a persistent signal or just noise in the data? Stocks, Significance Testing & p-Hacking. Follow me on Twitter (twitter.com/pdquant) for more. Over the past 32 years, October has been the most volatile month on average for the S&P500 and December the least, in this article we will use simulation to assess the statistical significance of this observation and to what extent this observation could occur by chance. All code […]

Read more

Apache Kafka + KSQL + TensorFlow for Data Scientists via Python + Jupyter Notebook

Why would a data scientist use Kafka Jupyter Python KSQL TensorFlow all together in a single notebook? There is an impedance mismatch between model development using Python and its Machine Learning tool stack and a scalable, reliable data platform. The former is what you need for quick and easy prototyping to build analytic models. The latter is what you need to use for data ingestion, preprocessing, model deployment and monitoring at scale. It requires low latency, high throughput, zero data […]

Read more
1 14 15 16 17 18 54