Few words on Natural Language Processing and User Autonomy

As natural language processing (NLP) finds its way from university labs and becomes a crucial element of many user-facing technologies (machine translation, search, language-model-based assistants), people start to get concerned about the ethics of this technology. When people talk about NLP ethics, the main topics are: biases that the models get from training data, replication of toxic behavior found on the Internet, underrepresentation of already underprivileged groups, differences between the technology availability between the global north and global south. Now, […]

Read more

Using OrderedDict in Python

Sometimes you need a Python dictionary that remembers the order of its items. In the past, you had only one tool for solving this specific problem: Python’s OrderedDict. It’s a dictionary subclass specially designed to remember the order of items, which is defined by the insertion order of keys. This changed in Python 3.6. The built-in dict class now keeps its items ordered as well. Because of that, many in the Python community now wonder if OrderedDict is still useful. […]

Read more

Python News: What’s New From March 2023

In March 2023, Python 3.12.0 alpha 6 was released! With several exciting new features, improvements, and optimizations, this release is a must-try for all Python enthusiasts. If you want to stay on the cutting edge of Python development, then you should really take it for a spin. But keep in mind that this release isn’t recommended for production. The Python community has been buzzing with activities and events. PyCascades 2023 took place in Vancouver, British Columbia. The event featured talks […]

Read more

Polars for initial data analysis, Polars for production

Initial data analysis (IDA) has different goals than your final, production data analysis: With IDA you need to examine the initial data and intermediate results, check your assumptions, and try different approaches. Exploratory data analysis has similar requirements. Once you’re happy with your approach, and you’re ready to run the analysis in an automated manner, you care a lot more about speed and resource usage. These different goals often benefit from different implementation strategies and tools—unless you have a sufficiently […]

Read more

Highlights from Machine Translation and Multilinguality in March 2023

Here is what I found the most interesting in MT and multilinguality in March I only feature two papers (both from Microsoft, co-incidence), not because there were too few on arXiv, but because I did not manage to read that much this month. In this paper, folks from Microsoft in India experiment with zero-shot crosslingual transfer for classification. They use a multi-task learning setup. Besides performing the task in the source language, they teach the model using a two-player game. […]

Read more

Python 3.12 Preview: Ever Better Error Messages

Python 3.12 will be released in October 2023. Even though October is still months away, you can already preview some of the upcoming features, including how Python 3.12 will offer even more readable and actionable error messages. In this tutorial, you’ll: Experience the improved communication in various error situations Learn about the background and limitations of these changes Peek into the CPython source code of the pull requests that made these error messages sparkle There are many other improvements and […]

Read more
1 100 101 102 103 104 913