Highlights from Machine Translation and Multilinguality in December 2023 and January 2024

Many things happened in the field in December: EMNLP, Google released Gemini, and Mixtral appeared. January was seemingly not that packed with new events, but plenty of new interesting work popped up on arXiv. Predicting Human Translation Difficulty with Neural Machine Translation Folks from the University of Melbourne found out that features from NMT, most notably the target sentence perplexity and something they call flow features, are a good predictor of human translation time. Turning English-centric LLMs Into Polyglots: How […]

Read more

Python Basics Exercises: Lists and Tuples

In Python Basics: Lists and Tuples, you learned that Python lists resemble real-life lists in many ways. They serve as containers for organizing and storing collections of objects, allowing for the inclusion of different data types. You also learned about tuples, which are also collections of objects. However, while lists are mutable, tuples are immutable. In this Python Basics Exercises course, you’ll test and reinforce your knowledge of Python lists and tuples. Along the way, you’ll also get experience with […]

Read more

Feature Relationships 101: Lessons from the Ames Housing Data

In the realm of real estate, understanding the intricacies of property features and their impact on sale prices is paramount. In this exploration, we’ll dive deep into the Ames Housing dataset, shedding light on the relationships between various features and their correlation with the sale price. Harnessing the power of data visualization, we’ll unveil patterns, trends, and insights that can guide stakeholders from homeowners to real estate developers. Let’s get started. Feature Relationships 101: Lessons from the Ames Housing DataPhoto […]

Read more

Building Enumerations With Python’s enum

Some programming languages, such as Java and C++, have built-in support for a data type called enumerations, commonly referred to as enums. Enums enable you to create sets of logically related constants that you can access through the enumeration itself. Unlike these languages, Python doesn’t have a dedicated syntax for enums. However, the Python standard library provides an enum module that offers support for enumerations through the Enum class. If you’re familiar with enums from other languages and wish to […]

Read more

Python News: What’s New From January 2024

In January 2024, Python 3.13.0a3 was released! With several exciting features, improvements, and optimizations, this release is the third of six planned alpha releases. During the alpha phase, features may be added up until the start of the beta phase on May 7. This is a pre-release, and you shouldn’t use it for production environments. However, it’s a great way to try out some new and exciting language features. The steering council had its elections last December, and the Python […]

Read more

Exploring Dictionaries, Classifying Variables, and Imputing Data in the Ames Dataset

The real estate market is a complex ecosystem driven by numerous variables such as location, property features, market trends, and economic indicators. One dataset that offers a deep dive into this complexity is the Ames Housing dataset. Originating from Ames, Iowa, this dataset comprises various properties and their characteristics, ranging from the type of alley access to the property’s overall condition. In this post, your aim is to take a closer look at this dataset using data science techniques. Specifically, […]

Read more

What’s Your Story: Ivan Tashev

IVAN TASHEV: So I’m born in a city called Yambol in Bulgaria, my origin country. The city [was] created 2,000 years B.C. and now sits on the two shores of the river called Tundzha. It always has been an important transportation and agricultural center in the entire region, and I grew up there in a family of two lecturers. My parents were teaching history. And they loved to travel. So everywhere I go, I had two excellent tourist guides with […]

Read more

Microsoft Research Forum: New series explores bold ideas in technology research in the era of AI

Microsoft Research Forum (opens in new tab) is a new series of conversations that explore recent advances, bold new ideas, and important discussions within the global research community. Leading Microsoft researchers will share insights into their work, followed by live online discussions with audience participants. This post provides an overview of the inaugural Microsoft Research Forum conversation, with a summary of each presentation. Full details, including the copilot experience (opens in new tab) and replays of each session (opens in […]

Read more

Python’s Format Mini-Language for Tidy Strings

When you’re doing string interpolation in your Python code, you often need to format the interpolated values to meet some formatting requirements. To do this, Python provides what is known as the format mini-language, which defines the syntax of a format specifier. Perhaps you’re comfortable working with strings, but you want to take even more control of them. With proficiency in the format mini-language, you’ll be able to use format specifiers to do things like formatting numbers as currency values, […]

Read more

Profiling your Numba code

If you’re writing numeric Python code, Numba can be a great way to speed up your program. By compiling a subset of Python to machine code, Numba lets you write for loops and other constructs that would be too slow in normal Python. In other words, it’s similar to Cython, C, or Rust, in that it lets you write compiled extensions for Python. Numba code isn’t always as fast as it could be, however. This is where profiling is useful: […]

Read more
1 57 58 59 60 61 908