Research Focus: Week of August 12, 2024

Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. EVENT Register now for Research Forum on September 3 Discover what’s next in the world of AI at Microsoft Research Forum (opens in new tab), an event series that explores recent research advances, bold new ideas, and important discussions with  

Read more

Sorting Dictionaries in Python: Keys, Values, and More

You’ve got a dictionary, but you’d like to sort the key-value pairs. Perhaps you’ve tried passing a dictionary to the sorted() function but didn’t receive the results you expected. In this video course, you’ll go over everything you need to know to sort dictionaries in Python. In this video course, you’ll: Review how to use the sorted() function Learn how to get dictionary views to iterate over Understand how dictionaries are cast to lists during sorting Learn how to specify […]

Read more

The Strategic Use of Sequential Feature Selector for Housing Price Predictions

To understand housing prices better, simplicity and clarity in our models are key. Our aim with this post is to demonstrate how straightforward yet powerful techniques in feature selection and engineering can lead to creating an effective, simple linear regression model. Working with the Ames dataset, we use a Sequential Feature Selector (SFS) to identify the most impactful numeric features and then enhance our model’s accuracy through thoughtful feature engineering. Let’s get started. The Strategic Use of Sequential Feature Selector […]

Read more

Building a Simple RAG Application Using LlamaIndex

Image by Author In this tutorial, we will explore Retrieval-Augmented Generation (RAG) and the LlamaIndex AI framework. We will learn how to use LlamaIndex to build a RAG-based application for Q&A over the private documents and enhance the application by incorporating a memory buffer. This will enable the LLM to generate the response using the context from both the document and previous interactions. What is RAG in LLMs? Retrieval-Augmented Generation (RAG) is an advanced methodology designed to enhance the performance […]

Read more

5 Free Podcasts That Demystify Machine Learning Concepts

Image by Editor | Midjourney Machine learning (ML) has become a buzzword in recent years, with applications ranging from voice assistants to self-driving cars. Yet, for many, the inner workings of these technologies remain a mystery. Podcasts offer a great way to learn about this field without getting overwhelmed. They break down complex ideas into simpler terms and let you learn at your own pace. In this article, I will share 5 of my favorite ML podcasts, which excel at […]

Read more

Python News Roundup: August 2024

In July, there was some exciting news for the Python community as the Python core development team released versions 3.13.0b4 and 3.13.0rc1 of the language. The 3.13.0b4 release marked the end of the beta phase and paved the way for the release candidate phase. Note that 3.13.0rc1 is a pre-release, so you shouldn’t use it for production environments. However, it provides a great way to try some new and exciting language features. There’s also great some news from the Python […]

Read more

From Train-Test to Cross-Validation: Advancing Your Model’s Evaluation

Many beginners will initially rely on the train-test method to evaluate their models. This method is straightforward and seems to give a clear indication of how well a model performs on unseen data. However, this approach can often lead to an incomplete understanding of a model’s capabilities. In this blog, we’ll discuss why it’s important to go beyond the basic train-test split and how cross-validation can offer a more thorough evaluation of model performance. Join us as we guide you […]

Read more

5 Tips for Getting Started with Time Series Analysis

Image by Author | Created on Canva As a machine learning engineer or a data scientist, you’ll likely need to work with time series data. Time series analysis focuses on data indexed by time, such as stock prices, temperature, and the like. If you’re already comfortable with machine learning fundamentals but new to time series, this guide will provide you with five actionable tips to get started. These tips will help you understand the aspects of time series data, preprocess […]

Read more

Large-scale pathology foundation models show promise on a variety of cancer-related tasks

Imagine if pathologists had tools that could help predict therapeutic responses just by analyzing images of cancer tissue. This vision may someday become a reality through the revolutionary field of computational pathology. By leveraging AI and machine learning, researchers are now able to analyze digitized tissue samples with unprecedented accuracy and scale, potentially transforming how we understand and treat cancer. When a patient is suspected of having cancer, a tissue specimen is sometimes removed, stained, affixed to a glass slide, […]

Read more

Collaborators: AI and the economy with Brendan Lucier and Mert Demirer

[TEASER ENDS]  GRETCHEN HUIZINGA: You’re listening to Collaborators, a Microsoft Research Podcast showcasing the range of expertise that goes into transforming mind-blowing ideas into world-changing technologies. I’m Dr. Gretchen Huizinga. [MUSIC FADES]  On today’s episode, I’m talking to Dr. Brendan Lucier, a senior principal researcher in the economics and computation group at Microsoft Research, and Dr. Mert Demirer, an assistant professor of applied economics at the MIT Sloan School of Management. Brendan and Mert are exploring the economic impact of […]

Read more
1 23 24 25 26 27 914