The Open Agent Leaderboard

How good are general purpose AI agents? We built an open evaluation framework to find out. Most evaluations in AI report a simple result: what score each model got on which benchmarking task. When you deploy an agent, you’re not just choosing a model. You’re choosing a full system: what tools the agent can use, how it plans its steps, what it remembers between actions, how it recovers when something goes wrong. Change any of those and the same model […]

Read more

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

PaddleOCR 3.5 brings OCR and document parsing tasks closer to the Hugging Face ecosystem. With this release, supported PaddleOCR models can run with Hugging Face Transformers as an inference backend by setting: engine=”transformers” PaddleOCR continues to provide OCR model series such as PP-OCRv5 and document parsing model series such as PaddleOCR-VL 1.5, while Transformers becomes one of the supported backends for running them. Try the live demo on Hugging Face Spaces: https://huggingface.co/spaces/PaddlePaddle/paddleocr-3.5-transformers-demo What changed? PaddleOCR 3.5 introduces a more flexible […]

Read more

Introducing the Ettin Reranker Family

Today I’m releasing six new Sentence Transformers CrossEncoder rerankers, state-of-the-art at their respective sizes, built on top of the Ettin ModernBERT encoders, together with the data and full training recipe that produced them: The models were trained with a distillation recipe: pointwise MSE on mixedbread-ai/mxbai-rerank-large-v2 scores over cross-encoder/ettin-reranker-v1-data, which is a subset of lightonai/embeddings-pre-training mixed with a reranked subset of lightonai/embeddings-fine-tuning. Our six rerankers paired with google/embeddinggemma-300m on MTEB(eng, v2) Retrieval. See Results for five more embedder pairings. If you’re […]

Read more

OlmoEarth v1.1: A more efficient family of Earth observation models

đź§  Models: https://huggingface.co/collections/allenai/olmoearth | đź“„ Tech Report: https://allenai.org/papers/olmoearth_v1_1 | đź’» Code: https://github.com/allenai/olmoearth_pretrain We released OlmoEarth (v1) in November 2025. Since then, partners have applied it across a wide range of tasks, from tracking mangrove change to classifying drivers of forest loss to producing country-scale crop-type maps in days, scaling deployments to national, continental, and global areas. Every release moves us closer to our mission: bringing state-of-the-art AI to organizations and communities working to protect people and our planet. When OlmoEarth […]

Read more

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Large language models (LLMs) have become the default interface for code generation, math problem solving, summarization, document understanding, and many other developer workflows. Under the hood, though, many LLMs still generate text the same way: one token at a time, and each token depends on the tokens that appeared before it. As such, these models are called autoregressive, since they consume their own outputs. That autoregressive (AR) approach has been remarkably successful. It is stable to train, simple to serve, […]

Read more

Quiz: Build a Tic-Tac-Toe Game With Python and Tkinter

Interactive Quiz â‹… 9 QuestionsBy Joseph Peart Share In this quiz, you’ll test your understanding of Build a Tic-Tac-Toe Game With Python and Tkinter. By working through this quiz, you’ll revisit how to design game logic with Python classes, lay out and update Tkinter widgets, and wire up button clicks to a handler through the event loop. The quiz contains 9 questions and there is no time limit. You’ll get 1 point for each correct answer. At the end of […]

Read more

Quiz: Context Managers and Using Python’s with Statement

Interactive Quiz â‹… 8 QuestionsBy Joseph Peart Share In this quiz, you’ll test what you learned in the video course Context Managers and Using Python’s with Statement. By working through this quiz, you’ll revisit how the with statement runs setup and teardown for you, how to use standard-library context managers like open(), and how to write your own context managers as classes or with the @contextmanager decorator. The quiz contains 8 questions and there is no time limit. You’ll get […]

Read more

Quiz: How to Use the Claude API in Python

Interactive Quiz â‹… 9 QuestionsBy Joseph Peart Share In this quiz, you’ll test your knowledge of How to Use the Claude API in Python. By working through this quiz, you’ll revisit how to install the anthropic SDK, send prompts to Claude with client.messages.create(), shape responses with a system parameter, and return structured JSON output using a schema or Pydantic. The quiz contains 9 questions and there is no time limit. You’ll get 1 point for each correct answer. At the […]

Read more

How to Use the Claude API in Python

The fastest way to use the Claude API in Python is to install anthropic, set your API key, and call client.messages.create(). You’ll have a working response in under a minute: Example of Using the Claude API in Python Claude is Anthropic’s large language model, accessible via a clean REST API with an official Python SDK. Unlike heavier AI frameworks that require you to wire up multiple components before    

Read more
1 2 3 1,038