Voice Cloning with Consent
In this blog post, we introduce the idea of a ‘voice consent gate’ to support voice cloning with consent. We provide an example Space and accompanying code to start the ball rolling on the idea.
Read moreDeep Learning, NLP, NMT, AI, ML
In this blog post, we introduce the idea of a ‘voice consent gate’ to support voice cloning with consent. We provide an example Space and accompanying code to start the ball rolling on the idea.
Read moreToday we are excited to share Granite 4.0 Nano, our smallest models yet, released as part of IBM’s Granite 4.0 model family. Designed for the edge and on-device applications, these models demonstrate excellent performance for
Read moreSimulation has been a cornerstone in medical imaging to address the data gap. However, in healthcare robotics until now, it’s often been too slow, siloed, or difficult to translate into real-world systems. That’s now changing. With new advances in GPU-accelerated simulation and digital twins, developers can design, test, and validate robotic workflows entirely in virtual environments – reducing prototyping time from months to days,
Read moreA hands-on guide to collecting data, training policies, and deploying autonomous medical robotics workflows on real hardware Table-of-Contents
Read moreThe status quo of AI chip usage, that was once almost entirely U.S.-based, is changing. China’s immense progress in open-weight AI development is now being met with rapid domestic AI chip development. In the past few months,
Read moreIt’s been fantastic to see the community dive into our new MiniMax M2, with many highlighting its impressive skills in complex agentic tasks. This is particularly exciting for me, as my work was centered on the agent alignment part of its post-training. In this post, I’d like to share some of the key insights and lessons we learned during that process.
Read moreToday, we are happy to announce a new and deeper partnership with Google Cloud, to enable companies to build their own AI with open models. “Google has made some of the most impactful contributions to open AI, from
Read moreLooking to show off your robotics aptitude? The AMD Open Robotics Hackathon hosted by AMD, Hugging Face, and Data Monsters is the place to do it. Whether you’re a student, hobbyist, startup
Read moreWe converted our 15B reasoning model to a Mamba hybrid achieving 2.1x throughput with minimal quality loss. The key? A non-obvious insight about what data to distill on, and why intuition fails here. When MiniMax published their M2 post-mortem in October explaining why they abandoned efficient attention at 230B scale, the narrative briefly became “efficient attention is dead.” Within days, Kimi Linear proved otherwise. The real lesson: it depends on your constraints. Our constraint was simple: we had a strong […]
Read moreWhile everyone (and their grandma 👵) is spinning up new ASR models, picking the right one for your use case can feel more overwhelming than choosing your next Netflix show. As of 21 Nov 2025, there are 150 Audio-Text-to-Text and 27K ASR models on the Hub 🤯 Most benchmarks focus on short-form English transcription (<30s), and overlook other important tasks, such as (1) multilingual performance and (2) model throughput, which can a be deciding factor for long-form audio like meetings […]
Read more