Hugging Face Transformers Python Library

NVIDIA Diffusion LLM Hits 2.42x Throughput Without Retraining: Nemotron TwoTower Released

NVIDIA diffusion language model Nemotron TwoTower achieves 2.42x LLM inference throughput without a full retraining run, ...

GitHub

A Lightweight Library for AI Observability

We differentiate between observers and stores. Observers wrap generative AI APIs (like OpenAI or llama-index) and track their interactions. Stores are classes that sync these observations to different ...

GitHub

Speech To Speech: Build local voice agents with open-source models

The pipeline provides a fully open and modular approach, with a focus on leveraging models available through the Transformers library on the Hugging Face hub. The code is designed for easy ...

note

[Deep Dive] The Era Where Libraries Are Evaluated by 'Agent Usability' — The Token Cost ...

On June 18, 2026, Hugging Face published a blog post titled "Is it agentic enough?". As coding agents (systems where AI autonomously writes, executes, and fixes code) increasingly interact directly ...

什么值得买 (SMZDM) Community Channel on MSN

HuggingFace + FreeLLMAPI: 16 Free Tiers > 1.7 Billion Tokens per Month

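If you regularly call LLM APIs, you have probably run into situations like these: a platform's quota runs out for the day and your code immediately throws an error; you want to try a new model, but have to go to yet another website to apply for a key and once again configure the base ...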

TechRepublic

Malicious Hugging Face Models Could Trigger Remote Code Execution

A flaw in Hugging Face Transformers could allow malicious AI models to execute code, exposing credentials and highlighting AI supply chain risks. Organizations using vulnerable versions of the Hugging ...
