Live Coding Decoding - 搜索 News

GLM-5.2 Open Weights Live: Top Coding Benchmark, but API Use Carries China Data Risk

a mobile phone's screen showing the logo of Chinese AI Zhipu in Beijing on January 21, 2026. Investor confidence in Chinese AI startups is riding high, but obstacles to their long-term success range ...

5 天

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

Tech Times

DeepSeek Releases DSpark: Speculative Decoding Makes V4 Up to 85 Percent Faster

DeepSeek speculative decoding framework DSpark went live June 27 on V4-Flash and V4-Pro, reporting up to 85 percent faster ...

18 天

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for ...

It allows engineering teams to host frontier-level AI on their own sovereign infrastructure, entirely eliminating vendor lock ...

the-decoder

Zhipu AI's GLM-5.2 closes in on closed-source leaders in coding marathons

Chinese AI lab Zhipu AI releases GLM-5.2 with a stable 1-million-token context under the MIT license. On hours-long coding tasks, the open-source model trails Anthropic's Opus models by just a few ...

1 天

Waterloo's PAW compiles task specs into 23MB LoRA adapters a 600M-parameter model runs ...

Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2, 2026, a system that compiles any natural-language task spec into a 23MB ...

GitHub

Qwen3.6-35B-A3B-heretic NVFP4 + DFlash on DGX Spark

A production-stable deployment of AEON-7/Qwen3.6-35B-A3B-heretic-NVFP4 with DFlash speculative decoding on NVIDIA DGX Spark (GB10 / sm_121a). ⚠️ READ THE REQUIREMENTS SECTION FIRST. This image and its ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果