MMJ News Network

DeepSeek releases ‘sparse attention’ model that cuts API costs in half

By Editor | September 29, 2025 | 2 Mins Read


Researchers at DeepSeek on Monday released a new experimental model called V3.2-exp, designed to have dramatically lower inference costs when used in long-context operations. DeepSeek announced the model with a post on Hugging Face, also posting a linked academic paper on GitHub.

The most important feature of the new model is called DeepSeek Sparse Attention, an intricate system described in detail in the diagram below. In essence, the system uses a module called a “lightning indexer” to prioritize specific excerpts from the context window. After that, a separate system called a “fine-grained token selection system” chooses specific tokens from within those excerpts to load into the module’s limited attention window. Taken together, they allow the Sparse Attention models to operate over long portions of context with comparatively small server loads.

[Diagram: DeepSeek Sparse Attention]
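The two-stage idea described above can be illustrated with a toy sketch. This is not DeepSeek's implementation: the function name `sparse_attention_step`, the low-dimensional indexer projections `q_idx` and `K_idx`, and the `top_k` budget are all hypothetical, and the sketch collapses the excerpt-level and token-level steps into a single per-token score. It only shows the general pattern of scoring the whole context cheaply, then running full attention over a small selected subset.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    x = x - np.max(x)
    e = np.exp(x)
    return e / e.sum()

def sparse_attention_step(q, K, V, q_idx, K_idx, top_k):
    """Attend from one query to only the top_k context tokens.

    q: (d,) query; K, V: (L, d) keys and values for the context.
    q_idx: (d_idx,) and K_idx: (L, d_idx) are small, cheap projections
    standing in for an indexer (hypothetical, for illustration only).
    """
    # Stage 1: cheap relevance score for every token in the context.
    index_scores = K_idx @ q_idx                      # shape (L,)

    # Stage 2: keep only the top_k highest-scoring tokens.
    k = min(top_k, K.shape[0])
    selected = np.argpartition(index_scores, -k)[-k:]
    selected.sort()                                   # restore positional order

    # Stage 3: ordinary scaled dot-product attention, but only over the
    # selected tokens instead of the full context.
    scores = K[selected] @ q / np.sqrt(q.shape[0])
    weights = softmax(scores)
    return weights @ V[selected], selected

# Example with random data (all sizes are arbitrary).
rng = np.random.default_rng(0)
L, d, d_idx = 64, 16, 4
K, V = rng.normal(size=(L, d)), rng.normal(size=(L, d))
K_idx = rng.normal(size=(L, d_idx))
q, q_idx = rng.normal(size=d), rng.normal(size=d_idx)

out, picked = sparse_attention_step(q, K, V, q_idx, K_idx, top_k=8)
print(out.shape, picked)
```

When `top_k` is at least the context length, the sketch reduces to ordinary dense attention, which makes the trade-off explicit: a smaller budget means less work per query, at the risk of skipping tokens the indexer scored poorly.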

For long-context operations, the benefits of the system are significant. Preliminary testing by DeepSeek found that the price of a simple API call could be reduced by as much as half in long-context situations. Further testing will be required to build a more robust assessment, but because the model is open-weight and freely available on Hugging Face, it won’t be long before third-party tests can assess the claims made in the paper.
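To get a rough sense of why savings would show up mainly in long-context situations, the sketch below counts how many query-key pairs a causal attention layer scores with and without a per-query token budget. The function names and the `top_k=2048` budget are illustrative assumptions, not figures from DeepSeek. The count ignores the cost of the indexer itself and every other part of the model, so it does not translate directly into API prices.

```python
def dense_pairs(context_len):
    # Causal dense attention: token t attends to all t tokens up to itself.
    return context_len * (context_len + 1) // 2

def sparse_pairs(context_len, top_k):
    # With a per-query budget, token t attends to at most top_k tokens.
    return sum(min(t, top_k) for t in range(1, context_len + 1))

# Illustrative budget only; not a figure from the article or the paper.
for context_len in (1_000, 10_000, 100_000):
    ratio = sparse_pairs(context_len, top_k=2048) / dense_pairs(context_len)
    print(context_len, f"{ratio:.3f}")
```

For contexts shorter than the budget, the two counts are identical. The gap opens only once the context grows well past `top_k`, which is consistent with the article's framing of the benefit as specific to long-context operations.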

DeepSeek’s new model is one of a string of recent breakthroughs tackling the problem of inference costs — essentially, the server costs of operating a pre-trained AI model, as distinct from the cost of training it. In DeepSeek’s case, the researchers were looking for ways to make the fundamental transformer architecture operate more efficiently — and finding that there are significant improvements to be made.

Based in China, DeepSeek has been an unusual figure in the AI boom, particularly for those who view AI research as a nationalist struggle between the U.S. and China. The company made waves at the beginning of the year with its R1 model, trained primarily using reinforcement learning at a far lower cost than its American competitors. But the model has not sparked a wholesale revolution in AI training, as some predicted, and the company has receded from the spotlight in the months since.

The new “sparse attention” approach is unlikely to produce the same uproar as R1 — but it could still teach U.S. providers some much needed tricks to help keep inference costs low.


