Model Quantization and Distillation

DeepSeek, Model Distillation, and the Future of AI IP Protection

DeepSeek’s R1 release has generated heated discussions on the topic of model distillation and how companies may protect against unauthorized distillation. Model distillation has broad IP implications ...

CNBC

How DeepSeek used distillation to train its artificial intelligence model, and what it means for companies such as OpenAI

Chinese artificial intelligence lab DeepSeek roiled markets in January, setting off a massive tech and semiconductor selloff after unveiling AI models that it said were cheaper and more efficient than ...

Wired

Distillation Can Make AI Models Smaller and Cheaper

The original version of this story appeared in Quanta Magazine. The Chinese AI company DeepSeek released a chatbot earlier this year called R1, which drew a huge amount of attention. Most of it ...

InfoWorld

Model quantization and the dawn of edge AI

Model quantization bridges the gap between the computational limitations of edge devices and the demands for highly accurate models and real-time intelligent applications. The convergence of ...

InfoWorld

What is model quantization? Smaller, faster LLMs

Reducing the precision of model weights can make deep neural networks run faster in less GPU memory, while preserving model accuracy. If ever there were a salient example of a counter-intuitive ...

Quanta Magazine

How Distillation Makes AI Models Smaller and Cheaper

The Chinese AI company DeepSeek released a chatbot earlier this year called R1, which drew a huge amount of attention. Most of it focused on the fact that a relatively small and unknown company said ...

Semiconductor Engineering

Neural Network Model Quantization On Mobile

The general definition of quantization states that it is the process of mapping continuous infinite values to a smaller set of discrete finite values. In this blog, we will talk about quantization in ...

TechCrunch

Pruna AI open sources its AI model optimization framework

Pruna AI, a European startup that has been working on compression algorithms for AI models, is making its optimization framework open source on Thursday. Pruna AI has been creating a framework that ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results