Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • Quantization

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, 
Mathematics, 
Quantization, 
TensorRT  
  Read More

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, 
Mathematics, 
Quantization, 
Accelerated Computing, 
CUDA  
  Read More

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, 
Python, 
Inference, 
Quantization, 
Accelerated Computing, 
NVIDIA, 
TensorRT, 
PyTorch, 
GPU  
  Read More

Quantization Unit Test

 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)
How To Unit Test Quantization Implementation

 
Deep Learning, 
Mathematics, 
Quantization, 
Software Engineering, 
Unit Test  
  Read More

Function Approximation Using Lookup Table and Interpolation

 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)
Using Motorola CPU32 as an Example

 
Deep Learning, 
Quantization, 
Computer Architecture  
  Read More

PyTorch Quantization Aware Training

 12-06-2020 04-29-2021 blog 17 minutes read (About 2475 words)
PyTorch Inference Optimized Training Using Fake Quantization

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Static Quantization

 11-28-2020 04-29-2021 blog 29 minutes read (About 4408 words)
PyTorch Static Quantization for Convolutional Neural Networks

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
Quantization, 
Transformer, 
PyTorch, 
HuggingFace  
  Read More

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, 
Deep Learning, 
Mathematics, 
Neural Network, 
Quantization, 
Matrix Multiplication  
  Read More
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1218

Categories

8

Tags

756

  Follow   Sponsor

Advertisement


Categories

  • article20
  • blog542
  • essay309
  • life273
  • miscellaneous2
  • photography44
  • project20
  • reading8

follow.it

Recents

11-04-2025

Nsight Streamer

blog

11-02-2025

Coyote Lagoon at Don Edwards

photography

11-02-2025

Coyote Lagoon at Don Edwards 徒步

life

11-02-2025

2025 年纽约马拉松

essay

10-31-2025

2025 年 9 月和 10 月该入手的模型手办

essay

Archives

  • November 20254
  • October 202524
  • September 202515
  • August 202527
  • July 202523
  • See All >>

Tags

Outdoors278
Hiking212
California209
CPP114
Mathematics102
Deep Learning84
CUDA65
Running55
Photography53
Software Engineering36
Wildlife36
Bird35
Racing35
Machine Learning34
Python34
Movie32
Statistics32
Park31
Linux30
China29
See All >>
Lei Mao's Log Book

© 2017-2025 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×