Lei Mao's Log Book
  • Tags
  • Quantization

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, Mathematics, Quantization, TensorRT

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, Mathematics, Quantization, Accelerated Computing, CUDA

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, Python, Inference, Quantization, Accelerated Computing, NVIDIA, TensorRT, PyTorch, GPU

Quantization Unit Test

 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)
How To Unit Test Quantization Implementation

 
Deep Learning, Mathematics, Quantization, Software Engineering, Unit Test

Function Approximation Using Lookup Table and Interpolation

 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)
Using Motorola CPU32 as an Example

 
Deep Learning, Quantization, Computer Architecture

PyTorch Quantization Aware Training

 12-06-2020 04-29-2021 blog 17 minutes read (About 2475 words)
PyTorch Inference Optimized Training Using Fake Quantization

 
Quantization, PyTorch, CNN

PyTorch Static Quantization

 11-28-2020 04-29-2021 blog 29 minutes read (About 4408 words)
PyTorch Static Quantization for Convolutional Neural Networks

 
Quantization, PyTorch, CNN

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
Quantization, Transformer, PyTorch, HuggingFace

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, Deep Learning, Mathematics, Neural Network, Quantization, Matrix Multiplication
Lei Mao

Artificial Intelligence Machine Learning Computer Science

Santa Clara, California

© 2017-2025 Lei Mao  Powered by Hexo & Icarus