Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • Quantization

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, 
Mathematics, 
Quantization, 
TensorRT  
  Read More

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, 
Mathematics, 
Quantization, 
Accelerated Computing, 
CUDA  
  Read More

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, 
Python, 
Inference, 
Quantization, 
Accelerated Computing, 
NVIDIA, 
TensorRT, 
PyTorch, 
GPU  
  Read More

Quantization Unit Test

 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)
How To Unit Test Quantization Implementation

 
Deep Learning, 
Mathematics, 
Quantization, 
Software Engineering, 
Unit Test  
  Read More

Function Approximation Using Lookup Table and Interpolation

 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)
Using Motorola CPU32 as an Example

 
Deep Learning, 
Quantization, 
Computer Architecture  
  Read More

PyTorch Quantization Aware Training

 12-06-2020 04-29-2021 blog 17 minutes read (About 2475 words)
PyTorch Inference Optimized Training Using Fake Quantization

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Static Quantization

 11-28-2020 04-29-2021 blog 29 minutes read (About 4408 words)
PyTorch Static Quantization for Convolutional Neural Networks

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
Quantization, 
Transformer, 
PyTorch, 
HuggingFace  
  Read More

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, 
Deep Learning, 
Mathematics, 
Neural Network, 
Quantization, 
Matrix Multiplication  
  Read More
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Santa Clara, California

Posts

1205

Categories

8

Tags

750

  Follow   Sponsor

Advertisement


Categories

  • article20
  • blog539
  • essay306
  • life269
  • miscellaneous2
  • photography41
  • project20
  • reading8

follow.it

Recents

10-16-2025

CuTe Tiled Copy

blog

10-14-2025

单车开启 B 站直播

essay

10-13-2025

CuTe Thread-Value Layout

blog

10-12-2025

2025 年芝加哥马拉松

essay

10-11-2025

Coyote Creek Parkway

photography

Archives

  • October 202515
  • September 202515
  • August 202527
  • July 202523
  • June 202547
  • See All >>

Tags

Outdoors274
Hiking208
California205
CPP114
Mathematics101
Deep Learning84
CUDA63
Running53
Photography49
Software Engineering36
Machine Learning34
Python34
Racing34
Bird32
Movie32
Wildlife32
Statistics31
Linux30
Park30
China29
See All >>
Lei Mao's Log Book

© 2017-2025 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×