Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayArchivesCategoriesTagsFAQs
  • Tags
  • Quantization

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, 
Mathematics, 
Quantization, 
TensorRT  
  Read More

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, 
Mathematics, 
Quantization, 
Accelerated Computing, 
CUDA  
  Read More

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, 
Python, 
Inference, 
Quantization, 
Accelerated Computing, 
NVIDIA, 
TensorRT, 
PyTorch, 
GPU  
  Read More

Quantization Unit Test

 03-25-2024 03-25-2024 blog 5 minutes read (About 748 words)
How To Unit Test Quantization Implementation

 
Deep Learning, 
Mathematics, 
Quantization, 
Software Engineering, 
Unit Test  
  Read More

Function Approximation Using Lookup Table and Interpolation

 09-22-2023 09-22-2023 blog 7 minutes read (About 1001 words)
Using Motorola CPU32 as an Example

 
Deep Learning, 
Quantization, 
Computer Architecture  
  Read More

PyTorch Quantization Aware Training

 12-06-2020 04-29-2021 blog 17 minutes read (About 2475 words)
PyTorch Inference Optimized Training Using Fake Quantization

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Static Quantization

 11-28-2020 04-29-2021 blog 29 minutes read (About 4408 words)
PyTorch Static Quantization for Convolutional Neural Networks

 
Quantization, 
PyTorch, 
CNN  
  Read More

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
Quantization, 
Transformer, 
PyTorch, 
HuggingFace  
  Read More

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, 
Deep Learning, 
Mathematics, 
Neural Network, 
Quantization, 
Matrix Multiplication  
  Read More
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Santa Clara, California

Posts

1061

Categories

7

Tags

689

  Follow   Sponsor

Advertisement


Categories

  • article20
  • blog509
  • essay270
  • life232
  • miscellaneous2
  • project20
  • reading8

follow.it

Recents

05-14-2025

Optimal Brain Surgeon

blog

05-12-2025

唐僧肉

essay

05-11-2025

Golden State Model Railroad Museum 参观

life

05-11-2025

Miller/Knox Regional Shoreline 徒步

life

05-10-2025

2025 Heart & Soles Run 5K 竞赛

life

Archives

  • May 202510
  • April 202521
  • March 202525
  • February 202521
  • January 202523
  • See All >>

Tags

Outdoors235
Hiking178
California167
CPP108
Mathematics90
Deep Learning79
CUDA50
Running47
Software Engineering35
Machine Learning34
Python32
Statistics31
Park30
Racing30
Linux29
Docker26
Movie25
China23
Museum23
Physics23
See All >>
Lei Mao's Log Book

© 2017-2025 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×