Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • Deep Learning

NeurIPS 2025 Area Chair Experience

 09-21-2025 09-21-2025 blog 8 minutes read (About 1136 words)
Serving The Dataset and Benchmark Track This Year

 
Deep Learning, 
NeurIPS, 
Conference  
  Read More

TensorRT Plugin Version and Namespace

 09-08-2025 09-08-2025 blog 8 minutes read (About 1152 words)
Handling TensorRT Plugin Conflicts Using Version and Namespace

 
Deep Learning, 
Software Engineering, 
NVIDIA, 
TensorRT  
  Read More

Online Safe Softmax

 06-23-2025 06-23-2025 blog 5 minutes read (About 741 words)
Safe and Efficient Online Softmax Calculation

 
Deep Learning, 
Mathematics, 
Accelerated Computing  
  Read More

TensorRT Static Plugin VS Dynamic Plugin

 06-05-2025 06-05-2025 blog 13 minutes read (About 1941 words)
Managing The Lifetime and Registration of TensorRT Plugins

 
CPP, 
Deep Learning, 
TensorRT  
  Read More

Computing Hessian Matrix Via Automatic Differentiation

 05-22-2025 05-22-2025 blog 18 minutes read (About 2664 words)
Computing Higher-Order Derivatives Using Automatic Differentiation

 
Deep Learning, 
Mathematics, 
Neural Networks, 
Calculus, 
Automatic Differentiation  
  Read More

Optimal Brain Surgeon

 05-14-2025 05-14-2025 blog 40 minutes read (About 6047 words)
Derivation and Extension of The Classical Optimal Brain Surgeon Algorithm

 
Deep Learning, 
Mathematics, 
Neural Networks, 
Calculus, 
Neural Network Pruning  
  Read More

ICML 2025 Area Chair Experience

 05-03-2025 05-03-2025 blog 8 minutes read (About 1181 words)
First Time Serving as ICML Area Chair

 
Machine Learning, 
Deep Learning, 
Conference, 
ICML  
  Read More

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, 
Mathematics, 
Quantization, 
TensorRT  
  Read More

Automatic Differentiation Revisited

 04-12-2025 04-12-2025 blog 15 minutes read (About 2198 words)
Jacobian Matrix, De Novo Chain Rule Expression, Jacobian-Vector Product, Vector-Jacobian Product

 
Deep Learning, 
Mathematics, 
Linear Algebra, 
Calculus, 
Automatic Differentiation  
  Read More

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, 
Neural Network, 
Transformer, 
Computer Architecture, 
Performance Optimization, 
Large Language Model  
  Read More

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, 
Neural Network, 
Transformer, 
Computer Architecture, 
Performance Optimization, 
Large Language Model  
  Read More

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, 
Mathematics, 
Quantization, 
Accelerated Computing, 
CUDA  
  Read More
Previous
Next
  • 1
  • 2
  • …
  • 7
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1245

Categories

8

Tags

773

  Follow   Sponsor

Advertisement


Categories

  • article20
  • blog550
  • essay316
  • life280
  • miscellaneous2
  • photography49
  • project20
  • reading8

follow.it

Recents

12-11-2025

汽车反光镜的设置

essay

12-10-2025

Install NVIDIA RTX 5080

blog

12-06-2025

NVIDIA Tensor Core TN Layout MMA Instruction

blog

12-06-2025

网红的自杀式挑战

essay

12-06-2025

2025 Midway Shelter Winter Run 10K 竞赛

life

Archives

  • December 20256
  • November 202525
  • October 202524
  • September 202515
  • August 202527
  • See All >>

Tags

Outdoors285
Hiking217
California216
CPP117
Mathematics102
Deep Learning84
CUDA67
Photography62
Running58
Wildlife41
Racing37
Bird36
Software Engineering36
Machine Learning34
Python34
Movie33
Statistics32
Park31
Linux30
China29
See All >>
Lei Mao's Log Book

© 2017-2025 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×