Lei Mao's Log Book
  • Tags
  • Deep Learning

TensorRT Plugin Version and Namespace

 09-08-2025 09-08-2025 blog 8 minutes read (About 1152 words)
Handling TensorRT Plugin Conflicts Using Version and Namespace

 
Deep Learning, Software Engineering, NVIDIA, TensorRT

Online Safe Softmax

 06-23-2025 06-23-2025 blog 5 minutes read (About 741 words)
Safe and Efficient Online Softmax Calculation

 
Deep Learning, Mathematics, Accelerated Computing

TensorRT Static Plugin VS Dynamic Plugin

 06-05-2025 06-05-2025 blog 13 minutes read (About 1941 words)
Managing The Lifetime and Registration of TensorRT Plugins

 
CPP, Deep Learning, TensorRT

Computing Hessian Matrix Via Automatic Differentiation

 05-22-2025 05-22-2025 blog 18 minutes read (About 2664 words)
Computing Higher-Order Derivatives Using Automatic Differentiation

 
Deep Learning, Mathematics, Neural Networks, Calculus, Automatic Differentiation

Optimal Brain Surgeon

 05-14-2025 05-14-2025 blog 40 minutes read (About 6047 words)
Derivation and Extension of The Classical Optimal Brain Surgeon Algorithm

 
Deep Learning, Mathematics, Neural Networks, Calculus, Neural Network Pruning

ICML 2025 Area Chair Experience

 05-03-2025 05-03-2025 blog 8 minutes read (About 1181 words)
First Time Serving as ICML Area Chair

 
Machine Learning, Deep Learning, Conference, ICML

TensorRT Implicit Weight Quantization

 04-29-2025 04-29-2025 blog 8 minutes read (About 1265 words)
TensorRT Implicit Weight Quantization Caveats and Tricks

 
Deep Learning, Mathematics, Quantization, TensorRT

Automatic Differentiation Revisited

 04-12-2025 04-12-2025 blog 15 minutes read (About 2198 words)
Jacobian Matrix, De Novo Chain Rule Expression, Jacobian-Vector Product, Vector-Jacobian Product

 
Deep Learning, Mathematics, Linear Algebra, Calculus, Automatic Differentiation

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, Neural Network, Transformer, Computer Architecture, Performance Optimization, Large Language Model

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, Neural Network, Transformer, Computer Architecture, Performance Optimization, Large Language Model

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, Mathematics, Quantization, Accelerated Computing, CUDA

NeurIPS 2024 Area Chair Experience

 12-26-2024 12-26-2024 blog 9 minutes read (About 1389 words)
First Time Serving as NeurIPS Area Chair

 
Deep Learning, NeurIPS, Conference
Lei Mao

Artificial Intelligence Machine Learning Computer Science

Santa Clara, California

© 2017-2025 Lei Mao  Powered by Hexo & Icarus