Lei Mao's Log Book
  • Tags
  • Deep Learning

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model

AWQ: Activation-Aware Weight Quantization

 01-01-2025 01-01-2025 blog 18 minutes read (About 2738 words)
Same Performance as Group-Wise Weight-Only Quantization But with Better Accuracy

 
Deep Learning, Mathematics, Quantization, Accelerated Computing, CUDA

NeurIPS 2024 Area Chair Experience

 12-26-2024 12-26-2024 blog 9 minutes read (About 1389 words)
First Time Serving as NeurIPS Area Chair

 
Deep Learning, Conference, NeurIPS

Neural Radiance Fields

 07-24-2024 07-24-2024 blog 6 minutes read (About 826 words)
Scene Representation and Differentiable Rendering with Neural Radiance Fields

 
Deep Learning, Computer Vision, Neural Network, NeRF, Neural Radiance Fields

LoRA and LoRAPrune

 07-11-2024 07-11-2024 blog 11 minutes read (About 1664 words)
Fine-Tuning and Pruning of Large Language Models Using Low-Rank Adaptation

 
Deep Learning, Neural Network, Neural Network Pruning, LoRA, LoRAPrune

Parameter Importance Approximation Via Taylor Expansion In Neural Network Pruning

 06-20-2024 07-07-2024 blog 12 minutes read (About 1778 words)
Scoring Neural Network Parameter Importance Faster

 
Deep Learning, Mathematics, Taylor Expansion, Neural Network Pruning

PyTorch Variational Autoencoder

 06-14-2024 06-14-2024 blog 19 minutes read (About 2921 words)
Implementing Variational Autoencoder in PyTorch

 
Deep Learning, Statistics, Probability, Mathematics, Neural Network, Bayesian Inference, PyTorch, Variational Autoencoder, VAE

PyTorch Automatic Mixed Precision Training

 06-08-2024 06-08-2024 blog 3 minutes read (About 492 words)
Accelerating PyTorch Deep Learning Training with Automatic Mixed Precision

 
Deep Learning, Neural Network, PyTorch, Mixed Precision Training, Automatic Mixed Precision Training

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, Python, Inference, Quantization, Accelerated Computing, NVIDIA, TensorRT, PyTorch, GPU

TensorRT Python Inference

 05-18-2024 05-18-2024 blog 12 minutes read (About 1843 words)
TensorRT Python Inference Example

 
Deep Learning, Python, Inference, NVIDIA, TensorRT, GPU

Reparameterization Trick

 05-08-2024 05-08-2024 blog 12 minutes read (About 1749 words)
How Distribution of a Random Variable Changes when the Variable is Transformed in a Deterministic Way

 
Deep Learning, Statistics, Probability, Mathematics, Jacobian, Variational Autoencoder
© 2017-2026 Lei Mao  Powered by Hexo & Icarus