Lei Mao's Log Book

System Performance Optimizations

 02-16-2026 02-22-2026 article 16 minutes read (About 2338 words)
Principles and Techniques of System Performance Optimizations at Different Levels

 
Performance Optimization, High Performance Computing, Systems Engineering

CuTe Matrix Transpose

 11-20-2024 09-30-2025 article an hour read (About 10892 words)
Matrix Transpose CUDA Kernel Implementation Using CuTe

 
Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe

CuTe Layout Algebra

 10-20-2024 07-14-2025 article 2 hours read (About 19874 words)
Mathematical Fundamentals to CUTLASS Computing

 
Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe, Category Theory

CUDA Matrix Multiplication Optimization

 01-20-2024 01-20-2024 article 2 hours read (About 19282 words)
General Matrix Multiplication CUDA Performance Optimization

 
CPP, Accelerated Computing, CUDA, NVIDIA

How To Debug Deep Learning Inference Applications

 01-01-2024 01-01-2024 article 23 minutes read (About 3511 words)
First Principles of Evaluating Deep Learning Inference

 
Deep Learning, Software Engineering, Deep Learning Inference, Numerical Errors

Interpolation

 06-07-2023 06-07-2023 article 27 minutes read (About 4123 words)
One of the Most Widely Used Estimation Methods

 
Computer Vision, Mathematics, Signal Processing

OpenAI GPT Models

 04-15-2023 04-15-2023 article 28 minutes read (About 4168 words)
Generative Pre-Trained Transformer Models From OpenAI

 
Deep Learning, Natural Language Processing, Transformer, Reinforcement Learning, OpenAI, GPT, ChatGPT, InstructGPT

Transformer Autoregressive Inference Optimization

 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words)
Principles for Faster Transformer Inference

 
Deep Learning, Inference, Natural Language Processing, Optimization, Transformer, Accelerated Computing

Hamming Code

 06-01-2022 06-01-2022 article 29 minutes read (About 4424 words)
Create Perfect Error-Correction Hamming Code From Scratch

 
Telecommunication, Computer Science

Automatic Differentiation

 02-21-2022 06-28-2023 article 34 minutes read (About 5117 words)
Mathematical Foundations to Neural Network Optimization

 
Machine Learning, Deep Learning, Mathematics, Mathematical Optimization

Pruning for Neural Networks

 03-01-2021 03-01-2021 article 14 minutes read (About 2172 words)
Mathematical Foundations to Neural Network Pruning

 
Machine Learning, Deep Learning, Mathematics

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, Deep Learning, Mathematics, Neural Network, Quantization, Matrix Multiplication
Lei Mao

Artificial Intelligence, Machine Learning, Computer Science

Menlo Park, California

© 2017-2026 Lei Mao  Powered by Hexo & Icarus