Lei Mao's Log Book

CuTe Matrix Transpose

 11-20-2024 12-26-2024 article an hour read (About 10825 words)
Matrix Transpose CUDA Kernel Implementation Using CuTe

 
Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe

CuTe Layout Algebra

 10-20-2024 06-05-2025 article 2 hours read (About 17835 words)
Mathematical Fundamentals to CUTLASS Computing

 
Mathematics, Accelerated Computing, CUDA, CUTLASS, CuTe, Category Theory

CUDA Matrix Multiplication Optimization

 01-20-2024 01-20-2024 article 2 hours read (About 19282 words)
General Matrix Multiplication CUDA Performance Optimization

 
CPP, Accelerated Computing, CUDA, NVIDIA

How To Debug Deep Learning Inference Applications

 01-01-2024 01-01-2024 article 23 minutes read (About 3511 words)
First Principles of Evaluating Deep Learning Inference

 
Deep Learning, Software Engineering, Deep Learning Inference, Numerical Errors

Interpolation

 06-07-2023 06-07-2023 article 27 minutes read (About 4123 words)
One of the Most Widely Used Estimation Methods

 
Computer Vision, Mathematics, Signal Processing

OpenAI GPT Models

 04-15-2023 04-15-2023 article 28 minutes read (About 4168 words)
Generative Pre-Trained Transformer Models From OpenAI

 
Deep Learning, Natural Language Processing, Transformer, Reinforcement Learning, OpenAI, GPT, ChatGPT, InstructGPT

Transformer Autoregressive Inference Optimization

 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words)
Principles for Faster Transformer Inference

 
Deep Learning, Inference, Natural Language Processing, Optimization, Transformer, Accelerated Computing

Hamming Code

 06-01-2022 06-01-2022 article 29 minutes read (About 4424 words)
Create Perfect Error-Correction Hamming Code From Scratch

 
Telecommunication, Computer Science

Automatic Differentiation

 02-21-2022 06-28-2023 article 34 minutes read (About 5117 words)
Mathematical Foundations to Neural Network Optimization

 
Machine Learning, Deep Learning, Mathematics, Mathematical Optimization

Pruning for Neural Networks

 03-01-2021 03-01-2021 article 14 minutes read (About 2172 words)
Mathematical Foundations to Neural Network Pruning

 
Machine Learning, Deep Learning, Mathematics

Principal Component Analysis

 05-17-2020 05-17-2020 article 25 minutes read (About 3737 words)
Fundamentals to Principal Component Analysis

 
Mathematics, Principal Component Analysis

Quantization for Neural Networks

 05-17-2020 02-09-2023 article an hour read (About 6957 words)
Mathematical Foundations to Neural Network Quantization

 
Machine Learning, Deep Learning, Mathematics, Neural Network, Quantization, Matrix Multiplication
Lei Mao

Artificial Intelligence Machine Learning Computer Science

Santa Clara, California

Posts: 1128, Categories: 8, Tags: 712


© 2017-2025 Lei Mao  Powered by Hexo & Icarus