Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs

NVIDIA Tensor Core TN Layout MMA Instruction

 12-06-2025 12-06-2025 blog 16 minutes read (About 2389 words)
GEMM Layout, History, Performance, and Implementation

 
CPP, 
CUDA, 
NVIDIA, 
CUTLASS, 
CuTe, 
MMA, 
GEMM, 
Tensor Core  
  Read More

Replacing Thinkpad X1 Yoga CMOS Battery

 12-02-2025 12-02-2025 blog 3 minutes read (About 377 words)
The First Time I Replaced a CMOS Battery on a Computer

 
Thinkpad, 
CMOS, 
DIY  
  Read More

Benchmarking NVIDIA Tensor Core MMA Instruction Peak Performances

 11-26-2025 11-26-2025 blog 11 minutes read (About 1646 words)
Reproducing NVIDIA Advertised GPU AI Peak Performances Using CUTLASS and CuTe

 
CPP, 
CUDA, 
NVIDIA, 
CUTLASS, 
CuTe, 
MMA, 
GEMM, 
Tensor Core  
  Read More

Fix Bluetooth Not Found on Ubuntu Desktop

 11-23-2025 11-23-2025 blog a minute read (About 202 words)
Troubleshooting Bluetooth Not Found Issues

 
Ubuntu, 
Bluetooth, 
Personal Computer  
  Read More

Focus Breathing and Compensation

 11-21-2025 11-21-2025 blog 5 minutes read (About 776 words)
Physics and Mathematics Behind Focus Breathing and Compensation

 
Physics, 
Camera, 
Photography, 
Videography  
  Read More

Core Dump and GDB

 11-15-2025 11-15-2025 blog 7 minutes read (About 1029 words)
Analyzing Core Dump Files Using GDB

 
CPP, 
GDB, 
Core Dump  
  Read More

Image Processing Priorities: Resolution VS Quality

 11-09-2025 11-09-2025 blog 13 minutes read (About 2013 words)
Finding The Sweet Spots Between Image Resolution, JPEG Quality, and File Size

 
Photography, 
JPEG, 
Image Processing  
  Read More

Nsight Streamer

 11-04-2025 11-04-2025 blog 3 minutes read (About 515 words)
Nsight Systems and Nsight Compute GUIs In a Web Browser

 
CUDA, 
NVIDIA, 
Nsight Compute, 
Nsight Systems, 
Nsight Streamer  
  Read More

One-Pass Naive Algorithm for Computing Variance

 10-29-2025 10-29-2025 blog 6 minutes read (About 899 words)
Caveats and Tricks

 
Statistics, 
Numerical Stability, 
Algorithm  
  Read More

CuTe Arithmetic Tuple Tensor

 10-20-2025 10-20-2025 blog 16 minutes read (About 2388 words)
The Tensor Coordinate Generator In CuTe

 
Mathematics, 
Accelerated Computing, 
CUDA, 
CUTLASS, 
CuTe  
  Read More

CuTe Tiled Copy

 10-16-2025 10-16-2025 blog 28 minutes read (About 4216 words)
Understanding CuTe Tiled Copy

 
Mathematics, 
Accelerated Computing, 
CUDA, 
CUTLASS, 
CuTe  
  Read More

CuTe Thread-Value Layout

 10-13-2025 10-13-2025 blog 6 minutes read (About 955 words)
CuTe TV Layout, Inverse TV Layout, and TV Partition

 
Accelerated Computing, 
CUDA, 
CUTLASS, 
CuTe  
  Read More
Previous
Next
  • 1
  • 2
  • 3
  • …
  • 47
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1308

Categories

8

Tags

794

  Follow   Sponsor

Advertisement


Categories

  • article21
  • blog561
  • essay331
  • life300
  • miscellaneous2
  • photography65
  • project20
  • reading8

follow.it

Recents

02-28-2026

Fix MacBook Pro Space Key Stuck Problem

blog

02-28-2026

2026 年 1 月和 2 月该入手的模型手办

essay

02-28-2026

2026 Brazen Victory 10K 竞赛

life

02-23-2026

儿时的玩伴李峰

essay

02-21-2026

Marsh Creek Regional Trail 徒步

life

Archives

  • February 202617
  • January 202616
  • December 202535
  • November 202525
  • October 202524
  • See All >>

Tags

Outdoors305
California236
Hiking233
CPP120
Mathematics102
Deep Learning84
Photography79
CUDA71
Running63
Wildlife56
Bird50
Racing41
Python36
Software Engineering36
Machine Learning34
Movie33
Statistics32
China31
NVIDIA31
Park31
See All >>
Lei Mao's Log Book

© 2017-2026 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×