Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • CPP

PyTorch Export

 03-31-2026 04-01-2026 blog 6 minutes read (About 857 words)
Exporting Graph-Representable PyTorch Models for Inference

 
CPP, 
Inference, 
PyTorch  
  Read More

C++ Latch and Barrier

 02-06-2026 02-06-2026 blog 8 minutes read (About 1154 words)
Scheduling and Synchronizing Threads Using std::latch and std::barrier

 
CPP, 
Multithreading, 
Parallel Programming  
  Read More

NVIDIA NVML GPU Statistics

 12-25-2025 12-25-2025 blog 15 minutes read (About 2214 words)
Mimicking nvidia-smi dmon Using NVIDIA NVML

 
CPP, 
CUDA, 
NVIDIA, 
GPU, 
NVML  
  Read More

Radix Sort

 12-18-2025 12-18-2025 blog 19 minutes read (About 2808 words)
A Non-Comparative Sorting Algorithm

 
CPP, 
Python, 
Algorithm  
  Read More

NVIDIA Tensor Core TN Layout MMA Instruction

 12-06-2025 12-06-2025 blog 16 minutes read (About 2389 words)
GEMM Layout, History, Performance, and Implementation

 
CPP, 
CUDA, 
NVIDIA, 
CUTLASS, 
CuTe, 
MMA, 
GEMM, 
Tensor Core  
  Read More

Benchmarking NVIDIA Tensor Core MMA Instruction Peak Performances

 11-26-2025 11-26-2025 blog 11 minutes read (About 1646 words)
Reproducing NVIDIA Advertised GPU AI Peak Performances Using CUTLASS and CuTe

 
CPP, 
CUDA, 
NVIDIA, 
CUTLASS, 
CuTe, 
MMA, 
GEMM, 
Tensor Core  
  Read More

Core Dump and GDB

 11-15-2025 11-15-2025 blog 7 minutes read (About 1029 words)
Analyzing Core Dump Files Using GDB

 
CPP, 
GDB, 
Core Dump  
  Read More

AddressSanitizer

 09-27-2025 09-27-2025 blog 21 minutes read (About 3161 words)
Compile-Time Instrumentation for Detecting Memory Errors

 
CPP, 
CMake, 
GCC, 
Memory Error  
  Read More

Illegal Memory Access and Segmentation Fault

 08-27-2025 08-27-2025 blog 9 minutes read (About 1381 words)
Memory Access Boundary Checking

 
CPP, 
Operating System, 
Memory Management, 
Memory Safety  
  Read More

Floating Point Constant Values In C++, CUDA, and Python

 08-22-2025 08-22-2025 blog 6 minutes read (About 889 words)
Essential Constants for Numerical Algorithms and Scientific Computations

 
CPP, 
Python, 
CUDA  
  Read More

Load CUDA Kernel at Runtime Using CUDA Driver APIs

 06-30-2025 06-30-2025 blog an hour read (About 11131 words)
Dynamically Loading CUDA Kernels

 
CPP, 
CUDA  
  Read More

TensorRT Static Plugin VS Dynamic Plugin

 06-05-2025 06-05-2025 blog 13 minutes read (About 1941 words)
Managing The Lifetime and Registration of TensorRT Plugins

 
CPP, 
Deep Learning, 
TensorRT  
  Read More
Previous
Next
  • 1
  • 2
  • …
  • 11
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1348

Categories

8

Tags

810

  Follow   Sponsor

Advertisement


Categories

  • article21
  • blog571
  • essay343
  • life312
  • miscellaneous2
  • photography71
  • project20
  • reading8

follow.it

Recents

05-04-2026

《寻秦记》电影版

essay

05-03-2026

ICML 2026 Area Chair Experience

blog

05-02-2026

2026 Foster City 5K Fun Run 竞赛

life

04-30-2026

2026 年 3 月和 4 月该入手的模型手办

essay

04-29-2026

Docker Container GUI Display Using Wayland

blog

Archives

  • May 20263
  • April 202618
  • March 202618
  • February 202617
  • January 202616
  • See All >>

Tags

Outdoors317
California248
Hiking239
CPP121
Mathematics102
Deep Learning87
Photography85
CUDA74
Running71
Wildlife62
Bird56
Racing47
Movie38
Python36
Software Engineering36
Machine Learning35
China32
Linux32
NVIDIA32
Statistics32
See All >>
Lei Mao's Log Book

© 2017-2026 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×