Lei Mao's Log Book
  • Tags
  • GPU

Perfetto GPU Flow Artifacts

 02-20-2026 02-20-2026 blog 6 minutes read (About 952 words)
Understanding and Resolving Flow Artifacts in Perfetto GPU Profiling Traces

 
GPU, 
Perfetto  

CUDA Shared Memory Bank Conflict-Free Vectorized Access

 02-13-2026 02-13-2026 blog 14 minutes read (About 2060 words)
Instruction-Level Phase Based Bank Conflict-Free Execution

 
CUDA, 
NVIDIA, 
Parallel Computing, 
GPU  

CUDA Rendezvous Stream

 01-26-2026 01-26-2026 blog 11 minutes read (About 1690 words)
Simplifying Synchronization Complexities Using CUDA Rendezvous Streams

 
CUDA, 
NVIDIA, 
Parallel Computing, 
GPU  

NVIDIA NVML GPU Statistics

 12-25-2025 12-25-2025 blog 15 minutes read (About 2214 words)
Mimicking nvidia-smi dmon Using NVIDIA NVML

 
CPP, 
CUDA, 
NVIDIA, 
GPU, 
NVML  

Install NVIDIA RTX 5080

 12-10-2025 12-10-2025 blog 5 minutes read (About 703 words)
Installing NVIDIA RTX 5080 on an Old Desktop

 
NVIDIA, 
Ubuntu, 
GPU  

Setting Up Environment Variables In SSH Sessions Over TCP On Runpod

 10-10-2025 10-10-2025 blog 12 minutes read (About 1785 words)
Fixing an Environment Variables Issue for Runpod

 
CUDA, 
NVIDIA, 
Docker, 
GPU, 
Cloud Computing, 
Runpod, 
IDE, 
SSH  

Setting Up Remote Development Using Custom Template On Runpod

 10-08-2025 10-13-2025 blog 12 minutes read (About 1814 words)
Custom Remote Development Using GPUs on Runpod

 
CUDA, 
NVIDIA, 
Docker, 
GPU, 
Cloud Computing, 
Runpod, 
IDE, 
SSH  

CUDA Local Memory

 03-19-2025 03-19-2025 blog 12 minutes read (About 1835 words)
Is Local Array Placed In Registers or In Local Memory?

 
CUDA, 
GPU  

CUDA Performance Hot VS Cold Measurement

 03-12-2025 03-12-2025 blog 8 minutes read (About 1200 words)
Flushing GPU L2 Cache

 
CPP, 
CUDA, 
NVIDIA, 
GPU, 
Nsight Compute  

NVIDIA GPU Compute Capability

 01-02-2025 01-22-2026 blog 15 minutes read (About 2202 words)
A Table of NVIDIA GPUs and Their Compute Capabilities

 
CUDA, 
NVIDIA, 
GPU  

SMPlayer GPU Acceleration

 12-06-2024 12-07-2024 blog 2 minutes read (About 328 words)
Playing Videos with GPU Acceleration in SMPlayer

 
CUDA, 
Linux, 
GPU, 
SMPlayer  

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, 
Python, 
Inference, 
Quantization, 
Accelerated Computing, 
NVIDIA, 
TensorRT, 
PyTorch, 
GPU  
Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

© 2017-2026 Lei Mao  Powered by Hexo & Icarus