Lei Mao's Log Book
  • Tags
  • GPU

Perfetto GPU Flow Artifacts

 02-20-2026 02-20-2026 blog 6 minutes read (About 952 words)
Understanding and Resolving Flow Artifacts in Perfetto GPU Profiling Traces

 
GPU, Perfetto

CUDA Shared Memory Bank Conflict-Free Vectorized Access

 02-13-2026 02-13-2026 blog 14 minutes read (About 2060 words)
Instruction-Level Phase Based Bank Conflict-Free Execution

 
CUDA, NVIDIA, Parallel Computing, GPU

CUDA Rendezvous Stream

 01-26-2026 01-26-2026 blog 11 minutes read (About 1690 words)
Simplifying Synchronization Complexities Using CUDA Rendezvous Streams

 
CUDA, NVIDIA, Parallel Computing, GPU

NVIDIA NVML GPU Statistics

 12-25-2025 12-25-2025 blog 15 minutes read (About 2214 words)
Mimicking nvidia-smi dmon Using NVIDIA NVML

 
CPP, CUDA, NVIDIA, GPU, NVML

Install NVIDIA RTX 5080

 12-10-2025 12-10-2025 blog 5 minutes read (About 703 words)
Installing NVIDIA RTX 5080 on an Old Desktop

 
NVIDIA, Ubuntu, GPU

Setting Up Environment Variables In SSH Sessions Over TCP On Runpod

 10-10-2025 10-10-2025 blog 12 minutes read (About 1785 words)
Fixing an Environment Variables Issue for Runpod

 
CUDA, NVIDIA, Docker, GPU, Cloud Computing, Runpod, IDE, SSH

Setting Up Remote Development Using Custom Template On Runpod

 10-08-2025 10-13-2025 blog 12 minutes read (About 1814 words)
Custom Remote Development Using GPUs on Runpod

 
CUDA, NVIDIA, Docker, GPU, Cloud Computing, Runpod, IDE, SSH

CUDA Local Memory

 03-19-2025 03-19-2025 blog 12 minutes read (About 1835 words)
Is Local Array Placed In Registers or In Local Memory?

 
CUDA, GPU

CUDA Performance Hot VS Cold Measurement

 03-12-2025 03-12-2025 blog 8 minutes read (About 1200 words)
Flushing GPU L2 Cache

 
CPP, CUDA, NVIDIA, GPU, Nsight Compute

NVIDIA GPU Compute Capability

 01-02-2025 01-22-2026 blog 15 minutes read (About 2202 words)
A Table of NVIDIA GPUs and Their Compute Capabilities

 
CUDA, NVIDIA, GPU

SMPlayer GPU Acceleration

 12-06-2024 12-07-2024 blog 2 minutes read (About 328 words)
Playing Videos with GPU Acceleration in SMPlayer

 
CUDA, Linux, GPU, SMPlayer

PyTorch Eager Mode Quantization TensorRT Acceleration

 05-24-2024 05-24-2024 blog 7 minutes read (About 1051 words)
TensorRT Acceleration for PyTorch Native Eager Mode Quantization Models

 
Deep Learning, Python, Inference, Quantization, Accelerated Computing, NVIDIA, TensorRT, PyTorch, GPU
© 2017-2026 Lei Mao  Powered by Hexo & Icarus