Lei Mao's Log Book

CUDA Local Memory

 03-19-2025 03-19-2025 blog 12 minutes read (About 1835 words)
Is Local Array Placed In Registers or In Local Memory?

 
CUDA, GPU

CUDA Performance Hot VS Cold Measurement

 03-12-2025 03-12-2025 blog 8 minutes read (About 1200 words)
Flushing GPU L2 Cache

 
CPP, CUDA, NVIDIA, GPU, Nsight Compute

Filing Claims for USPS Missing Parcels

 03-07-2025 03-07-2025 blog 5 minutes read (About 792 words)
Disappointing Experience with USPS

 
USPS, Stamp, Insurance

Field of View

 03-03-2025 03-03-2025 blog 4 minutes read (About 627 words)
Mathematical Derivation of Field of View Equation

 
Mathematics, Physics, Camera, Photography

Firefox Installation In Ubuntu 24.04 Docker Images

 02-28-2025 02-28-2025 blog 3 minutes read (About 473 words)
Install Firefox In New Ubuntu Docker Images

 
Docker, Ubuntu, Firefox

Depth of Field

 02-23-2025 02-23-2025 blog 13 minutes read (About 2016 words)
Mathematical Derivation of Depth of Field Equation

 
Mathematics, Physics, Camera, Photography

Dell Reboot Performs Shutdown Issue

 02-20-2025 02-20-2025 blog 2 minutes read (About 339 words)
Dell BIOS Caused Reboot Anomaly

 
Dell, BIOS

C++ Load and Save Npy File Load Using xtensor

 02-16-2025 02-16-2025 blog 5 minutes read (About 708 words)
Quick and Convenient Library for Numpy File Load and Save in C++

 
CPP, Python, Software Engineering, Numpy, xtensor

Get Available Versions for Python Pip Package

 02-10-2025 02-10-2025 blog 3 minutes read (About 387 words)
The Python Officially Supported Approaches

 
Python, Pip

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, Neural Network, Transformer, Computer Architecture, Performance Optimization, Large Language Model

Fix NVIDIA Driver After Ubuntu Unattended Upgrade

 01-30-2025 01-30-2025 blog 2 minutes read (About 303 words)
A Quick and Safe Log for Fixing NVIDIA Driver

 
NVIDIA, Ubuntu, Driver

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, Neural Network, Transformer, Computer Architecture, Performance Optimization, Large Language Model
© 2017-2026 Lei Mao  Powered by Hexo & Icarus