Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • Performance Optimization

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, 
Neural Network, 
Transformer, 
Computer Architecture, 
Performance Optimization, 
Large Language Model  
  Read More

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, 
Neural Network, 
Transformer, 
Computer Architecture, 
Performance Optimization, 
Large Language Model  
  Read More
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1240

Categories

8

Tags

771

  Follow   Sponsor

Advertisement


Categories

  • article20
  • blog548
  • essay314
  • life279
  • miscellaneous2
  • photography49
  • project20
  • reading8

follow.it

Recents

12-02-2025

Replacing Thinkpad X1 Yoga CMOS Battery

blog

11-30-2025

避免使用劣质湿巾

essay

11-28-2025

Ed R. Levin County Park

photography

11-28-2025

Ed R. Levin County Park 徒步

life

11-27-2025

血谜拼图

essay

Archives

  • December 20251
  • November 202525
  • October 202524
  • September 202515
  • August 202527
  • See All >>

Tags

Outdoors284
Hiking217
California215
CPP116
Mathematics102
Deep Learning84
CUDA66
Photography62
Running57
Wildlife41
Bird36
Racing36
Software Engineering36
Machine Learning34
Python34
Movie32
Statistics32
Park31
Linux30
China29
See All >>
Lei Mao's Log Book

© 2017-2025 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×