Lei Mao's Log Book
Lei Mao's Log BookCurriculumBlogArticlesProjectsPublicationsReadingsLifeEssayPhotographyArchivesCategoriesTagsFAQs
  • Tags
  • Large Language Model

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, 
Neural Network, 
Transformer, 
Performance Optimization, 
Computer Architecture, 
Large Language Model  
  Read More

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, 
Neural Network, 
Transformer, 
Performance Optimization, 
Computer Architecture, 
Large Language Model  
  Read More
Lei Mao

Lei Mao

Artificial Intelligence Machine Learning Computer Science

Menlo Park, California

Posts

1348

Categories

8

Tags

810

  Follow   Sponsor

Advertisement


Categories

  • article21
  • blog571
  • essay343
  • life312
  • miscellaneous2
  • photography71
  • project20
  • reading8

follow.it

Recents

05-04-2026

《寻秦记》电影版

essay

05-03-2026

ICML 2026 Area Chair Experience

blog

05-02-2026

2026 Foster City 5K Fun Run 竞赛

life

04-30-2026

2026 年 3 月和 4 月该入手的模型手办

essay

04-29-2026

Docker Container GUI Display Using Wayland

blog

Archives

  • May 20263
  • April 202618
  • March 202618
  • February 202617
  • January 202616
  • See All >>

Tags

Outdoors317
California248
Hiking239
CPP121
Mathematics102
Deep Learning87
Photography85
CUDA74
Running71
Wildlife62
Bird56
Racing47
Movie38
Python36
Software Engineering36
Machine Learning35
China32
Linux32
NVIDIA32
Statistics32
See All >>
Lei Mao's Log Book

© 2017-2026 Lei Mao  Powered by Hexo & Icarus
Site UV:  Site PV:

×