Lei Mao's Log Book
Tag: Large Language Model

Grouped Query Attention Performance Theoretical Analysis

02-03-2025 (updated 03-02-2025), blog, 11 minutes read (about 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model

Transformer Vanilla Attention Performance Theoretical Analysis

01-27-2025 (updated 03-02-2025), blog, 9 minutes read (about 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model
© 2017-2026 Lei Mao  Powered by Hexo & Icarus