Lei Mao's Log Book
  • Tags
  • Transformer

Grouped Query Attention Performance Theoretical Analysis

 02-03-2025 03-02-2025 blog 11 minutes read (About 1612 words)
Sharing Key and Value Tensors for a Group of Query Tensors to Mitigate Transformer Attention Layer Performance Bottleneck

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model

Transformer Vanilla Attention Performance Theoretical Analysis

 01-27-2025 03-02-2025 blog 9 minutes read (About 1275 words)
Performance Bottleneck for Serving Transformer Models

 
Deep Learning, Neural Network, Transformer, Performance Optimization, Computer Architecture, Large Language Model

Deformable Attention

 12-16-2023 12-16-2023 blog 34 minutes read (About 5063 words)
Attention with Learned Spatial Feature Sampling

 
Deep Learning, Computer Vision, Transformer, Attention

OpenAI GPT Models

 04-15-2023 04-15-2023 article 28 minutes read (About 4168 words)
Generative Pre-Trained Transformer Models From OpenAI

 
Deep Learning, Natural Language Processing, Transformer, Reinforcement Learning, OpenAI, GPT, ChatGPT, InstructGPT

Transformer Autoregressive Inference Optimization

 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words)
Principles for Faster Transformer Inference

 
Deep Learning, Inference, Natural Language Processing, Optimization, Accelerated Computing, Transformer

PyTorch Dynamic Quantization

 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words)
PyTorch Dynamic Quantization for Transformers

 
Quantization, Transformer, PyTorch, HuggingFace

Evolved Transformer Explained

 03-07-2020 03-07-2020 blog 5 minutes read (About 777 words)
Evolution Search on Transformer Neural Network Architectures

 
Machine Learning, Deep Learning, Natural Language Processing, Transformer, Neural Architecture Search, Evolution Algorithm

Transformer Explained In One Single Page

 06-01-2019 02-09-2023 blog 23 minutes read (About 3454 words)
Attention is All You Need Mathematically

 
Deep Learning, Natural Language Processing, Transformer, Machine Translation
© 2017-2026 Lei Mao  Powered by Hexo & Icarus