Deformable Attention 12-16-2023 12-16-2023 blog 34 minutes read (About 5060 words) Attention with Learned Spatial Feature Sampling Deep Learning, Computer Vision, Transformer, Attention
OpenAI GPT Models 04-15-2023 04-15-2023 article 28 minutes read (About 4161 words) Generative Pre-Trained Transformer Models From OpenAI Deep Learning, Natural Language Processing, Transformer, Reinforcement Learning, OpenAI, GPT, ChatGPT, InstructGPT
Transformer Autoregressive Inference Optimization 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words) Principles for Faster Transformer Inference Deep Learning, Inference, Natural Language Processing, Optimization, Transformer, Accelerated Computing
PyTorch Dynamic Quantization 11-14-2020 04-29-2021 blog 8 minutes read (About 1193 words) PyTorch Dynamic Quantization for Transformers Transformer, PyTorch, HuggingFace, Quantization
Evolved Transformer Explained 03-07-2020 03-07-2020 blog 5 minutes read (About 774 words) Evolution Search on Transformer Neural Network Architectures Machine Learning, Deep Learning, Natural Language Processing, Transformer, Neural Architecture Search, Evolution Algorithm
Transformer Explained In One Single Page 06-01-2019 02-09-2023 blog 23 minutes read (About 3453 words) Attention is All You Need Mathematically Deep Learning, Natural Language Processing, Transformer, Machine Translation