Autonomous Driving Trinity: Vision, Natural Language, and Action 09-15-2023 09-15-2023 blog 4 minutes read (About 553 words)Vision, Natural Language, and Action Closed-Loop Driving Model Proposal from Wayve Computer Vision, Natural Language Processing, Autonomous Driving Read more
OpenAI GPT Models 04-15-2023 04-15-2023 article 28 minutes read (About 4161 words)Generative Pre-Trained Transformer Models From OpenAI Deep Learning, Natural Language Processing, Transformer, Reinforcement Learning, OpenAI, GPT, ChatGPT, InstructGPT Read more
Transformer Autoregressive Inference Optimization 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words)Principles for Faster Transformer Inference Deep Learning, Inference, Natural Language Processing, Optimization, Transformer, Accelerated Computing Read more
Gated Linear Units (GLU) and Gated CNN 03-09-2020 03-09-2020 blog 6 minutes read (About 826 words)Gated Linear Units and Gated CNN Explained Machine Learning, Deep Learning, Natural Language Processing Read more
Evolved Transformer Explained 03-07-2020 03-07-2020 blog 5 minutes read (About 774 words)Evolution Search on Transformer Neural Network Architectures Machine Learning, Deep Learning, Natural Language Processing, Transformer, Neural Architecture Search, Evolution Algorithm Read more
Bilingual Evaluation Understudy (BLEU) 11-17-2019 11-17-2019 blog 11 minutes read (About 1644 words)Elucidating the Machine Translation BLEU Scoring Mechanism Natural Language Processing, Machine Translation Read more
Word2Vec Models Revisited 08-23-2019 08-23-2019 article 23 minutes read (About 3524 words)A Review on the Classic Word2Vec Models Machine Learning, Natural Language Processing, Optimization Read more
Hierarchical Softmax 08-17-2019 08-17-2019 article 8 minutes read (About 1191 words)Orchestrated Softmax for Fast Language Model Training Machine Learning, Natural Language Processing, Optimization Read more
Noise Contrastive Estimation 07-26-2019 07-26-2019 article 30 minutes read (About 4509 words)Accelerate the Training of Neural Language Models Machine Learning, Natural Language Processing, Optimization Read more
Byte Pair Encoding 07-19-2019 07-19-2019 blog 16 minutes read (About 2377 words)Common Tokenization Method for Natural Language Processing Natural Language Processing, Information Theory, Byte Pair Encoding Read more