Autonomous Driving Trinity: Vision, Natural Language, and Action 09-15-2023 09-15-2023 blog 4 minutes read (About 555 words)Vision, Natural Language, and Action Closed-Loop Driving Model Proposal from Wayve Computer Vision, Natural Language Processing, Autonomous Driving Read More
OpenAI GPT Models 04-15-2023 04-15-2023 article 28 minutes read (About 4168 words)Generative Pre-Trained Transformer Models From OpenAI Deep Learning, Natural Language Processing, Reinforcement Learning, Transformer, GPT, OpenAI, ChatGPT, InstructGPT Read More
Transformer Autoregressive Inference Optimization 04-06-2023 04-06-2023 article 27 minutes read (About 4084 words)Principles for Faster Transformer Inference Deep Learning, Inference, Natural Language Processing, Transformer, Optimization, Accelerated Computing Read More
Gated Linear Units (GLU) and Gated CNN 03-09-2020 03-09-2020 blog 6 minutes read (About 827 words)Gated Linear Units and Gated CNN Explained Machine Learning, Deep Learning, Natural Language Processing Read More
Evolved Transformer Explained 03-07-2020 03-07-2020 blog 5 minutes read (About 777 words)Evolution Search on Transformer Neural Network Architectures Machine Learning, Deep Learning, Natural Language Processing, Transformer, Neural Architecture Search, Evolution Algorithm Read More
Bilingual Evaluation Understudy (BLEU) 11-17-2019 11-17-2019 blog 11 minutes read (About 1644 words)Elucidating the Machine Translation BLEU Scoring Mechanism Natural Language Processing, Machine Translation Read More
Word2Vec Models Revisited 08-23-2019 08-23-2019 article 24 minutes read (About 3525 words)A Review on the Classic Word2Vec Models Machine Learning, Natural Language Processing, Optimization Read More
Hierarchical Softmax 08-17-2019 08-17-2019 article 8 minutes read (About 1194 words)Orchestrated Softmax for Fast Language Model Training Machine Learning, Natural Language Processing, Optimization Read More
Noise Contrastive Estimation 07-26-2019 04-17-2024 article 33 minutes read (About 4917 words)Accelerate the Training of Neural Language Models Machine Learning, Natural Language Processing, Optimization Read More
Byte Pair Encoding 07-19-2019 07-19-2019 blog 16 minutes read (About 2377 words)Common Tokenization Method for Natural Language Processing Information Theory, Natural Language Processing, Byte Pair Encoding Read More