Yam

Feeling, Coding, Thinking

2020

Transformer Code Notes

2020-04-23 • Feeling
  • Attention
  • Decoder
  • Encoder
  • Multi-Head Attention
  • NLP
  • Self-Attention
  • Transformer

Luong Attention Paper and Code Notes

2020-04-14 • Feeling
  • Attention
  • Luong Attention
  • NLP

Bahdanau Attention Paper Notes

2020-02-08 • Feeling
  • Attention
  • Bahdanau Attention
  • NLP
2019

Transformer Paper Notes

2019-08-04 • Feeling
  • Attention
  • NLP
  • Position-Encoding
  • Self-Attention
  • Transformer

Categories

  • Coding (48)
  • Feeling (62)
  • Thinking (16)

Recents

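  • The Making of an AI Engineer (Part 1)
  • SqueezeBERT Paper Notes
  • On Sentence Representations, Starting from Sentence-BERT
  • Bert-Flow Paper Notes
  • The History and Future of NLP Representations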

Tags

  • AE (1)
  • AI (35)
  • ALBERT (1)
  • AR (1)
  • AUC (1)
  • Accuracy (1)
  • Activation (1)
  • Algorithm (5)
  • Array (1)
  • Attention (4)
  • Automatic Speech Processing (1)
  • BERT (2)
  • Backtracking (1)
  • Backward (1)
  • Bahdanau Attention (1)
  • Bart (1)
  • Bayes (1)
  • Bert (6)
  • Bert-Flow (1)
  • Bi-LSTM (1)
  • Binary Search (3)
  • Blending (1)
  • Business (3)
  • C (2)
  • CNN (1)
  • CRF (1)
  • Calculus (1)
  • Catalan (1)
  • ChatBot (1)
  • Chi2 (1)
  • Classification (1)
  • Cognition (2)
  • Collaborative Filtering (1)
  • Computational Linguistics (1)
  • Computer (1)
  • Computer Science (4)
  • Confusing Labels (1)
  • Coordinate Ascent (1)
  • Cosine (1)
  • Cosine Similarity (1)
  • Ctrl (1)
  • DB (2)
  • DP (1)
  • Data Clearing (1)
  • Data Preprocess (1)
  • Data Science (7)
  • Data Structure (9)
  • Database (1)
  • DeBERTa (1)
  • Decoder (1)
  • Deep (1)
  • DeepGraph (1)
  • DeepLearning (3)
  • Dependence (1)
  • Diary (2)
  • Disentangled Attention (1)
  • DistilBERT (1)
  • Django (1)
  • Dynamic-Mask (1)
  • EDA (1)
  • EMD (1)
  • ERNIE (1)
  • Economics (1)
  • Elasticsearch (1)
  • Electra (1)
  • Elixir (2)
  • Embedding (2)
  • Encoder (1)
  • Entropy (2)
  • Evaluation (1)
  • FDW (1)
  • FSM (1)
  • Feature Engineering (1)
  • Feature-based (1)
  • Few-Shot (1)
  • Fine-tuning (1)
  • Forward (1)
  • Full-Text-Search (1)
  • Function Syntax (1)
  • Funk MF (1)
  • Funnel Transformer (1)
  • GBTD (1)
  • GELU (1)
  • GPT-2 (1)
  • GPU (1)
  • GSG (1)
  • Gan (1)
  • Glow (1)
  • Graph (2)
  • GraphQL (2)
  • Grid Grammar (1)
  • HMM (1)
  • Hard-SVM (1)
  • Hinge Loss (1)
  • IQR (1)
  • Imbalance Data (1)
  • Industry (1)
  • Information Theory (1)
  • Isolation Forest (1)
  • ItemCF (1)
  • Jaccard (1)
  • Job (1)
  • KKT (1)
  • KS (1)
  • Kernel (1)
  • Kernel Function (1)
  • Kernel Method (1)
  • Keyword (1)
  • Knowledge Graph (2)
  • LOF (1)
  • LR (1)
  • Language Model (1)
  • Lexicalism (1)
  • Linear Algebra (1)
  • Linear Sturcture (1)
  • Linked List (1)
  • LinkedList (2)
  • Lucene (1)
  • Luong Attention (1)
  • MF (1)
  • Machine (1)
  • Machine Learning (7)
  • Machine Translation (1)
  • Manacher (1)
  • Managemnt (3)
  • Markov (1)
  • Materialized Views (1)
  • Math (2)
  • Matplotlib (1)
  • Matrix Factorization (1)
  • Median (1)
  • Metric (1)
  • Minimum Edit Distance (1)
  • Minkowski (1)
  • Model Evaluation (1)
  • Module (1)
  • Multi-Head Attention (1)
  • Multiway Tree (1)
  • NER (1)
  • NLG (1)
  • NLM (1)
  • NLP (52)
  • NLU (1)
  • Neo4j (1)
  • Ngram (1)
  • Normalizing Flow (1)
  • NumPy (1)
  • Occupation (1)
  • Orientation (1)
  • P-R (1)
  • PEGASUS (1)
  • PageRank (1)
  • Palindromic (1)
  • Pandas (1)
  • Pearson (1)
  • Philosophy (2)
  • Phrase Structure Grammar (1)
  • Pooling (1)
  • Position-Encoding (1)
  • Postgres (2)
  • Pragmatic Automatic Processing (1)
  • Pre-training (2)
  • Precision (1)
  • Pretraining (2)
  • Probabilistic Grammar (1)
  • Probabilistic Model (1)
  • Psychology (2)
  • PyPI (1)
  • Python (18)
  • Quant (1)
  • Query (1)
  • Queue (1)
  • RELU (1)
  • RFE (1)
  • RMSE (1)
  • ROC (1)
  • Recall (1)
  • Recommendation (5)
  • Recursion (2)
  • Reformer (1)
  • Regex (1)
  • Regular Expression (1)
  • Reinforcement Learning (1)
  • Relationship Extraction (1)
  • Representation (1)
  • RoBERTa (1)
  • Rotated Sorted Array (1)
  • SMO (1)
  • SQL (2)
  • SVD++ (1)
  • SVM (2)
  • Seaborn (1)
  • Search (2)
  • Self-Attention (3)
  • Semantic Automatic Processing (1)
  • Semantic Similarity (1)
  • Sentence Similarity (1)
  • Sentence-BERT (1)
  • Siamese (1)
  • Sigmoid (1)
  • Similarity (1)
  • Simon (1)
  • Simpson Paradox (1)
  • Skill (1)
  • Slide (1)
  • Smoothing (1)
  • Soft-SVM (1)
  • Softmax (1)
  • Sort (2)
  • Spell Check (1)
  • SqueezeBERT (1)
  • Stack (1)
  • Stacking (1)
  • Statistics (1)
  • Stirling (1)
  • StratifiedKFold (1)
  • String (1)
  • Style (1)
  • Substring (1)
  • Summarization (1)
  • Swap (1)
  • System (2)
  • TanH (1)
  • Text Generation (1)
  • TextRank (1)
  • Thought (1)
  • Transformer (11)
  • Transformer-XL (1)
  • Tree (1)
  • Tuning (1)
  • Ubuntu (1)
  • Unity Operation (1)
  • UserCF (1)
  • Vagrant (1)
  • Valence (1)
  • VirtualBox (1)
  • Visualization (1)
  • Viterbi (1)
  • Voting (1)
  • WOE (1)
  • Wide (1)
  • Work (1)
  • XLNet (1)
  • Z-Score (1)
  • ZhouZhihua (1)
  • Zipf (1)
  • binning (1)
  • knowledge Graph (1)
  • node2vec (1)
  • ssh (1)

© 2021 Yam All rights reserved.

Powered by Hexo