Sooftware NLP - Efficient Attention Paper Review
![Sooftware NLP - Efficient Attention Paper Review](/static/2cdc9b7481cd5e517d106c8618db52d3/a2d8d/efficient_attention.png)
Efficient Attention: Attention with Linear Complexities
- Shen Zhuoran et al.
Abstract
- The memory & computation cost of dot-product attention grows quadratically with the input length
- Proposes a small modification to the attention mechanism that substantially reduces memory & computation cost (the sketch below makes the quadratic-vs-linear gap concrete)
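To make that gap concrete, here is a small back-of-the-envelope sketch. The sequence lengths and the head dimension d=64 are illustrative assumptions, not numbers from the paper:

```python
# Memory for the key intermediate tensor alone (fp32), ignoring Q/K/V storage.
# Dot-product attention materializes an n x n attention map, while efficient
# attention's intermediate is a d x d "global context" matrix that does not
# grow with the sequence length n.
def attention_map_bytes(n: int) -> int:
    return n * n * 4            # n x n fp32 attention map

def global_context_bytes(d: int) -> int:
    return d * d * 4            # d x d fp32 global context matrix

for n in (1_024, 8_192, 65_536):
    print(f"n={n:>6}: dot-product {attention_map_bytes(n) / 2**20:>10.1f} MiB, "
          f"efficient {global_context_bytes(64) / 2**20:>7.4f} MiB (d=64)")
```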
Method
![](https://www.pragmatic.ml/content/images/2020/06/image-13.png)
- Unlike the conventional formulation, which first computes query-key similarities via dot products, the keys and values are multiplied first (see the PyTorch sketch after the equations)
- Dot-product: D(Q, K, V) = ρ(QKᵀ)V, which materializes the n × n matrix QKᵀ
![](https://user-images.githubusercontent.com/42150335/121996703-0da3ac80-cde4-11eb-9870-e710b6b13c53.png)
- Efficient: E(Q, K, V) = ρ_q(Q)(ρ_k(K)ᵀV), where ρ_q applies a softmax along each row of Q and ρ_k along each column of K
![](https://user-images.githubusercontent.com/42150335/121996782-2f9d2f00-cde4-11eb-8c73-823f775a42f7.png)
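A minimal PyTorch sketch of both formulations, assuming a single head and no batch dimension (function and variable names are mine, not the paper's). Under scaling normalization the paper shows the two are exactly equivalent; under the softmax variant shown here they are closely similar rather than numerically identical:

```python
import torch
import torch.nn.functional as F

def dot_product_attention(q, k, v):
    """Standard attention: rho(Q K^T) V. The (n, n) map costs O(n^2) memory."""
    scores = q @ k.transpose(-2, -1)            # (n, n) attention map
    return F.softmax(scores, dim=-1) @ v        # (n, d_v)

def efficient_attention(q, k, v):
    """Efficient attention: rho_q(Q) (rho_k(K)^T V). Intermediate is (d_k, d_v)."""
    q = F.softmax(q, dim=-1)                    # normalize each query over d_k
    k = F.softmax(k, dim=0)                     # normalize each channel over n positions
    context = k.transpose(-2, -1) @ v           # (d_k, d_v) global context vectors
    return q @ context                          # (n, d_v), linear in n

n, d_k, d_v = 4096, 64, 64
q, k, v = torch.randn(n, d_k), torch.randn(n, d_k), torch.randn(n, d_v)
out = efficient_attention(q, k, v)              # same (n, d_v) output shape as standard
```

The key design point is associativity: computing ρ_k(K)ᵀV first yields a small d_k × d_v matrix, so the n × n attention map is never materialized.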
Experiment
![](https://user-images.githubusercontent.com/42150335/121996832-4774b300-cde4-11eb-8050-b0f7e00f343d.png)
- Resource comparison between the conventional attention and the proposed attention => confirms that the proposed method is substantially more efficient in memory and computation
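As a quick sanity check of the efficiency claim, the two sketches above can be timed at growing sequence lengths. This is my own micro-benchmark, not the paper's measurement protocol, and it reuses the functions defined in the Method section:

```python
import time
import torch

for n in (1_024, 4_096, 16_384):
    q, k, v = torch.randn(n, 64), torch.randn(n, 64), torch.randn(n, 64)
    for fn in (dot_product_attention, efficient_attention):
        start = time.perf_counter()
        fn(q, k, v)
        print(f"n={n:>6} {fn.__name__:>21}: {time.perf_counter() - start:.4f}s")
```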
![](https://user-images.githubusercontent.com/42150335/121997009-90c50280-cde4-11eb-9387-4b4819fcb251.png)
- 성능 면에서도 더 좋은 결과가 나왔다는 표