Why we need an attention mechanism when dealing with NLP