[读论文] Copying Mechanism in Sequence-to-Sequence

​Paper Today:

'Incorporating Copying Mechanism in Sequence-to-Sequence Learning'

This paper develops a model called COPYNET which performs well in an important mechanism called 'copy mechanism'.

In human language communication, there are many situations that we will use 'copy mechanism', such as in a dialogue:

In order to make machine generate such dialogue, there are two things to do.

  • First, to identify what should be copied.
  • Second, to decide where the copy part should be addressed.

Currently there are some popular models like seq2seq, and adding Attention Mechanism to seq2seq.
COPYNET is also an encoder-decoder model, but a different strategy in neural network based models.
RNN and Attention Mechanism requires more 'understanding', but COPYNET requires high 'literal fidelity'.

There are mainly 3 improvements in the decoder part.
Prediction:
Based on the mix of two probabilistic modes, generate mode and copy mode, the model can pick the proper subsentence and generate some OOV words.

State Update:
There's a minor change that they designed a selective read for copy mode, which enables the model to notice the location information.

Reading M:
This model can get a hybrid of content based addressing and location based addressing.

In the experiment, this model did very well in tasks like text summarization.

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • rljs by sennchi Timeline of History Part One The Cognitiv...
    sennchi阅读 7,449评论 0 10
  • 0到3个月的感觉统合发展。因而所需要的是能够自我调节适应新环境新的生活作息,情绪安稳少哭闹。睡眠时间逐渐发展出日夜...
    温明春晓阅读 136评论 0 0
  • 黄皮书 口语 1.你要买什么呢? 2.日用品,还有食物。冰箱里已经都空了。 3.是吗。 4.哦,对了。还有我想买瑜...
    scmsuki阅读 214评论 0 0
  • 【前言】如今思维导图相关知识很多,甚至还有各种所谓的思维导图的比赛,稍微看了一下觉得有点偏离我们最开始的初衷了,我...
    李庆文阅读 1,189评论 1 6