degeneration
2022-08-17 17:39:29 0 举报
AI智能生成
a map illustrating the main idea about NLG degeneration
作者其他创作
大纲/内容
learning-based<br>
CTRL-baseline<br>
mentioned <b>fine-grained</b> control
train on seq of raw text prepended with <b>control codes</b>
famous for story generation
unlikelihood training-baseline
new loss
seq-level & token-level
simCTG
new loss
CTLoss
new loss
decoding-based<br>a.k.a. guided decoding
basic-baseline
random sample
topp
topk
temperature
beam search
create forbid set
heuristic
heuristic random sample
heuristic beam search - beast first beam search
heuristic guided decoding strategy
gamma sampling
raise diversity
<b>fine-grained </b>control including topic, end, repeatness
composition decoding
random sample+beam search
typical decoding
select based on conditional entropy
direct beam search
conbine <b>similarity</b> and beam search
future
prompt?
other guided decoding
previous question answering
modeling decoding strategy selection
discussion
"n best search"
<b>intrinsic uncertainty </b>could be solved by increase num_beam<br>harder to be trapped in <b>local optimum</b>
decoding limitation
<b>different task </b>would have different optimal decoding strategy
<b>high quality and high prob positive/negative correlation</b> upon different task
high quality!=high prob
appropriate<b> information entropy </b>is decisive upon quality (encourage model to generate tokens based on <b>information</b>)<br>information theory related
new idea
gamma-enhanced sample
idea : gamma sampling treat equally every homoionyms<br>this new sample method treat homoionyms differently sorted by their <b>cosine similarity towards keywords appeared in source text</b>
<span style="font-size: inherit;">change topic_tokens to key_words directly : works</span><br>
add key_gamma : doesn't work
change probs before gamma sampling
delete the extreme condition dealing : works
answer towards previous question
收藏
0 条评论
下一页
为你推荐
查看更多