This poster has far too few followers, considering the high-quality and/or thought-provoking taste of what he posts.

Oct 18, 12:33
I think the observation that LLMs are "bad tutors", in that they cannot precisely probe understanding, is accurate. It is also true that "upweighting the entire rollout" is stupid. However, it's not obvious to me that the remedy for that is LLM reflection on "what went well". I think this runs into very similar issues of collapse risk or misallocation of supervision. While we might be sucking supervision through a straw, the only thing that's even worse is sucking tainted supervision through a straw.
That's not to say Mike is some niche poster, but I just wonder how many junk accounts have 2-10x as many followers.

