petrescatraian@libranet.de to Technology@beehaw.org · 8 days agoDeepseek when asked about sensitive topicsi.postimg.ccimagemessage-square85fedilinkarrow-up1319arrow-down10file-text
arrow-up1319arrow-down1imageDeepseek when asked about sensitive topicsi.postimg.ccpetrescatraian@libranet.de to Technology@beehaw.org · 8 days agomessage-square85fedilinkfile-text
minus-squareAatube@kbin.melroy.orglinkfedilinkarrow-up1·8 days agoDid you use the -Zero model, which doesn’t have the “cold-start data before RL” which prevents it from language mixing?
Did you use the -Zero model, which doesn’t have the “cold-start data before RL” which prevents it from language mixing?