star-bits

Self-Taught Reasoner

A self-feeding amplification loop for reasoning-trace datasets
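One way to picture the loop is the toy sketch below: generate a reasoning trace per question, keep only traces whose final answer matches the known answer, train on the kept traces, and repeat. This is a simplified illustration, not the published procedure in full. The names `star_loop`, `toy_generate`, and `toy_finetune` are hypothetical, and the "model" is just an integer skill level standing in for a real language model.

```python
from typing import Callable, List, Tuple

Example = Tuple[str, str]      # (question, gold answer)
Trace = Tuple[str, str, str]   # (question, rationale, answer)


def star_loop(
    examples: List[Example],
    generate: Callable[[int, str], Tuple[str, str]],
    finetune: Callable[[int, List[Trace]], int],
    model: int,
    rounds: int = 3,
) -> Tuple[int, List[Trace]]:
    """Run a simplified self-feeding loop (illustrative assumption only)."""
    dataset: List[Trace] = []
    for _ in range(rounds):
        kept: List[Trace] = []
        for question, gold in examples:
            rationale, answer = generate(model, question)
            # Filter: keep a trace only if it reaches the correct answer.
            if answer == gold:
                kept.append((question, rationale, answer))
        dataset.extend(kept)
        # Feed the model's own filtered traces back in as training data.
        model = finetune(model, kept)
    return model, dataset


def toy_generate(model: int, question: str) -> Tuple[str, str]:
    """Hypothetical stand-in: solves a question if its level <= skill."""
    level = int(question)
    if level <= model:
        return (f"worked through level {level}", question)
    return ("guessed", "?")


def toy_finetune(model: int, kept: List[Trace]) -> int:
    """Hypothetical stand-in: skill rises by one if any trace was kept."""
    return model + 1 if kept else model


# Questions are difficulty levels; the gold answer equals the level.
toy_examples: List[Example] = [("1", "1"), ("2", "2"), ("3", "3")]
final_model, traces = star_loop(toy_examples, toy_generate, toy_finetune, model=1)
```

In this toy run the skill level grows each round because the previous round's kept traces unlock harder questions, which is the amplification effect the note refers to.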

Thoughts