Our code is based on open-r1, with our customized Trainer for mixed SFT+GRPO training. Some other updates focus on the white-box RL (reward function design) and post-completion training (replacement ...
Abstract: Maximum Distance Profile (MDP) convolutional codes are an important class of channel codes due to their maximal delay-constrained error correction ...
The Book Completion Award (BCA) supports faculty who are developing their research projects into publishable book manuscripts. Funds are awarded on a competitive basis to faculty in the arts, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results