A Reinforcement Learning Based Universal Sequence Design for Polar Codes
To advance Polar code design for 6G applications, we develop a reinforcement learning-based universal sequence design framework that is extensible and adaptable to diverse channel conditions and decoding strategies. Crucially, our method scales to code lengths up to 2048, making it suitable for use in standardization. Across all configurations supported in 5G, our approach achieves competitive performance relative to the NR sequence adopted in 5G and yields up to a 0.2 dB gain over the beta-expansion baseline at . We further highlight the key elements that enabled learning at scale: (i) incorporation of physical law constrained learning grounded in the universal partial order property of Polar codes, (ii) exploitation of the weak long term influence of decisions to limit lookahead evaluation, and (iii) joint multi-configuration optimization to increase learning efficiency.
Related readings and updates.
BISCUIT: Scaffolding LLM-Generated Code with Ephemeral UIs in Computational Notebooks
August 3, 2024research area Human-Computer Interaction, research area Tools, Platforms, Frameworksconference IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC)
This paper was accepted at IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC) 2024.
Programmers frequently engage with machine learning tutorials in computational notebooks and have been adopting code generation technologies based on large language models (LLMs). However, they encounter difficulties in understanding and working with code produced by LLMs. To mitigate these challenges, we introduce a novel workflow into…
On a Neural Implementation of Brenier’s Polar Factorization
July 16, 2024research area Methods and Algorithmsconference ICML
In 1991, Brenier proved a theorem that generalizes the polar decomposition for square matrices — factored as PSD unitary — to any vector field . The theorem, known as the polar factorization theorem, states that any field can be recovered as the composition of the gradient of a convex function with a measure-preserving map , namely . We propose a practical…
