The advancement of unmanned aerial vehicle (UAV) technology, coupled with breakthroughs in artificial intelligence (AI), has significantly expanded the potential of drone swarms, particularly within ...
Microrobotics has emerged as a transformative technology for biomedical applications, enabling unprecedented capabilities in targeted drug delivery [1], minimally invasive surgery [2], cell ...
Reinforcement Learning with Verifiable Rewards (RLVR) often suffers from Recursive Space Contraction (RSC), where the policy irreversibly collapses into narrow reasoning paths, sacrificing diversity ...
Latent Thought Policy Optimization (LTPO) is a parameter-free framework that enhances Large Language Model (LLM) reasoning entirely at test time by treating intermediate "thought" vectors as dynamic ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results