Meta-Reinforcement Learning Optimization for Movable Antenna-aided Full-Duplex CF-DFRC Systems with Carrier Frequency Offset

Yue Xiu; Wanting Lyu; You Li; Ran Yang; Phee Lep Yeoh; Wei Zhang; Guangyi Liu; Ning Wei

doi:10.1109/TCOMM.2026.3669488

Back

Meta-Reinforcement Learning Optimization for Movable Antenna-aided Full-Duplex CF-DFRC Systems with Carrier Frequency Offset

Journal article

Peer reviewed

Meta-Reinforcement Learning Optimization for Movable Antenna-aided Full-Duplex CF-DFRC Systems with Carrier Frequency Offset

Yue Xiu, Wanting Lyu, You Li, Ran Yang, Phee Lep Yeoh, Wei Zhang, Guangyi Liu and Ning Wei

IEEE Transactions on Communications, Vol.74, pp.5803-5819

2026

DOI: https://doi.org/10.1109/TCOMM.2026.3669488

Related material

Has preprint

Meta-Reinforcement Learning Optimization for Movable Antenna-aided Full-Duplex C...

Abstract

cell-free

carrier frequency offset

dual-functinal radar communication

manifold optimization

penalty dual decomposition

meta-reinforcement learning

By enabling spectrum sharing between radar and communication operations, the cell-free dual-functional radar–communication (CF-DFRC) system is a promising candidate to significantly improve spectrum efficiency in future sixth-generation (6G) wireless networks. However, in wideband scenarios, synchronization errors caused by carrier frequency offset (CFO) can severely reduce both communication capacity and sensing accuracy, especially when multiple geographically distributed full-duplex (FD) access points (APs) are jointly coordinated. In this paper, we consider a wideband FD CF-DFRC system where each AP is equipped with movable antennas (MAs). This setting is fundamentally different from existing DFRC or MA-aided designs that typically assume fixed-position antennas, half-duplex operation, or perfect synchronization. First, we develop a field-response-based channel model and derive a worst-case weighted communication–sensing rate (WCSR) that explicitly captures the impact of inter-AP CFO on both the uplink communication signal-to-interference-plus-noise ratio (SINR) and the radar echo SINR. Our analysis reveals that CFO increases the Cramér–Rao lower bound (CRLB) of target position estimation, thereby degrading sensing accuracy. Based on this characterization, we formulate a robust worst-case WCSR maximization problem that jointly optimizes MA positions, transmit beamforming vectors, receive filters, and CFO-related parameters under transmit power and MA-position constraints. To tackle the resulting highly non-convex problem, we propose a two-stage robust optimization framework. In the first stage, we employ fractional programming together with manifold optimization (MO) and penalty dual decomposition (PDD) to solve the worst-case CFO subproblem on the complex unit-modulus manifold, thus obtaining a CFO-robust closed-form structure for the WCSR. In the second stage, we design a meta–reinforcement learning (MRL) based resource allocation scheme that jointly optimizes the MA positions and beamforming vectors in a data-driven manner for dynamic wireless environments. Unlike conventional deep reinforcement learning (DRL) methods, the proposed MRL framework learns a meta-policy that can rapidly adapt to varying channel and CFO realizations, substantially improving convergence speed and scalability. Simulation results show that the proposed robust MO–PDD–MRL framework significantly outperforms existing DRL-based and non-robust CF-DFRC schemes in terms of both communication and sensing performance under CFO impairments. Furthermore, compared to fixed-position antenna (FPA) architectures, the MA-aided CF-DFRC system exhibits markedly enhanced robustness and adaptability to CFO effects and target mobility.

Details

Title: Meta-Reinforcement Learning Optimization for Movable Antenna-aided Full-Duplex CF-DFRC Systems with Carrier Frequency Offset
Authors: Yue Xiu - University of Electronic Science and Technology of China
Wanting Lyu - University of Electronic Science and Technology of China
You Li - Southwest Research Institute
Ran Yang - University of Electronic Science and Technology of China
Phee Lep Yeoh - University of the Sunshine Coast, Queensland, School of Science, Technology and Engineering
Wei Zhang - UNSW Sydney
Guangyi Liu - China Mobile (China)
Ning Wei (Corresponding Author) - University of Electronic Science and Technology of China
Publication details: IEEE Transactions on Communications, Vol.74, pp.5803-5819
Publisher: Institute of Electrical and Electronics Engineers
Date published: 2026
DOI: 10.1109/TCOMM.2026.3669488
ISSN: 1558-0857
Grant note: This work was supported in part by the National Science and Technology Major Project of China under Grant 2025ZD1302000; and in part by the National Natural Science Foundation of China (NSFC) under Grant 61871070, Grant 61831004, Grant 62203451, and Grant 91938202.
Organisation Unit: School of Science, Technology and Engineering
Language: English
Record Identifier: 991212777602621
Output Type: Journal article

Metrics

1 Record Views

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types: Industry collaboration; Domestic collaboration; International collaboration
Web Of Science research areas: Engineering, Electrical & Electronic; Telecommunications