Authors
Tongli He, Xiwen Yang, Wanzhen Huang, Fan Zhang, Guodong Li, Ze Niu, Jianhong Gan, Zhibin Li, Xun Deng, Tinghui Chen, Peiyang Wei, Shuai Li, Xiaoli Peng
Published in
Biomimetics (Basel, Switzerland). Volume 11. Issue 6. Jun 01, 2026. Epub Jun 01, 2026.
Abstract
Agricultural cold chain logistics is characterized by inherent challenges-product perishability, high carbon emissions, and stringent time windows-which are further exacerbated by dynamic disruptions. Existing methods suffer from slow adaptability, unstable multi-objective convergence, and severe cold-start issues. This work falls within the broad scope of biomimetics-the science of emulating nature's time-tested strategies to solve complex engineering problems-and bio-inspired data-driven methods and their applications in engineering control, optimization, and artificial intelligence. The proposed H-MODRL framework embodies core biomimetic principles: the Genetic Algorithm (GA) mimics Darwinian natural selection and genetic inheritance, the Sparrow Search Algorithm (SSA) abstracts the cooperative foraging and anti-predation behaviors of sparrow populations in nature, and the Arrhenius-based freshness-decay model captures the biochemical kinetics governing perishable biological products. By synergistically integrating these biological evolution principles, swarm intelligence, and deep learning, the framework tackles real-world logistics complexity in a manner directly inspired by living systems. This study presents a well-organized hybrid optimization framework (H-MODRL) that couples a three-stage hybrid evolutionary mechanism, synergistically integrating heuristic warm-start, evolutionary policy guidance, and deep reinforcement learning decision-making. First, an improved genetic algorithm combined with the earliest deadline first strategy constructs a feasible initial population satisfying hard time-window constraints. Second, a large neighborhood search-enhanced chaotic sparrow search algorithm builds a high-quality elite guidance set for policy learning. Third, a physics-based multi-objective proximal policy optimization model embedded with Arrhenius equation-derived freshness-decay kinetics performs online decision-making. Experiments demonstrate that pre-computed all-pairs shortest paths and an O(1) hash-based dynamic-disruption indexing mechanism support fast online replanning. On heterogeneous simulated terrains based on real Chinese geospatial data, H-MODRL outperforms state-of-the-art algorithms across four objectives-logistics cost, carbon emissions, terminal freshness, and delivery time-while exhibiting compact, low-variance performance distributions, thereby validating its engineering robustness and practical value in complex agricultural cold chain environments.
PMID:
42345669
Bibliographic data and abstract were imported from PubMed on 25 Jun 2026.
Read full publication at:
Please sign in
to see all details.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 8
- Comments 0