MSF-VMDNet for Multi-Class Segmentation of Skin Cancer Whole Slide Images Using a Multi-Frequency Dual Encoder Network

April 2026 in “ Scientific Reports ”

Jiangliang Zhang, Qiumei Pu, Jinglong Tian, Junhao Wang, Jieyao Wei, Menghan Yang, Lina Zhao

skin cancer whole-slide images multi-class segmentation U-Net Vision Mamba AFNO spectral decomposition SCConv module MIoU Dice coefficient

The study introduces MSF-VMDNet, a novel deep learning model designed for multi-class segmentation of skin cancer whole-slide images, addressing the complexity of differentiating 10 distinct tissue classes. This model combines U-Net and Vision Mamba dual encoders to enhance feature extraction and segmentation accuracy. The U-Net encoder uses an improved AFNO spectral decomposition module for high-resolution semantic information, while the Vision Mamba encoder optimizes long-range dependency modeling. The SCConv module fuses features from various frequency domains and spatial levels. MSF-VMDNet outperforms existing methods, achieving an MIoU of 95.37% and a Dice coefficient of 95.11%, and demonstrates strong generalization across multiple datasets.

View this study on nature.com →

Discuss this study in the Community →

Research cited in this study

1 / 1 results

research Skin Biopsy

106 citations , December 2015 in “Journal of The American Academy of Dermatology”

Correct skin biopsy techniques are crucial to avoid misdiagnosis of skin diseases.