ISSN: 3048-6815

Enhanced Few-Shot Learning for Medical Image Segmentation via Meta-Learning with Attention-Guided Feature Augmentation

Abstract

Medical image segmentation is a crucial task in computer-aided diagnosis, enabling accurate localization and delineation of anatomical structures and pathological regions. However, deep learning-based segmentation methods typically require large amounts of annotated data, which are often scarce and expensive to acquire in the medical domain. Few-shot learning (FSL) offers a promising solution by enabling models to learn from limited labeled examples. This paper proposes an enhanced FSL framework for medical image segmentation that combines meta-learning with attention-guided feature augmentation. Specifically, we employ a Prototypical Network-based meta-learning architecture, which learns to extract task-specific prototypes from support sets. To address the challenge of limited data, we introduce an attention mechanism that focuses on salient image regions and guides feature augmentation, thereby enhancing the diversity and representativeness of the support set features. Experimental results on benchmark medical image segmentation datasets demonstrate that the proposed method significantly outperforms existing FSL approaches, achieving state-of-the-art performance with minimal labeled data. The proposed approach holds substantial promise for improving the efficiency and effectiveness of medical image analysis, particularly in scenarios with limited labeled data.

References

  1. Snell, J., Swersky, K., & Zemel, R. S. (2017). Prototypical networks for few-shot learning. Advances in Neural Information Processing Systems, 30.
  2. Ouyang, Y., Li, X., Tian, Q., & Qian, C. (2020). Self-support few-shot semantic segmentation. arXiv preprint arXiv:2003.03409.
  3. Finn, C., Abbeel, P., & Levine, S. (2017). Model-agnostic meta-learning for fast adaptation of deep networks. International Conference on Machine Learning, 1126-1135.
  4. Rajeswaran, A., Finn, C., Kakade, S. M., & Levine, S. (2019). Meta-learning with implicit gradients. Advances in Neural Information Processing Systems, 32.
  5. Han, C., Lu, Y., Xing, E. P., & Yang, G. (2018). Few-shot image semantic segmentation with mask-aware discriminator. arXiv preprint arXiv:1811.09398.
  6. Tripathi, S., Anand, D., & Chellappa, R. (2020). Few-shot semantic segmentation via cycle-consistent generative adversarial networks. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10646-10655.
  7. Oktay, O., Schlemper, J., Folgoc, L. L., Lee, M., Heinrich, M., Misawa, K., ... & Rueckert, D. (2018). Attention U-Net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999.
  8. Wang, X., Girshick, R., Shrivastava, A., & He, K. (2017). Non-local neural networks. Proceedings of the IEEE conference on computer vision and pattern recognition, 7794-7803.
  9. Yun, S., Han, D., Oh, S. J., Chun, S., Choe, J., & Lee, Y. (2019). Cutmix: Regularization strategy to train strong classifiers with localizable evidence. Proceedings of the IEEE/CVF International Conference on Computer Vision, 6023-6032.
  10. Zhang, H., Cisse, M., Dauphin, Y. N., & Lopez-Paz, D. (2018). mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412.
  11. Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. International Conference on Medical image computing and computer-assisted intervention, 234-241.
  12. Kingma, D. P., & Welling, M. (2013). Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114.
  13. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... & Bengio, Y. (2014). Generative adversarial nets. Advances in neural information processing systems, 27.
  14. Isola, P., Zhu, J. Y., Zhou, T., & Efros, A. A. (2017). Image-to-image translation with conditional adversarial networks. Proceedings of the IEEE conference on computer vision and pattern recognition, 1125-1134.
  15. Perez-Garcia, F., Sparks, R., Ourselin, S., & Cardoso, M. J. (2021). TorchIO: a Python library for efficient loading, preprocessing, augmentation and patch extraction of multi-modal medical images. Medical Image Analysis, 71, 102047.
Download PDF

How to Cite

Gnanzou, D., (2025-04-28 18:59:05.449). Enhanced Few-Shot Learning for Medical Image Segmentation via Meta-Learning with Attention-Guided Feature Augmentation. JANOLI International Journal of Artificial Intelligence and its Applications, Volume EOCMPeqBj5R9ZDur0Rlk, Issue 1.