Prompt Engineering in Medical Image Segmentation: An Overview of the Paradigm Shift

Hazrat Ali*, Mohammad Farhad Bulbul, Zubair Shah

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Foundation AI models have emerged as powerful pre-trained models on a large scale, capable of seamlessly handling diverse tasks across multiple domains with minimal or no fine-tuning. These models, exemplified by the impressive achievements of GPT-3 and BERT in natural language processing (NLP), as well as CLIP and DALL-E in computer vision, have garnered considerable attention for their exceptional performance. A noteworthy addition to the realm of image segmentation is the Segment Anything Model (SAM), a foundation AI model that revolutionizes image segmentation. With a single click or a natural language prompt, SAM exhibits the remarkable ability to segment any object within an image, marking a significant paradigm shift in medical image segmentation. Unlike conventional approaches that rely on labeled data and domain-specific knowledge, SAM breaks free from these constraints. Deep convolutional neural network (DCNN)-based, SAM comprises an image encoder, a prompt encoder, and a mask decoder, showcasing its efficient and flexible architecture. Medical image segmentation, in particular, benefits from SAM's exceptional speed and high-quality segmentation. In this paper, we delve into the effectiveness of SAM for medical image segmentation shedding light on its capabilities. Moreover, our investigation explores the strengths and limitations of prompt engineering in medical computer vision applications, not only encompassing SAM but also other foundation AI models. Through this exploration, we unravel their immense potential in catalyzing a paradigm shift in the field of medical imaging.

Original languageEnglish
Title of host publication2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things, AIBThings 2023 - Proceedings
EditorsAhmed Abdelgawad, Akhtar Jamil, Alaa Ali Hameed
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350322347
DOIs
Publication statusPublished - 2023
Event2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things, AIBThings 2023 - Mount Pleasant, United States
Duration: 16 Sept 202317 Sept 2023

Publication series

Name2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things, AIBThings 2023 - Proceedings

Conference

Conference2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things, AIBThings 2023
Country/TerritoryUnited States
CityMount Pleasant
Period16/09/2317/09/23

Keywords

  • Foundation AI
  • Healthcare
  • Large Language Models
  • Medical Artificial Intelligence
  • Medical Imaging
  • Natural Language Processing
  • Prompt Engineering
  • Segmentation

Fingerprint

Dive into the research topics of 'Prompt Engineering in Medical Image Segmentation: An Overview of the Paradigm Shift'. Together they form a unique fingerprint.

Cite this