Sample Size for Training and Testing: Segment Anything Models and Supervised Approaches

Daniela Cuza, Carlo Fantozzi, Loris Nanni*, Daniel Fusaro, Gustavo Zanoni Felipe, Sheryl Brahnam

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

The problem of determining the minimum amount of data required to train and test an artificial intelligence model has received substantial attention in the literature. In this chapter, we first review key concepts on the topic, then we survey selected theoretical and experimental results from the open literature, and in the end we present, as a case study, experiments we performed ourselves on the semantic segmentation of radiology images. A discussion from both a theoretical and an experimental point of view is required because the two approaches have complementary insights to offer. Theory provides general guidelines to avoid pitfalls during all phases of design: data collection, model design, training, and testing. Experimental results show what the current state of the art is in terms of performance and provide practical advice on which techniques have proven to be the most effective; for a more comprehensive study, we tested both supervised and zero-shot segmentation approaches, such as the “Segment Anything Model” (better known as SAM).

Original languageEnglish
Title of host publicationIntelligent Systems Reference Library
EditorsChee-Peng Lim, Ashlesha Vaidya, Nikhil Jain, Margarita N. Favorskaya, Lakhmi C. Jain
PublisherSpringer
Pages107-145
Number of pages39
ISBN (Electronic)9783031654305
ISBN (Print)9783031654299
DOIs
Publication statusPublished - 19 Sept 2024

Publication series

NameIntelligent Systems Reference Library
Volume258
ISSN (Print)1868-4394
ISSN (Electronic)1868-4408

Keywords

  • Algorithms
  • Artificial intelligence
  • Classifiers
  • Data augmentation
  • Data collection
  • Radiology
  • Sample size
  • Semantic segmentation
  • Zero-shot segmentation

ASJC Scopus subject areas

  • General Computer Science
  • Information Systems and Management
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'Sample Size for Training and Testing: Segment Anything Models and Supervised Approaches'. Together they form a unique fingerprint.

Cite this