Research Article

YOLO-UNet Architecture for Detecting and Segmenting the Localized MRI Brain Tumor Image

Table 1

Comparison of several studies for classification and segmentation in medical image processing.

AuthorMethodsCSADSDatasetSizeType

Deepak and Ameer [2]GoogLeNet with transfer learning3064 brain MRI images from 233 patients (diagnosed meningioma, glioma, and pituitary tumors) are with T1-CE MRI modality and include coronal, sagittal, and axial views512 × 512.Mat file
Hu et al. [3]Multikernel depthwise convolutionChest X-ray 14 dataset and Chest X-ray 2017 dataset512 × 512 preprocess to 256 × 256Dicom
Liu et al. [4]Landmark-based deep multi-instance learning (LDMIL)1.5T and 3T T1-weighted structural MRI data (1526)Voxel level with 256 × 256 × 256
Suk et al. [5]Deep ensemble learning of sparse regression models805 subjects of 186 (AD), 393 (MCI), and 226 (normal control, NC)Voxel level with 256 × 256 × 256Neuroimaging Informatics Technology Initiative (NIfTI)
Xue et al. [6]AlexNet, VGG-19, and ResNet420 anteroposterior (AP) view hip X-ray images with normal (219) and OA (201)432 mm × 356 mm, with a pixel resolution of 0.187 mm × 0.187 mm
Safdar et al. [8]YOLOT1-weighted and FLAIR images from MRI images of patients who suffered from low-grade glioma256 × 256Dicom
Ronneberger et al. [9]UNetTransmission electron microscopy of the Drosophila first instar larva ventral nerve cord (VNC)512 × 512
Weng et al. [11]UNetMRI, CT, and ultrasound (Promise12, chaos, NERVE dataset)256 × 256; 512 × 512; 580 × 420Dicom
Li et al. [12]UNetNIH pancreatic segmentation dataset that contains 82 contrast-enhanced abdominal CT volumes and corresponding fine annotations512 × 512 × L, where L ∈ [181, 466]
Pravitasari et al. [10]UNet-VGG16Real dataset from the general hospital (RSUD) Dr. Soetomo (152)256 × 256Dicom
Bisa [14]R-CNNBraTS (brain tumor segmentation) challenge datasetNeuroimaging Informatics Technology Initiative (NIfTI)
Süleyman Yıldırım and Dandıl [15]Mask R-CNNMR image of multiple sclerosis dataset (1838)min 256, max 512
He et al. [16]Mask R-CNNCOCO dataset640 × 800JPG

Note: C = classification, S = segmentation, and ADS = automatic detection segmentation.