Research Article

Classification of Cancer Primary Sites Using Machine Learning and Somatic Mutations

Table 3

Micro- and macroaveraged accuracies of seven combinations of gene symbols with three other features.

Feature combinationNumber of featuresmiAccuracymaAccuracy (mean)maAccuracy (SD)

Gene (baseline)21,2860.570.570.019
Gene + gMutation 101,1510.580.580.019
Gene + Pathway 21,5710.580.580.010
Gene + Chromosome 21,3110.600.600.022
Gene + gMutation + Pathway 101,4360.600.600.013
Gene + gMutation + Chromosome 101,1760.620.620.021
Gene + gMutation + Chromosome + Pathway 101,4610.600.600.015

Note: miAccuracy represents the microaverage accuracy; maAccuracy represents the macroaverage accuracy, which is reported in mean and standard deviation (SD) over 10 accuracies from 10-fold cross validation.