Research Article

An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums

Table 5

Top 10 feature contributions for medication and symptom class in a random forest model.

FeatureBack.Med.Sym.

Top 10 FC for medication sentences
Prescribed = 1−0.002750.01195−0.00920
(PRP, CD, CD) = 1−0.002510.01156−0.00905
Morpho. = 1−0.002060.00660−0.00455
hlca = 1−0.000710.00559−0.00489
(NN, SYMP, SYMP, CC) = 00.001150.00429−0.00544
sosy = 00.001910.00406−0.00597
(PRP, CD, IN, NN, NN) = 1−0.000750.00402−0.00327
(CD, IN, CD, CD) = 1−0.001200.00396−0.00276
thr. Crt. = 00.001540.00381−0.00535
(PRP, CD, JJ, JJ) = 1−0.000860.00362−0.00276
Top 10 FC for symptom sentences
sosy = 1−0.00589−0.007830.01371
Prescribed = 00.00234−0.0157340.01339
thr. Crt. = 1−0.00381−0.006830.01064
(PRP, CD, CD) = 00.00271−0.012640.00993
(SYMP, SYMP, SYMP) = 1−0.00330−0.005640.00895
(NN, SYMP, SYMP, CC) = 1−0.00209−0.006670.00876
Position < vth1−0.00334−0.005400.00874
patf = 1−0.00254−0.003790.00633
(SYMP, CC, JJ) = 1−0.00172−0.004040.00576
Word count > vth2−0.00131−0.004230.00554