Research Article

Evaluation and Verification of the Global Rapid Identification of Threats System for Infectious Diseases in Textual Data Sources

Table 1

The ontologies used in GRITS, their contents, and their descriptions.

| Ontology | Contents | Description |
| --- | --- | --- |
| Biocaster ontology | General disease ontology | English terms for symptoms, diseases, and pathogens are used as features |
| GRITS ontology | Curated ontology of symptoms, control measures, descriptions of infected individuals, diseases, disease categories, environmental factors, hosts, host uses, modes of disease transmission, occupations, disease risks, vectors, and zoonotic types | Collection of keywords and terms gathered and vetted from a consensus of experts at EcoHealth Alliance |
| HealthMap disease labels | Diseases identified as significant by HealthMap and used for their disease labels | Used as outcome in logistic regression models |
| The disease ontology | Human disease related terms, phenotypic characteristics, and medical vocabulary disease concepts | Disease names and synonyms are used as keyword features. Predicates from disease definitions |
| USGS topographic feature vocabularies | Environmental factors | Subset used as features (all labels and synonyms of type owl#Thing) |
| Wordnet | English language ontology that maps word relatedness | Hyponyms and lemmata for a set of epidemiology-related root keywords are used as features |
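
As an illustration of what "used as keyword features" can mean in practice, the following is a minimal sketch, not the GRITS implementation. It assumes a tiny hypothetical vocabulary (`KEYWORDS`) and a hypothetical helper (`keyword_features`) that counts whole-word matches of each term in a report; the resulting counts are the kind of features that could then be passed to a classifier such as the logistic regression models mentioned in the table.

```python
import re

# Hypothetical mini-vocabulary for illustration only; the ontologies in
# Table 1 contain far more terms and categories than shown here.
KEYWORDS = {
    "disease": ["cholera", "avian influenza", "bird flu"],
    "symptom": ["fever", "diarrhea"],
    "host": ["poultry"],
}


def keyword_features(text, vocabulary=KEYWORDS):
    """Return {"category:term": count} for each vocabulary term found in text."""
    lowered = text.lower()
    features = {}
    for category, terms in vocabulary.items():
        for term in terms:
            # Word boundaries keep "fever" from matching inside "feverish".
            pattern = r"\b" + re.escape(term) + r"\b"
            count = len(re.findall(pattern, lowered))
            if count:
                features[f"{category}:{term}"] = count
    return features


report = "Officials report fever and diarrhea in a cholera outbreak; fever was common."
print(keyword_features(report))
```

Prefixing each feature name with its category keeps terms from different vocabularies distinct, which is one simple way to combine several ontologies into a single feature set.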