Public Library of Science
Browse

Flow diagram of Natural Language Processing (NLP) methodology.

Download (1.59 MB)
figure
posted on 2020-06-19, 17:43 authored by Charlene Jennifer Ong, Agni Orfanoudaki, Rebecca Zhang, Francois Pierre M. Caprasse, Meghan Hutch, Liang Ma, Darian Fard, Oluwafemi Balogun, Matthew I. Miller, Margaret Minnig, Hanife Saglam, Brenton Prescott, David M. Greer, Stelios Smirnakis, Dimitris Bertsimas

Text featurization with GloVe and binary classification lead to Receiver Operator Curves (ROC) for stroke occurrence, MCA location and stroke acuity. Representative ROC curves for each of the text featurization methods are displayed. RPDR = Research Patient Data Registry; CT = Computed Tomography; CTA = Computed Tomography Angiography; MRI = Magnetic Resonance Imaging; MRA = Magnetic Resonance Angiography; BOW = Bag of Words; tf-idf = Term Frequency-Inverse Document Frequency; GloVe = Global Vectors for Word Representation; CART = Classification and Regression Trees; OCT = Optimal Classification Trees; RF = Random Forests; RNN = Recurrent Neural Networks.

History