Neurolinguistic Approach to Vector Representation of Medical Concepts

Wlodzislaw Duch1,2, Pawel Matykiewicz1,3, John Pestian3
1Department of Informatics, Nicolaus Copernicus University, Grudziadzka 5, 87-100 Torun, Poland.
2School of Computer Engineering, Nanyang Technological University, Singapore.
3Department of Biomedical Informatics, Children's Hospital Research Foundation, Cincinnati, Ohio, USA, and


Putative brain processes responsible for understanding language are based on spreading activation in semantic networks, providing enhanced representations that involve concepts not found directly in the text. Approximation of this process is of great practical and theoretical interest. Vector model should reflect activations of various concepts in the brain spreading through associative network. Medical ontolo-gies are used to select concepts of specific semantic type and add to each of them related concepts, providing expanded vector representations. The process is constrained by selection of useful extensions for the classification task. Short hospital discharge summaries are used to illustrate how this process works on a real, very noisy data. Results show significantly improved clustering and classification accuracy. A practical approach to mapping of associative networks of the brain involved in processing of specific concepts is presented.

Proc. of the 20th Int. Joint Conference on Neural Networks (IJCNN), Orlando, IEEE Press, August 12-17, 2007, pp. 1808

Preprint for comments in PDF, 455 KB.

BACK to the publications of W. Duch.
BACK to the on-line publications of the Department of Informatics, NCU.