ProMiner

Up-to-date information about biomedical entities such as genes, proteins, diseases or drugs is often found not in structured databases but in scientific text. For targeted information retrieval or information extraction, recognising these terms and normalising them to database entries (e.g. gene names to Entrez Gene) or to structured vocabularies and ontologies (e.g. GO, MeSH, UMLS) is a prerequisite. Normalisation requires dictionaries generated from these sources, together with direct mappings from terms to their entries. Because databases and ontologies evolve rapidly, dictionaries must be updated and processed automatically to remain comprehensive and specific. The high ambiguity of terms and acronyms used in the life science domain further complicates precise recognition.
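
To make the idea of dictionary-based recognition and normalisation concrete, here is a minimal Python sketch. It is not ProMiner's implementation; the function names, the synonym lists and the identifiers (GENE:A, GENE:B, GENE:C) are hypothetical placeholders chosen for illustration. It only shows how a synonym dictionary maps text mentions to identifiers, and how a shared acronym yields more than one candidate.

```python
import re
from collections import defaultdict

# Hypothetical synonym lists. The identifiers are placeholders,
# not real database entries.
ENTRIES = {
    "GENE:A": ["prostate specific antigen", "PSA"],
    "GENE:B": ["puromycin-sensitive aminopeptidase", "PSA"],
    "GENE:C": ["tumor protein p53", "p53"],
}


def normalise(term):
    """Lowercase and turn runs of whitespace or hyphens into one space."""
    return re.sub(r"[\s\-]+", " ", term.lower()).strip()


def build_dictionary(entries):
    """Map each normalised synonym to the set of identifiers that use it."""
    index = defaultdict(set)
    for identifier, synonyms in entries.items():
        for synonym in synonyms:
            index[normalise(synonym)].add(identifier)
    return index


def find_mentions(text, index):
    """Return dictionary terms found in the text with their candidate ids."""
    normalised = normalise(text)
    hits = {}
    for term, ids in index.items():
        # Match whole terms only, not fragments of longer words.
        pattern = r"(?<!\w)" + re.escape(term) + r"(?!\w)"
        if re.search(pattern, normalised):
            hits[term] = sorted(ids)
    return hits


index = build_dictionary(ENTRIES)
hits = find_mentions("Elevated PSA levels and mutated p53 were reported.", index)
for term, ids in hits.items():
    status = "ambiguous" if len(ids) > 1 else "unique"
    print(term, ids, status)
```

In this sketch the acronym "PSA" resolves to two candidate identifiers, which is the kind of ambiguity that a real system would have to resolve with context; that disambiguation step is deliberately left out here.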
