Difference between revisions of "Mapping Set"

From NCBO Wiki
Jump to navigation Jump to search
Line 9: Line 9:
 
The data gathered over the set of mappings can be found [[media:MappingData.xls|here]].  The data is in a .xls format spreadsheet, with several worksheets containing data.  Please be sure to consult the worksheet labeled "Introduction" for explanations of the data contained in the spreadsheet.
 
The data gathered over the set of mappings can be found [[media:MappingData.xls|here]].  The data is in a .xls format spreadsheet, with several worksheets containing data.  Please be sure to consult the worksheet labeled "Introduction" for explanations of the data contained in the spreadsheet.
  
Additionally, we created several graphs of the ontologies for different thresholds of percent-normalized links.  These graphs can be found [[media:stub | here]].  Each graph image has the filename ontoX.tiff, where x is the the similarity threshold for links included in the graph.  For example, every edge in the graph onto70.tiff has at least 70% of concepts in the source ontology mapped to concepts in the target ontology.
+
Additionally, we created several graphs of the ontologies for different thresholds of percent-normalized links.  These graphs can be found [[media:OntologyGraphs.zip | here]].  Each graph image has the filename ontoX.tiff, where x is the the similarity threshold for links included in the graph.  For example, every edge in the graph onto70.tiff has at least 70% of concepts in the source ontology mapped to concepts in the target ontology.

Revision as of 21:55, 21 June 2009

Introduction

We created a set of mappings by applying simple lexical matching to preferred names and synonyms across all 4,021,662 concepts in 140 BioPortal ontologies and 67 vocabularies in the Unified Medical Language System (UMLS). This process resulted in 4,001,775 mappings.

By analyzing these mappings, we were able to produce data on the connectivity of ontologies.

Data

The data gathered over the set of mappings can be found here. The data is in a .xls format spreadsheet, with several worksheets containing data. Please be sure to consult the worksheet labeled "Introduction" for explanations of the data contained in the spreadsheet.

Additionally, we created several graphs of the ontologies for different thresholds of percent-normalized links. These graphs can be found here. Each graph image has the filename ontoX.tiff, where x is the the similarity threshold for links included in the graph. For example, every edge in the graph onto70.tiff has at least 70% of concepts in the source ontology mapped to concepts in the target ontology.