Annotator Dataset Workflow Howto
Synchronize Ontology Data with Ontology Services
The ontologies and related data that will be used with the Annotator is gathered from the Ontology Services (part of BioPortal Core). This process should be run any time a new ontology (or a new version of an existing ontology) is added to the Ontology Services, though it could theoretically be run from a cron script or scheduled job.
- Remove out-dated ontologies from Annotator Database (e.g. older version of ontologies that does not in BioPortal anymore). By invoking this restlet, it will remove all the outdated ontology data and the associated entities such as concepts, terms, relations, semantic types and hierarchy information.
See the list of ontologies/versions to be removed: http://ncbobioportal/obs/admin/ontologies/list/old
Remove old ontologies: http://ncbobioportal/obs/admin/ontologies/remove - Add new ontologies from BioPortal to Annotator. By invoking this restlet, it will add all the new ontology data and the associated entities such as concepts, terms, relations, semantic types and hierarchy information.
See the list of ontologies/versions to be added: http://ncbobioportal/obs/admin/ontologies/list/new
Add new ontologies: http://ncbobioportal/obs/admin/ontologies/add - Populate Concepts (For details, please refer to Chapter 2.1)
- Populate Hierarchy (For details, please refer to Chapter 2.2)
http://ncbobioportal/obs/loaderBigPaths/all
To monitor the progress and view any errors, refer to:
- The "status" field in the table obs_ontology in Annotator DB (obs_hibernate database).
- Check the Tomcat log (/var/logs/tomcat6/catalina.out)
Create Dictionary File
All the terms will be created as dictionary file. (Data is coming from the obs_term table)
- Location : The directory location is specified in build.properties
# Dictionary File path obs.dictionary.path=/usr/local/tomcat6/webapps/annotator/WEB-INF/resources/dictionary/
http://ncbobioportal/obs/createDictionary/0
Mapping Data Population
Mappings between ontologies can be used in the Annotator to find related terms. Loading the mapping information is currently a manual process, though this will be automated in the future. If you have mapping data you would like to include in annotator, please <a href="mailto:support@bioontology.org">contact NCBO</a>.