Page 1 of 1

Generate thesaurus problem

PostPosted:Thu Mar 17, 2011 9:08 am
by r041
I just want to try thesaurus example on http://wiki.openkm.com/index.php/Thesaurus and http://wiki.openkm.com/index.php/Generate_thesaurus.
I put .owl files and .rdf files into $Jboss_HOME$/vocabulary/,as well as the stopwords_en.txt.

set properties in OpenKM.cfg:
Code: Select all
kea.thesaurus.skos.file=/vocabulary/agrovoc.rdf
kea.thesaurus.owl.file=/vocabulary/agrovoc_oaei2007.owl
kea.thesaurus.vocabulary.serql=SELECT X,UID FROM {X} skos:prefLabel {UID} WHERE lang(UID) ="en" USING NAMESPACE rdf=<http://www.w3.org/1999/02/22-rdf-syntax-ns#>, skos=<http://www.w3.org/2004/02/skos/core#>,rdfs=<http://www.w3.org/2000/01/rdf-schema#>, dc=<http://purl.org/dc/elements/1.1/>, dcterms=<http://purl.org/dc/terms/>, foaf=<http://xmlns.com/foaf/0.1/>
kea.thesaurus.base.url=http://www.fao.org/aos/agrovoc
kea.thesaurus.tree.root=SELECT DISTINCT UID, TEXT FROM {UID} Y {OBJECT}, {UID} rdfs:label {TEXT} ; [rdfs:subClassOf {CLAZZ}] where not bound(CLAZZ) and lang(TEXT)="en" USING NAMESPACE foaf=<http://xmlns.com/foaf/0.1/>, dcterms=<http://purl.org/dc/terms/>, rdf=<http://www.w3.org/1999/02/22-rdf-syntax-ns#>, owl=<http://www.w3.org/2002/07/owl#>, rdfs=<http://www.w3.org/2000/01/rdf-schema#>, skos=<http://www.w3.org/2004/02/skos/core#>, dc=<http://purl.org/dc/elements/1.1/>
kea.thesaurus.tree.childs=SELECT DISTINCT UID, TEXT FROM {UID} rdfs:subClassOf {CLAZZ}, {UID} rdfs:label {TEXT} where xsd:string(CLAZZ) = "RDFparentID" and lang(TEXT)="en" USING NAMESPACE foaf=<http://xmlns.com/foaf/0.1/>, dcterms=<http://purl.org/dc/terms/>,  rdf=<http://www.w3.org /1999/02/22-rdf-syntax-ns#>, owl=<http://www.w3.org/2002/07/owl#>, rdfs=<http://www.w3.org/2000/01/rdf-schema#>, skos=<http://www.w3.org/2004/02/skos/core#>, dc=<http://purl.org/dc/elements/1.1/>
kea.automatic.keyword.extraction.number=10
kea.automatic.keyword.extraction.restriction=on
kea.stopwords.file=/vocabulary/stopwords_en.txt
But I don't know how to set the kea.model.file propertiy cause I can't find .model file.

Thanks a lot

Re: Generate thesaurus problem

PostPosted:Fri Mar 18, 2011 9:11 am
by jllort
NO NO NO.

Phase 1
For thesaurus you only need doing what's explained on http://wiki.openkm.com/index.php/Thesaurus_full_example nothing else.

Phase 2
KEA = Automatic KEY Extraction needs some extra job. Needs first having a thesaurus deployed. When you will got it, tell me and we'll talking about kea model training. Some of the properties you put on OpenKM.cfg are KEA properties, for first phase remove non necessary parameters.