uu.seUppsala University Publications
Change search
ReferencesLink to record
Permanent link

Direct link
Ontology learning from Swedish text
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2015 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Ontology learning from text generally consists roughly of NLP, knowledge extraction and ontology construction. While NLP and information extraction for Swedish is approaching that of English, these methods have not been assembled into the full ontology learning pipeline.

This means that there is currently very little automated support for using knowledge from Swedish literature in semantically-enabled systems.

This thesis demonstrates the feasibility of using some existing OL methods for Swedish text and elicits proposals for further work toward building and studying open domain ontology learning systems for Swedish and perhaps multiple languages. This is done by building a prototype ontology learning system based on the state of the art architecture of such systems, using the Korp NLP framework for Swedish text, the GATE system for corpus and annotation management, and embedding it as a self-contained plugin to the Protege ontology engineering framework.

The prototype is evaluated similarly to other OL systems. As expected, it is found that while sufficient for demonstrating

feasibility, the ontology produced in the evaluation is not usable in practice, since many more methods and fewer cascading errors are necessary to richly and accurately model the domain. In addition to simply implementing more methods to extract more ontology elements, a framework for programmatically defining knowledge extraction and ontology construction methods and their dependencies is recommended to enable more effective research and application of ontology learning.

Place, publisher, year, edition, pages
2015. , 70 p.
IT, 15006
Keyword [en]
Natural language processing.
National Category
Engineering and Technology
URN: urn:nbn:se:uu:diva-245334OAI: oai:DiVA.org:uu-245334DiVA: diva2:791132
Educational program
Master Programme in Computer Science
Available from: 2015-02-26 Created: 2015-02-26 Last updated: 2015-02-26Bibliographically approved

Open Access in DiVA

fulltext(2052 kB)502 downloads
File information
File name FULLTEXT01.pdfFile size 2052 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 502 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 809 hits
ReferencesLink to record
Permanent link

Direct link