Converting a Toolbox lexical database to LKB format
Summary
The LKB (Linguistic Knowledge Builder) system is a grammar and lexicon development environment for unification-based linguistic formalisms. It is used mainly with HPSG (Head-driven Phrase Structure Grammar). This page describes and provides the program that converts a lexicon database created with Toolbox into the lexicon format required by LKB. The scripts were developed by Hannes Hirzel.
A presentation given in Trondheim in 2005 (File:Toolbox-LKB-Link-slides - version 4.pdf) shows how the conversion is applied to a lexicon file of the Ga language. The dictionary file was created by Mary E. Kropp Dakubu.
The scripting language used for the conversion is called 'Consistent Changes' and is built into the Toolbox program.
For a working, portable setup, see the Download section on this page. The setup may need to be adapted for other languages. All the files involved are text files that you may change; see the License section.
Status
Updated 9th November 2012. Hannes Hirzel. The conversion runs fine. In case of problems please contact dictionaries_gillbt@gillbt.org
Download
The following folder File:Toolbox Project Ga.zip contains the standard files produced by the utility program 'Toolbox New Project Package 1.5.8' from http://www.sil.org/computing/toolbox/downloads.htm.
The file 'Dictionary.txt' has been replaced by the Ga lexicon created by Mary Esther Kropp Dakubu (MED), University of Ghana.
This folder has been posted to this web site [www.typecraft.org] by permission.
How to start Toolbox
The folder 'Settings' contains the Toolbox exe file. Double-click it to start Toolbox.
How to create the LKB tdl file
You can run the conversion program from within Toolbox.
To run the conversion, follow these steps:
- Make the dictionary window the 'active window' by clicking its title bar
- Choose menu 'File' / 'Export'
- Select 'TBox-LKB Step1'
- Click 'OK'.
- A new file 'LKBlexicon.tdl' is created.
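After the export steps above, it can be useful to check that the output file exists and roughly how many entries it holds. The following is a minimal sketch, not part of the original scripts. It assumes that each lexical entry in 'LKBlexicon.tdl' contains one `:=` definition, that lines starting with `;` are comments, and that the file is UTF-8 encoded. The function names `count_tdl_entries` and `report` are hypothetical.

```python
from pathlib import Path


def count_tdl_entries(text):
    """Count lines that contain a ':=' definition, skipping ';' comment lines.

    Assumption: one ':=' per lexical entry in the exported file.
    """
    count = 0
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.startswith(";"):
            continue  # TDL line comment
        if ":=" in stripped:
            count += 1
    return count


def report(path="LKBlexicon.tdl"):
    """Return a one-line status message for the exported file."""
    tdl_file = Path(path)
    if not tdl_file.exists():
        return f"{path} not found; the export may not have run"
    # Assumption: the export is UTF-8 encoded; adjust if your setup differs.
    text = tdl_file.read_text(encoding="utf-8")
    return f"{path}: {count_tdl_entries(text)} entries"


if __name__ == "__main__":
    print(report())
```

Run the script from the folder that contains 'LKBlexicon.tdl', or pass another path to `report`.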
Implementation of the conversion
The folder 'Tbox2LKB-conv-scripts' contains a copy of the cct files from the folder 2005-05-31Ga-for-LKB-Uni-Trondheim-11a mentioned in the 2005 presentation.
These cct files convert the Ga lexicon from SFM (the Toolbox format) to the format that LKB (Linguistic Knowledge Builder) requires.
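To illustrate the kind of transformation involved, here is a minimal Python sketch of reading SFM records and writing TDL-style entries. It is not the cct implementation and does not reproduce its output. It assumes the common markers `\lx` (lexeme), `\ps` (part of speech) and `\ge` (English gloss). The layout of the generated entry, the `-lxm` type names and the sample records are invented for illustration.

```python
import re

# Invented sample records; not taken from the Ga dictionary.
SAMPLE = r"""
\_sh v3.0  400  MDF 5.0

\lx alpha
\ps v
\ge first gloss

\lx beta
\ps n
\ge second gloss
"""


def parse_sfm(text):
    """Split SFM text into records; a new record starts at each lx marker."""
    records = []
    current = None
    for line in text.splitlines():
        match = re.match(r"\\(\w+)\s*(.*)", line)
        if not match:
            continue  # this sketch skips blank and continuation lines
        marker, value = match.group(1), match.group(2).strip()
        if marker == "lx":
            current = {}
            records.append(current)
        if current is not None:
            current.setdefault(marker, value)  # keep the first value per marker
    return records


def to_tdl(record, index):
    """Render one record as a TDL-style entry (hypothetical layout)."""
    lexeme = record["lx"]
    pos = record.get("ps", "unknown")
    gloss = record.get("ge", "")
    ident = re.sub(r"\W+", "_", lexeme).strip("_") or "entry"
    return (
        f"{ident}_{index} := {pos}-lxm &\n"
        f'  [ ORTH <! "{lexeme}" !> ].  ; gloss: {gloss}\n'
    )


if __name__ == "__main__":
    for number, rec in enumerate(parse_sfm(SAMPLE), start=1):
        print(to_tdl(rec, number))
```

The header line beginning with `\_sh` is skipped because no record has started yet. A real conversion would also need to handle repeated markers, multi-line fields and the type hierarchy of the target grammar.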
License
The presentation and this wiki page are licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. The script code (program code) is under the MIT license.
License for data (dictionary file): to be determined; contact medakubu@gmail.com