Difference between revisions of "TypeCraft:About"
(→A short history) |
|||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
===A short history=== | ===A short history=== | ||
− | Since the mid eighties, groups of researchers and students at the [http://www.ntnu.no/english/ Norwegian University of Science and Technology] have explored the use of formal and computational linguistic methods for natural language applications. The | + | Since the mid eighties, groups of researchers and students at the [http://www.ntnu.no/english/ Norwegian University of Science and Technology] have explored the use of formal and computational linguistic methods for natural language applications. The formalisation and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name '''LingLab'''. This then became the '''[http://www.ntnu.no/isl/project-description-digital-linguistics Research Group in Digital Linguistics]''' |
− | At present the group has two focal areas: ''' | + | At present the group has two focal areas: '''grammar engineering''' and the support for linguistic work with '''lesser-described languages'''. In this context we have started to explore |
+ | Semantic Web Technologies. | ||
====Grammar Engineering==== | ====Grammar Engineering==== | ||
− | In Grammar Engineering the main application developed by LingLab is the Norwegian computational grammar NorSource ( | + | In Grammar Engineering the main application developed by LingLab is the Norwegian computational grammar NorSource ([[Norwegian HPSG grammar NorSource]]). Together with partners in the [http://wiki.delph-in.net/moin/FrontPage DELPH-IN] network LingLab applies Head-Driven Phrase Structure Grammar |
([http://books.google.com/books?hl=no&lr=&id=aweEntGaBrIC&oi=fnd&pg=PR9&dq=%22Pollard%22+%22Head-Driven+Phrase+Structure+Grammar%22+&ots=e_M-ggAVpv&sig=eFGqv2XIuqN-b7MTA3bnb1Ouri0#PPP1,M1 Pollard and Sag (1994)] and Minimal Recursion Semantics (MRS)[http://www.springerlink.com/content/8g7924476l471916/ (Copestake et.al.2005)] to advance deep natural language processing. | ([http://books.google.com/books?hl=no&lr=&id=aweEntGaBrIC&oi=fnd&pg=PR9&dq=%22Pollard%22+%22Head-Driven+Phrase+Structure+Grammar%22+&ots=e_M-ggAVpv&sig=eFGqv2XIuqN-b7MTA3bnb1Ouri0#PPP1,M1 Pollard and Sag (1994)] and Minimal Recursion Semantics (MRS)[http://www.springerlink.com/content/8g7924476l471916/ (Copestake et.al.2005)] to advance deep natural language processing. | ||
Line 11: | Line 12: | ||
A further effort to represent lexical and construction level information is the Construction Labeling Project [[Verbconstructions cross-linguistically - Introduction]], a system for encoding construction types across languages [http://www.hf.ntnu.no/hf/isk/Ansatte/lars.hellan/personInfo.html Lars Hellan]. | A further effort to represent lexical and construction level information is the Construction Labeling Project [[Verbconstructions cross-linguistically - Introduction]], a system for encoding construction types across languages [http://www.hf.ntnu.no/hf/isk/Ansatte/lars.hellan/personInfo.html Lars Hellan]. | ||
− | |||
====Support for lesser-described languages==== | ====Support for lesser-described languages==== | ||
− | Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is | + | Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is mp longer maintained. The project became known as '' The Legon Trondheim Linguistics Project'', while its official name was ''Computational Lexicography, Typology and Adult Literacy''. The Ghanaian coordinator was the Head of the Linguistics Department in office. For more information about this over years very influential, and extremely productive project consult also the following [[The Legon Trondheim Linguistics Project| page]]. |
− | As part of this project the TypeCraft development started in 2005 | + | As part of this project the TypeCraft development started in 2005. Its goal was to allow a better project internal management for linguistic data. We had the wish to create a tool that was suited for the creation of morpheme-level glossed data, but that unlike other well-known systems allowed a distributed use, and the easy exchange of data. The idea was that the tool should be tailored to the needs of groups of researchers and students from the North and the South working together on the description of lesser-described languages. |
The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by [https://www.linkedin.com/in/atleprange Atle Prange]] | The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by [https://www.linkedin.com/in/atleprange Atle Prange]] | ||
+ | |||
+ | |||
===TypeCraft=== | ===TypeCraft=== | ||
TypeCraft itself is a product of LingLab's effort in supporting data-driven language description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0 was developed as a joint-effort of [[User:Pavel|Pavel Mihaylov]] and [[User:Dorothee|Dorothee Beermann]]. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the [http://www.cidles.eu/ Interdisciplinery Centre of Social and Language Documentation] | TypeCraft itself is a product of LingLab's effort in supporting data-driven language description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0 was developed as a joint-effort of [[User:Pavel|Pavel Mihaylov]] and [[User:Dorothee|Dorothee Beermann]]. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the [http://www.cidles.eu/ Interdisciplinery Centre of Social and Language Documentation] | ||
− | |||
====A short description of TypeCraft==== | ====A short description of TypeCraft==== | ||
Line 31: | Line 32: | ||
TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim. | TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim. | ||
− | TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be logged in. | + | TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be logged in. Please consult the main page of the TypeCraft wiki for a more direct introduction to the system. |
Latest revision as of 19:42, 2 August 2014
Contents
A short history
Since the mid eighties, groups of researchers and students at the Norwegian University of Science and Technology have explored the use of formal and computational linguistic methods for natural language applications. The formalisation and encoding of morpho-syntactic and semantic information, both at lexical and phrasal level, has been a central theme for a group which in 2004 took the name LingLab. This then became the Research Group in Digital Linguistics
At present the group has two focal areas: grammar engineering and the support for linguistic work with lesser-described languages. In this context we have started to explore Semantic Web Technologies.
Grammar Engineering
In Grammar Engineering the main application developed by LingLab is the Norwegian computational grammar NorSource (Norwegian HPSG grammar NorSource). Together with partners in the DELPH-IN network LingLab applies Head-Driven Phrase Structure Grammar (Pollard and Sag (1994) and Minimal Recursion Semantics (MRS)(Copestake et.al.2005) to advance deep natural language processing.
As part of this work Pavel Mihaylov developed for LingLab an LKB multi-script interface called Trollet.
A further effort to represent lexical and construction level information is the Construction Labeling Project Verbconstructions cross-linguistically - Introduction, a system for encoding construction types across languages Lars Hellan.
Support for lesser-described languages
Between 1996 - 2009 the former linguistic department at NTNU and the linguistic departments at the University of Ghana, Legon cooperated in a project that was sponsored by the Norwegian NUFU programme, which is mp longer maintained. The project became known as The Legon Trondheim Linguistics Project, while its official name was Computational Lexicography, Typology and Adult Literacy. The Ghanaian coordinator was the Head of the Linguistics Department in office. For more information about this over years very influential, and extremely productive project consult also the following page.
As part of this project the TypeCraft development started in 2005. Its goal was to allow a better project internal management for linguistic data. We had the wish to create a tool that was suited for the creation of morpheme-level glossed data, but that unlike other well-known systems allowed a distributed use, and the easy exchange of data. The idea was that the tool should be tailored to the needs of groups of researchers and students from the North and the South working together on the description of lesser-described languages. The first prototype of the system was developed in cooperation with Businesscape, an NTNU spin-off IT-company, led by Atle Prange]
TypeCraft
TypeCraft itself is a product of LingLab's effort in supporting data-driven language description and analysis. After an early prototype of TypeCraft, presented at the University of Ghana, Legon and at the Texas Linguistic Society in Austin in 2006. TypeCraft v.1.0 was developed as a joint-effort of Pavel Mihaylov and Dorothee Beermann. In August 2014, TypeCraft v.2.0 was presented which is a co-development of the TypeCraft team and CIDLeS, the Interdisciplinery Centre of Social and Language Documentation
A short description of TypeCraft
TypeCraft is an online application consisting of a natural language database and a linguistic editor for interlinear glossing. The user adds linguistic annotation to written material which is stored in a relational database from where it can be retrieved using multiple views. The system is wrapped into a customised mediawiki which serves as an entrance port to the system.
Texts as well as annotations are in Unicode. Annotated data can be exported to standard text editors (WORD, Open Office and LaTex)as well as to XML format. TC has been designed for projects on minority languages. TypeCraft features an intuitive user interface and allows distributive usage. The application is written in Java using PostgreSQL database. It is hosted at a server owned by the Norwegian University of Science and Technology in Trondheim.
TypeCraft can be freely used. To use the TypeCraft Editor the user needs to be logged in. Please consult the main page of the TypeCraft wiki for a more direct introduction to the system.