Construction of the Turkish National Corpus (TNC)
Loading...

Date
2012
Authors
Yesim Aksan
Mustafa Aksan
Ahmet Koltuksuz
Taner Sezer
Umit Mersinli
Umut Ufuk Demirhan
Hakan Yilmazer
Ozlem Kurtoglu
Gulsum Atasoy
Seda Oz
Journal Title
Journal ISSN
Volume Title
Publisher
EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its web-based user-friendly search interface.
Description
Keywords
Turkish National Corpus, corpus construction, corpus linguistics, Corpus Linguistics, Corpus Construction, Turkish National Corpus
Fields of Science
Citation
WoS Q
Scopus Q
Source
8th International Conference on Language Resources and Evaluation (LREC)
Volume
Issue
Start Page
3223
End Page
3227
SCOPUS™ Citations
88
checked on Apr 09, 2026
Web of Science™ Citations
39
checked on Apr 09, 2026
