Construction of the Turkish National Corpus (TNC)

Loading...
Publication Logo

Date

2012

Authors

Yesim Aksan
Mustafa Aksan
Ahmet Koltuksuz
Taner Sezer
Umit Mersinli
Umut Ufuk Demirhan
Hakan Yilmazer
Ozlem Kurtoglu
Gulsum Atasoy
Seda Oz

Journal Title

Journal ISSN

Volume Title

Publisher

EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Journal Issue

Abstract

This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its web-based user-friendly search interface.

Description

Keywords

Turkish National Corpus, corpus construction, corpus linguistics, Corpus Linguistics, Corpus Construction, Turkish National Corpus

Fields of Science

Citation

WoS Q

Scopus Q

Source

8th International Conference on Language Resources and Evaluation (LREC)

Volume

Issue

Start Page

3223

End Page

3227
SCOPUS™ Citations

88

checked on Apr 09, 2026

Web of Science™ Citations

39

checked on Apr 09, 2026

Google Scholar Logo
Google Scholar™

Sustainable Development Goals

SDG data is not available