Construction of the Turkish national corpus (TNC)

Yeşim Aksan; Mustafa Aksan; Ahmet Hasan Koltuksuz; Taner Sezer; Ümit Mersinli; Umut Ufuk Demirhan; Hakan Yilmazer; Özlem Kurtoglu; Gülsüm Atasoy; Seda Öz

dc.contributor.author	Yeşim Aksan
dc.contributor.author	Mustafa Aksan
dc.contributor.author	Ahmet Hasan Koltuksuz
dc.contributor.author	Taner Sezer
dc.contributor.author	Ümit Mersinli
dc.contributor.author	Umut Ufuk Demirhan
dc.contributor.author	Hakan Yilmazer
dc.contributor.author	Özlem Kurtoglu
dc.contributor.author	Gülsüm Atasoy
dc.contributor.author	Seda Öz
dc.contributor.editor	M.U. Dogan, J. Mariani, A. Moreno, S. Goggi, K. Choukri, N. Calzolari, J. Odijk, T. Declerck, B. Maegaard, S. Piperidis, H. Mazo, O. Hamon
dc.date.accessioned	2025-10-06T17:52:57Z
dc.date.issued	2012
dc.description.abstract	This paper addresses theoretical and practical issues encountered in the construction of the Turkish National Corpus (TNC). TNC is designed to be a balanced, large-scale (50 million words), general-purpose corpus of contemporary Turkish. It has benefited from previous practices and efforts in corpus construction. In this sense, TNC generally follows the framework of the British National Corpus, yet adjustments to the corpus design are made wherever needed. Throughout the process, different types of open-source software are used for specific tasks, and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features, its web-based corpus management system, its carefully planned workflow, and its web-based, user-friendly search interface. © 2017 Elsevier B.V. All rights reserved.
dc.description.sponsorship	CELI - Language and Information Technology, European Media Laboratory GmbH (EML), IMMI, Meta, Nuance, Quaero
dc.identifier.isbn	9782951740877
dc.identifier.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-84926307827&partnerID=40&md5=e311fbc0ebeb16064d2317f227e2c7b6
dc.identifier.uri	https://gcris.yasar.edu.tr/handle/123456789/10193
dc.language.iso	English
dc.publisher	European Language Resources Association (ELRA)
dc.relation.ispartof	8th International Conference on Language Resources and Evaluation LREC 2012
dc.subject	Corpus Construction, Corpus Linguistics, Turkish National Corpus, Open Systems, Software Engineering, British National Corpora, Design Features, Management Systems, Practical Issues, Search Interfaces, Turkishs, Open Source Software
dc.title	Construction of the Turkish national corpus (TNC)
dc.type	Conference Object
dspace.entity.type	Publication
gdc.coar.type	text::conference output
gdc.index.type	Scopus
oaire.citation.endPage	3227
oaire.citation.startPage	3223
person.identifier.scopus-author-id	Aksan, Yeşim (25621425200); Aksan, Mustafa (57198899045); Koltuksuz, Ahmet Hasan (13408802300); Sezer, Taner (57193614606); Mersinli, Ümit (57198894479); Demirhan, Umut Ufuk (57198882338); Yilmazer, Hakan (57198886801); Kurtoglu, Özlem (55103539300); Atasoy, Gülsüm (57198885669); Öz, Seda (57198890314)
project.funder.name	TNC was supported by a research grant from the Scientific and Technological Research Council of Turkey (TÜBİTAK Grant No: 108K242).
relation.isOrgUnitOfPublication	ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery	ac5ddece-c76d-476d-ab30-e4d3029dee37

Collections

Scopus Indexed Publications Collection
