Construction of the Turkish national corpus (TNC)

dc.contributor.author Yeşim Aksan
dc.contributor.author Mustafa Aksan
dc.contributor.author Ahmet Hasan Koltuksuz
dc.contributor.author Taner Sezer
dc.contributor.author Ümit Mersinli
dc.contributor.author Umut Ufuk Demirhan
dc.contributor.author Hakan Yilmazer
dc.contributor.author Özlem Kurtoglu
dc.contributor.author Gülsüm Atasoy
dc.contributor.author Seda Öz
dc.contributor.editor M.U. Dogan , J. Mariani , A. Moreno , S. Goggi , K. Choukri , N. Calzolari , J. Odijk , T. Declerck , B. Maegaard , S. Piperidis , H. Mazo , O. Hamon
dc.date.accessioned 2025-10-06T17:52:57Z
dc.date.issued 2012
dc.description.abstract This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its webbased user-friendly search interface. © 2017 Elsevier B.V. All rights reserved.
dc.description.sponsorship CELI - Language and Information Technology, European Media Laboratory GmbH (EML), IMMI, Meta, Nuance, Quaero
dc.identifier.isbn 9782951740877
dc.identifier.uri https://www.scopus.com/inward/record.uri?eid=2-s2.0-84926307827&partnerID=40&md5=e311fbc0ebeb16064d2317f227e2c7b6
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/10193
dc.language.iso English
dc.publisher European Language Resources Association (ELRA)
dc.relation.ispartof 8th International Conference on Language Resources and Evaluation LREC 2012
dc.subject Corpus Construction, Corpus Linguistics, Turkish National Corpus, Open Systems, Software Engineering, British National Corpora, Corpus Construction, Corpus Linguistics, Design Features, Management Systems, Practical Issues, Search Interfaces, Turkishs, Open Source Software
dc.subject Open systems, Software engineering, British national corpora, Corpus construction, Corpus linguistics, Design features, Management systems, Practical issues, Search interfaces, Turkishs, Open source software
dc.title Construction of the Turkish national corpus (TNC)
dc.type Conference Object
dspace.entity.type Publication
gdc.coar.type text::conference output
gdc.index.type Scopus
oaire.citation.endPage 3227
oaire.citation.startPage 3223
person.identifier.scopus-author-id Aksan- Yeşim (25621425200), Aksan- Mustafa (57198899045), Koltuksuz- Ahmet Hasan (13408802300), Sezer- Taner (57193614606), Mersinli- Ümit (57198894479), Demirhan- Umut Ufuk (57198882338), Yilmazer- Hakan (57198886801), Kurtoglu- Özlem (55103539300), Atasoy- Gülsüm (57198885669), Öz- Seda (57198890314)
project.funder.name TNC was supported by a research grant from the Scientific and Technological Research Council of Turkey (TÜBİTAK Grant No: 108K242).
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files