Construction of the Turkish National Corpus (TNC)

dc.contributor.author Yesim Aksan
dc.contributor.author Mustafa Aksan
dc.contributor.author Ahmet Koltuksuz
dc.contributor.author Taner Sezer
dc.contributor.author Umit Mersinli
dc.contributor.author Umut Ufuk Demirhan
dc.contributor.author Hakan Yilmazer
dc.contributor.author Ozlem Kurtoglu
dc.contributor.author Gulsum Atasoy
dc.contributor.author Seda Oz
dc.contributor.author Ipek Yildiz
dc.contributor.author Sezer, Taner
dc.contributor.author Yildiz, Ipek
dc.contributor.author Mersinli, Ümit
dc.contributor.author Koltuksuz, Ahmet
dc.contributor.author Aksan, Mustafa
dc.contributor.author Demirhan, Umut Ufuk
dc.contributor.author Aksan, Yesim
dc.contributor.editor N Calzolari
dc.contributor.editor K Choukri
dc.contributor.editor T Declerck
dc.contributor.editor MU Dogan
dc.contributor.editor B Maegaard
dc.contributor.editor J Mariani
dc.contributor.editor J Odijk
dc.contributor.editor S Piperidis
dc.coverage.spatial Istanbul TURKEY
dc.date.accessioned 2025-10-06T16:21:04Z
dc.date.issued 2012
dc.description.abstract This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its web-based user-friendly search interface.
dc.description.sponsorship TÜBİTAK, (108K242); Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK
dc.description.sponsorship Scientific and Technological Research Council of Turkey (TUBITAK) [108K242]
dc.description.sponsorship CELI - Language and Information Technology; European Media Laboratory GmbH (EML); IMMI; Meta; Nuance; Quaero
dc.description.sponsorship TNC was supported by a research grant from the Scientific and Technological Research Council of Turkey (TÜBİTAK, Grant No: 108K242).
dc.identifier.isbn 978-2-9517408-7-7
dc.identifier.isbn 9782951740877
dc.identifier.scopus 2-s2.0-84926307827
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/6703
dc.language.iso English
dc.publisher EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
dc.relation.ispartof 8th International Conference on Language Resources and Evaluation (LREC)
dc.rights info:eu-repo/semantics/closedAccess
dc.source LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
dc.subject Turkish National Corpus, corpus construction, corpus linguistics
dc.subject Corpus Linguistics
dc.subject Corpus Construction
dc.subject Turkish National Corpus
dc.title Construction of the Turkish National Corpus (TNC)
dc.type Conference Object
dspace.entity.type Publication
gdc.author.id SEZER, TANER/0000-0002-7328-7650
gdc.author.id Demirhan, Umut Ufuk/0000-0001-8429-4680
gdc.author.scopusid 57198899045
gdc.author.scopusid 57198882338
gdc.author.scopusid 57198899166
gdc.author.scopusid 57198894479
gdc.author.scopusid 57193614606
gdc.author.scopusid 25621425200
gdc.author.scopusid 13408802300
gdc.author.wosid Demirhan, Umut Ufuk/G-6053-2015
gdc.author.wosid Oz, Seda/HTN-7749-2023
gdc.author.wosid Atasoy, Gülsüm/ABF-3441-2020
gdc.author.wosid YILMAZER, HAKAN/LBB-6979-2024
gdc.author.wosid SEZER, TANER/MFJ-6652-2025
gdc.author.wosid Aksan, Yeşim/ABF-2195-2020
gdc.author.wosid Aksan, Mustafa/ABF-1926-2020
gdc.coar.type text::conference output
gdc.description.department
gdc.description.departmenttemp [Aksan, Yesim; Aksan, Mustafa; Sezer, Taner; Mersinli, Umit; Demirhan, Umut Ufuk; Yilmazer, Hakan; Kurtoglu, Ozlem; Atasoy, Gulsum; Oz, Seda; Yildiz, Ipek] Mersin Univ, Fen Edebiyat Fak, TR-33343 Mersin, Turkey; [Koltuksuz, Ahmet] Yasar Univ, Muhendislik Fak, TR-35100 Izmir, Turkey
gdc.description.endpage 3227
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
gdc.description.startpage 3223
gdc.description.woscitationindex Conference Proceedings Citation Index - Social Science & Humanities
gdc.identifier.wos WOS:000323927703046
gdc.index.type WoS
gdc.index.type Scopus
gdc.scopus.citedcount 88
gdc.virtual.author Koltuksuz, Ahmet Hasan
gdc.wos.citedcount 39
oaire.citation.endPage 3227
oaire.citation.startPage 3223
person.identifier.orcid Demirhan- Umut Ufuk/0000-0001-8429-4680, SEZER- TANER/0000-0002-7328-7650
project.funder.name Scientific and Technological Research Council of Turkey (TUBITAK) [108K242]
relation.isAuthorOfPublication 0a146451-eb5a-43c9-bfca-979da9ee51d7
relation.isAuthorOfPublication.latestForDiscovery 0a146451-eb5a-43c9-bfca-979da9ee51d7
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files