Construction of the Turkish National Corpus (TNC)
| dc.contributor.author | Yesim Aksan | |
| dc.contributor.author | Mustafa Aksan | |
| dc.contributor.author | Ahmet Koltuksuz | |
| dc.contributor.author | Taner Sezer | |
| dc.contributor.author | Umit Mersinli | |
| dc.contributor.author | Umut Ufuk Demirhan | |
| dc.contributor.author | Hakan Yilmazer | |
| dc.contributor.author | Ozlem Kurtoglu | |
| dc.contributor.author | Gulsum Atasoy | |
| dc.contributor.author | Seda Oz | |
| dc.contributor.author | Ipek Yildiz | |
| dc.contributor.author | Sezer, Taner | |
| dc.contributor.author | Yildiz, Ipek | |
| dc.contributor.author | Mersinli, Ümit | |
| dc.contributor.author | Koltuksuz, Ahmet | |
| dc.contributor.author | Aksan, Mustafa | |
| dc.contributor.author | Demirhan, Umut Ufuk | |
| dc.contributor.author | Aksan, Yesim | |
| dc.contributor.editor | N Calzolari | |
| dc.contributor.editor | K Choukri | |
| dc.contributor.editor | T Declerck | |
| dc.contributor.editor | MU Dogan | |
| dc.contributor.editor | B Maegaard | |
| dc.contributor.editor | J Mariani | |
| dc.contributor.editor | J Odijk | |
| dc.contributor.editor | S Piperidis | |
| dc.coverage.spatial | Istanbul TURKEY | |
| dc.date.accessioned | 2025-10-06T16:21:04Z | |
| dc.date.issued | 2012 | |
| dc.description.abstract | This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its web-based user-friendly search interface. | |
| dc.description.sponsorship | TÜBİTAK, (108K242); Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK | |
| dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TUBITAK) [108K242] | |
| dc.description.sponsorship | CELI - Language and Information Technology; European Media Laboratory GmbH (EML); IMMI; Meta; Nuance; Quaero | |
| dc.description.sponsorship | TNC was supported by a research grant from the Scientific and Technological Research Council of Turkey (TÜBİTAK, Grant No: 108K242). | |
| dc.identifier.isbn | 978-2-9517408-7-7 | |
| dc.identifier.isbn | 9782951740877 | |
| dc.identifier.scopus | 2-s2.0-84926307827 | |
| dc.identifier.uri | https://gcris.yasar.edu.tr/handle/123456789/6703 | |
| dc.language.iso | English | |
| dc.publisher | EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA | |
| dc.relation.ispartof | 8th International Conference on Language Resources and Evaluation (LREC) | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.source | LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | |
| dc.subject | Turkish National Corpus, corpus construction, corpus linguistics | |
| dc.subject | Corpus Linguistics | |
| dc.subject | Corpus Construction | |
| dc.subject | Turkish National Corpus | |
| dc.title | Construction of the Turkish National Corpus (TNC) | |
| dc.type | Conference Object | |
| dspace.entity.type | Publication | |
| gdc.author.id | SEZER, TANER/0000-0002-7328-7650 | |
| gdc.author.id | Demirhan, Umut Ufuk/0000-0001-8429-4680 | |
| gdc.author.scopusid | 57198899045 | |
| gdc.author.scopusid | 57198882338 | |
| gdc.author.scopusid | 57198899166 | |
| gdc.author.scopusid | 57198894479 | |
| gdc.author.scopusid | 57193614606 | |
| gdc.author.scopusid | 25621425200 | |
| gdc.author.scopusid | 13408802300 | |
| gdc.author.wosid | Demirhan, Umut Ufuk/G-6053-2015 | |
| gdc.author.wosid | Oz, Seda/HTN-7749-2023 | |
| gdc.author.wosid | Atasoy, Gülsüm/ABF-3441-2020 | |
| gdc.author.wosid | YILMAZER, HAKAN/LBB-6979-2024 | |
| gdc.author.wosid | SEZER, TANER/MFJ-6652-2025 | |
| gdc.author.wosid | Aksan, Yeşim/ABF-2195-2020 | |
| gdc.author.wosid | Aksan, Mustafa/ABF-1926-2020 | |
| gdc.coar.type | text::conference output | |
| gdc.description.department | ||
| gdc.description.departmenttemp | [Aksan, Yesim; Aksan, Mustafa; Sezer, Taner; Mersinli, Umit; Demirhan, Umut Ufuk; Yilmazer, Hakan; Kurtoglu, Ozlem; Atasoy, Gulsum; Oz, Seda; Yildiz, Ipek] Mersin Univ, Fen Edebiyat Fak, TR-33343 Mersin, Turkey; [Koltuksuz, Ahmet] Yasar Univ, Muhendislik Fak, TR-35100 Izmir, Turkey | |
| gdc.description.endpage | 3227 | |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
| gdc.description.startpage | 3223 | |
| gdc.description.woscitationindex | Conference Proceedings Citation Index - Social Science & Humanities | |
| gdc.identifier.wos | WOS:000323927703046 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.scopus.citedcount | 88 | |
| gdc.virtual.author | Koltuksuz, Ahmet Hasan | |
| gdc.wos.citedcount | 39 | |
| oaire.citation.endPage | 3227 | |
| oaire.citation.startPage | 3223 | |
| person.identifier.orcid | Demirhan- Umut Ufuk/0000-0001-8429-4680, SEZER- TANER/0000-0002-7328-7650 | |
| project.funder.name | Scientific and Technological Research Council of Turkey (TUBITAK) [108K242] | |
| relation.isAuthorOfPublication | 0a146451-eb5a-43c9-bfca-979da9ee51d7 | |
| relation.isAuthorOfPublication.latestForDiscovery | 0a146451-eb5a-43c9-bfca-979da9ee51d7 | |
| relation.isOrgUnitOfPublication | ac5ddece-c76d-476d-ab30-e4d3029dee37 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | ac5ddece-c76d-476d-ab30-e4d3029dee37 |
