Construction of the Turkish national corpus (TNC)
| dc.contributor.author | Yeşim Aksan | |
| dc.contributor.author | Mustafa Aksan | |
| dc.contributor.author | Ahmet Hasan Koltuksuz | |
| dc.contributor.author | Taner Sezer | |
| dc.contributor.author | Ümit Mersinli | |
| dc.contributor.author | Umut Ufuk Demirhan | |
| dc.contributor.author | Hakan Yilmazer | |
| dc.contributor.author | Özlem Kurtoglu | |
| dc.contributor.author | Gülsüm Atasoy | |
| dc.contributor.author | Seda Öz | |
| dc.contributor.editor | M.U. Dogan , J. Mariani , A. Moreno , S. Goggi , K. Choukri , N. Calzolari , J. Odijk , T. Declerck , B. Maegaard , S. Piperidis , H. Mazo , O. Hamon | |
| dc.date.accessioned | 2025-10-06T17:52:57Z | |
| dc.date.issued | 2012 | |
| dc.description.abstract | This paper addresses theoretical and practical issues experienced in the construction of Turkish National Corpus (TNC). TNC is designed to be a balanced large scale (50 million words) and general-purpose corpus for contemporary Turkish. It has benefited from previous practices and efforts for the construction of corpora. In this sense TNC generally follows the framework of British National Corpus yet necessary adjustments in corpus design of TNC are made whenever needed. All throughout the process different types of open-source software are used for specific tasks and the resulting corpus is a free resource for non-commercial use. This paper presents TNC's design features web-based corpus management system carefully planned workflow and its webbased user-friendly search interface. © 2017 Elsevier B.V. All rights reserved. | |
| dc.description.sponsorship | CELI - Language and Information Technology, European Media Laboratory GmbH (EML), IMMI, Meta, Nuance, Quaero | |
| dc.identifier.isbn | 9782951740877 | |
| dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84926307827&partnerID=40&md5=e311fbc0ebeb16064d2317f227e2c7b6 | |
| dc.identifier.uri | https://gcris.yasar.edu.tr/handle/123456789/10193 | |
| dc.language.iso | English | |
| dc.publisher | European Language Resources Association (ELRA) | |
| dc.relation.ispartof | 8th International Conference on Language Resources and Evaluation LREC 2012 | |
| dc.subject | Corpus Construction, Corpus Linguistics, Turkish National Corpus, Open Systems, Software Engineering, British National Corpora, Corpus Construction, Corpus Linguistics, Design Features, Management Systems, Practical Issues, Search Interfaces, Turkishs, Open Source Software | |
| dc.subject | Open systems, Software engineering, British national corpora, Corpus construction, Corpus linguistics, Design features, Management systems, Practical issues, Search interfaces, Turkishs, Open source software | |
| dc.title | Construction of the Turkish national corpus (TNC) | |
| dc.type | Conference Object | |
| dspace.entity.type | Publication | |
| gdc.coar.type | text::conference output | |
| gdc.index.type | Scopus | |
| oaire.citation.endPage | 3227 | |
| oaire.citation.startPage | 3223 | |
| person.identifier.scopus-author-id | Aksan- Yeşim (25621425200), Aksan- Mustafa (57198899045), Koltuksuz- Ahmet Hasan (13408802300), Sezer- Taner (57193614606), Mersinli- Ümit (57198894479), Demirhan- Umut Ufuk (57198882338), Yilmazer- Hakan (57198886801), Kurtoglu- Özlem (55103539300), Atasoy- Gülsüm (57198885669), Öz- Seda (57198890314) | |
| project.funder.name | TNC was supported by a research grant from the Scientific and Technological Research Council of Turkey (TÜBİTAK Grant No: 108K242). | |
| relation.isOrgUnitOfPublication | ac5ddece-c76d-476d-ab30-e4d3029dee37 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | ac5ddece-c76d-476d-ab30-e4d3029dee37 |
