Persistent Identifier
|
doi:10.18710/SZZDLI |
Publication Date
|
2024-02-13 |
Title
| Replication Data for: Zooming in on the semantics of French ingressives: a collostructional analysis |
Author
| Verroens, Filip (Ghent University) - ORCID: 0000-0002-4604-7205 |
Point of Contact
|
Use email button above to contact.
Verroens, Filip (Ghent University) |
Description
| Dataset abstract: The dataset includes an annotated corpus sample of N = 2000 French sentences with se mettre à or commencer à (1000 observations of each verb). The sample was drawn from the literary corpus Frantext and the journalistic corpus Le Monde (1000 observations from both corpora). The sample is balanced for verb as well as corpus, so we have 500 observations for each Verb-Corpus combination. The data is annotated for 3 variables: Source (corpus), Verb, collexeme.
Article abstract: This paper examines the semantic value of the infinitive in the ingressive constructions se mettre à (SMA) and commencer à (COMA) using a distinctive collexeme analysis. We find that the collexemes significant for the construction SMA are fairly homogeneous across the different corpora and can be grouped into the general category of expressive collexemes. The collexemes significant for COMA are more heterogeneous and belong to the category of cognitive collexemes and to semantic fields of sensory and creative acts. The results are compatible with the hypothesis put forward by Verroens and De Cuypere (2023) stating that the overall meaning of the SMA construction is intrinsically punctual. The punctual value of SMA is not only compatible with expressive collexemes, but, moreover, emphasizes their unforeseen and unintentional meaning. Conversely, the incremental value of COMA is consistent with the gradual onset of cognitive and sensory collexemes.
Verroens, F., & De Cuypere, L. (2023). French ingressives and (phasal) aspect: A frame-semantic corpus-based analysis. Canadian Journal of Linguistics/Revue Canadienne de Linguistique, 68(3), 435-461. doi:10.1017/cnj.2023.19 (2024-01-09) |
Subject
| Arts and Humanities |
Keyword
| French
ingressive
selection-theoretical approaches to aspect
coercion
construction grammar |
Related Publication
| Verroens, F., & De Cuypere, L. (2023). French ingressives and (phasal) aspect: A frame-semantic corpus-based analysis. Canadian Journal of Linguistics/Revue Canadienne de Linguistique, 68(3), 435-461. doi:10.1017/cnj.2023.19 doi: 10.1017/cnj.2023.19 https://doi.org/10.1017/cnj.2023.19
Verroens, F. (in press). Zooming in on the semantics of French ingressives : a collostructional analysis. Journal of French Language Studies, 24. |
Language
| English |
Producer
| Ghent University (UGent) https://www.ugent.be/ |
Production Date
| 2023 |
Production Location
| Belgium, Flanders |
Contributor
| Researcher : Verroens Filip |
Distributor
| The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/ |
Depositor
| Verroens, Filip |
Deposit Date
| 2024-01-09 |
Date of Collection
| Start Date: 1985-01-01 ; End Date: 2000-01-01
Start Date: 2005-01-01 ; End Date: 2006-09-01 |
Data Type
| annotated corpus data |
Software
| PerlClx, Version: 1.0b
MS Excel, Version: Microsoft Office Professional Plus 2016
Abundantia Verborum |
Related Dataset
| Verroens, Filip; De Cuypere, Ludovic, 2022, "Replication Data for: French ingressives and (phasal) aspect. A frame-semantic corpus-based analysis", https://doi.org/10.18710/WVW9U4, DataverseNO, V1 |
Data Source
| The collexeme analysis reported in this dataset was carried out by conducting searches in the following two corpora: Le Monde and Frantext.
1. Le Monde is a (commercial) monolingual tokenized corpus of written French.
Reference: Text corpus of "Le Monde", ELRA catalogue (http://catalog.elra.info), ISLRN: 421-401-527-366-2, ELRA ID: ELRA-W0015.
2. Frantext is a (commercial) historical, lemmatized and Part of Speech tagged corpus of written French and includes data from 950 to today.
Reference: ATILF. Base textuelle Frantext (En ligne). ATILF-CNRS & Université de Lorraine. 1998-2022. https://www.frantext.fr/ (data retrieved in 2011).
The infinitival verb forms in the data files of this dataset represent the lexemes which the corpora listed above were searched for; they do not represent coherent stretches of text. Therefore, the reuse (including redistribution) of these elements is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. US Copyright Act), Fair dealing (UK; cf. Exceptions to copyright), the EU Database Directive (cf. art 8 Rights and obligations of lawful users), and the Norwegian Copyright Act (cf. § 24 Eneretten til databaser). |