Replication Data for: Alea iacta est. Insights from corpus semantics into the diachrony of the Latin passive

Version 1.0

Aerts, Simon, 2024, "Replication Data for: Alea iacta est. Insights from corpus semantics into the diachrony of the Latin passive", https://doi.org/10.18710/VSMP3U, DataverseNO, V1

Learn about Data Citation Standards.

Contact Owner

Dataset Metrics

11 Downloads

Description	Dataset includes annotated corpus data from Latin texts from the 3rd c. BCE until the 10th c. CE. Attestations of forms of both the original construction ('PP-BE.inf.', e.g. 'cantatus est') and the innovation ('PP-BE.perf.', e.g. 'cantatus fuit') for the expression of the passive (taken together with forms of deponent verbs into 'nonactive') of perfectum stem tenses were extracted from major online corpora (331.131 data points); a random sample (n = 383) of data points of PP-BE.inf. that represents all text types and time periods as evenly as possible was then subjected to a close-reading analysis in order to ascertain the attestation rate of meanings for which it competed with the innovation PP-BE.perf. (specialized in ANTERIORITY) and the present tense (specialized in present events or situations). Only the data points that were annotated in full in this second phase are included in the current dataset. For the data points examined in the first phase, only the formal categories in the list below were annotated to the extent that these annotations are not subject to interpretation. (2023-08-22)
Subject	Arts and Humanities
Keyword	Latin passive system, quantitative and qualitative linguistics, form-function pairings, tense-aspect systems, diachronic linguistics
Related Publication	Aerts, S. “Alea iacta est. Insights from corpus semantics into the diachrony of the Latin passive.” Submitted for review.
License/Data Use Agreement	Custom Dataset Terms

Filter by

	1 to 3 of 3 Files	Download
	0_README_Alea.txt Plain Text - 22.5 KB Published Jan 17, 2024 5 Downloads MD5: 249d0b1a6837658802dcf6c5c2265e21	Preview "0_README_Alea.txt" Access File File Access Public Download Options Plain Text Download Metadata Data File Citation EndNote XML RIS BibTeX
	Alea.txt Plain Text - 349.1 KB Published Jan 17, 2024 4 Downloads MD5: 887aae9b172eeccfef2b0842ee8882c5	Preview "Alea.txt" Access File File Access Public Download Options Plain Text Download Metadata Data File Citation EndNote XML RIS BibTeX
	Alea.xlsx MS Excel Spreadsheet - 160.3 KB Published Jan 17, 2024 2 Downloads MD5: 7c27de60c59653f7e59c5b1daea79214	Access File File Access Public Download Options MS Excel Spreadsheet Download Metadata Data File Citation EndNote XML RIS BibTeX

Citation Metadata

Persistent Identifier	doi:10.18710/VSMP3U
Publication Date	2024-01-17
Title	Replication Data for: Alea iacta est. Insights from corpus semantics into the diachrony of the Latin passive
Author	Aerts, Simon (Ghent University) - ORCID: 0000-0003-1852-9255
Point of Contact	Use email button above to contact. Aerts, Simon (Ghent University)
Description	Dataset includes annotated corpus data from Latin texts from the 3rd c. BCE until the 10th c. CE. Attestations of forms of both the original construction ('PP-BE.inf.', e.g. 'cantatus est') and the innovation ('PP-BE.perf.', e.g. 'cantatus fuit') for the expression of the passive (taken together with forms of deponent verbs into 'nonactive') of perfectum stem tenses were extracted from major online corpora (331.131 data points); a random sample (n = 383) of data points of PP-BE.inf. that represents all text types and time periods as evenly as possible was then subjected to a close-reading analysis in order to ascertain the attestation rate of meanings for which it competed with the innovation PP-BE.perf. (specialized in ANTERIORITY) and the present tense (specialized in present events or situations). Only the data points that were annotated in full in this second phase are included in the current dataset. For the data points examined in the first phase, only the formal categories in the list below were annotated to the extent that these annotations are not subject to interpretation. (2023-08-22)
Subject	Arts and Humanities
Keyword	Latin passive system quantitative and qualitative linguistics form-function pairings tense-aspect systems diachronic linguistics
Related Publication	Aerts, S. “Alea iacta est. Insights from corpus semantics into the diachrony of the Latin passive.” Submitted for review.
Language	English
Producer	Ghent University https://www.ugent.be/en
Contributor	Data Curator : Cluyse, Brian
Funding Information	Research Foundation - Flanders: Grant number: 1282722N
Distributor	The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/
Depositor	Aerts, Simon
Deposit Date	2023-08-22
Time Period	Start Date: 200BCE ; End Date: 0950
Date of Collection	Start Date: 2021-11-01 ; End Date: 2023-04-15
Data Type	Annotated corpus data
Series	Tracing change and reaction in the Latin tense system: The datasets in this series contain the replication data for research papers published within the FWO-funded project "Tracing change and reaction in the Latin tense system: an empirical analysis of language-internal and language-external influences on the development of morphological innovations and form-function pairings from Early Latin to Early Romance".
Software	R
Data Source	The data contained in this dataset originate from the following sources: CDC: Codex diplomaticus Cavensis Vol. 1. (8th - 10th c. CE). Diplomatic charters from the context of the Lombard rule of Central Italy (Campania)(n = 505; 0,15%). This data is accessible under the CC BY-NC-ND license. ECDS: Epigraphik-Datenbank Clauss/Slaby. Epigraphic texts (0.99 % of all attestations). ECDS does not provide a user license / Terms of Use, except for the following disclaimer: "All texts, pictures and graphics published on this website are subject to copyright and other laws for the protection of intellectual property". LASLA: Laboratoire d’Analyse Statistique des Langues Anciennes - Hyperbase. (2nd c. BCE – 2nd c. CE): classical, literary texts (5,23% of all attestations). LASLA does not provide a user license / Terms of Use, except for the general copyright statement in the about section of the LASLA Opera Latina website: Copyright LASLA - CIPL 2014. LLT: Library of Latin texts. (3rd c. BC - 8th c. CE): all text types from all periods of natural language use (n = 13.119; 92,78% of all attestations). LLT is part of the BREPOLiS databases, for which the BREPOLiS Terms and Conditions apply. The BREPOLiS Terms and Conditions entitle users "to extract and re-utilize, for non-commercial purposes only, any insubstantial parts of the contents of the Database". PaLaFra: The transition from Latin to French: constitution and analysis of a Latin-French digital corpus. PaLaFra-Lat-V2 (5th - 10th c. CE): various text types, mainly from the Merovingian period (0,84% of all attestations). The subcorpus PaLaFraLat is accessible under the CC BY-NC-SA 4.0 license. Papyri.info: Papyri.info. 2nd - 5th c. CE. Texts on papyri (mainly personal letters) which provide direct access to everyday language (n = 22; < 0,01%). Papyri.info does not provide any user license / Terms of Use. The extracted text fragments that are contained in the data file of this dataset only represent non-substantial portions of the sources listed above, and they do not represent coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. US Copyright Act), Fair dealing (UK; cf. Exceptions to copyright), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (Norway; cf. § 14 in Åndsverkloven), "uvesentlige deler av databaser" (Norway; cf. § 24 in Åndsverkloven), "sitatretten" (Norway; cf. § 29 in Åndsverkloven). As these excerpts do not represent substantial parts of the reused sources, the redistribution of these excerpts is according to Creative Commons (CC) also permitted if they are extracted from sources that are distributed under Creative Commons licenses (cf. question "Do I always have to comply with the license terms? If not, what are the exceptions?" in the Creative Commons Frequently Asked Questions).

Dataset Terms

License/Data Use Agreement

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom Dataset Terms — the following Custom Dataset Terms have been defined for this dataset.

This dataset may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: https://creativecommons.org/licenses/by-nc/4.0/.

Dataset Version	Summary	Contributors	Published on
No records found.

Edit File

This file has already been deleted (or replaced) in the current version. It may not be edited.

Restrict Access

Restricting limits access to published files. People who want to use the restricted files can request access by default. If you disable request access, you must add information about access to the Terms of Access field.

Learn about restricting files and dataset access in the User Guide.

Request Access

Enable access request

You must enable request access or add terms of access to restrict file access.

Terms of Access for Restricted Files

Save Changes

Edit Embargo

The selected file or files have already been published. Contact an administrator to change the embargo date or reason of the file or files.

Delete Files

The file will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.

Select File(s)

Please select one or more files.

Share Dataset

Share this dataset on your favorite social media networks.

Dataset Citations

Citations for this dataset are retrieved from Crossref via DataCite using Make Data Count standards. For more information about dataset metrics, please refer to the User Guide.

Sorry, no citations were found.

Restricted Files Selected

The selected file(s) may not be downloaded because you have not been granted access.

You may request access to the restricted file(s) by clicking the Request Access button.

Download Options

The files selected are too large to download as a ZIP.

You can select individual files that are below the 4.7 GB download limit from the files table, or use the Data Access API for programmatic access to the files.

Select File(s)

Please select a file or files to be downloaded.

Restricted Files Selected

The restricted file(s) selected may not be downloaded because you have not been granted access.

Click Continue to download the files you have access to download.

Delete Dataset

Are you sure you want to delete this dataset and all of its files? You cannot undelete this dataset.

Delete Draft Version

Are you sure you want to delete this draft version? Files will be reverted to the most recently published version. You cannot undelete this draft.

Unpublished Dataset Private URL

Use a Private URL to allow those without Dataverse accounts to access your unpublished dataset. For more information about the Private URL feature, please refer to the User Guide.

Private URL has not been created.

Unpublished Dataset Private URL

Are you sure you want to disable the Private URL? If you have shared the Private URL with others they will no longer be able to use it to access your unpublished dataset.

Delete Files

The file(s) will be deleted after you click on the Delete button.

Files will not be removed from previously published versions of the dataset.

Compute

This dataset contains restricted files you may not compute on because you have not been granted access.

Deaccession Dataset

Are you sure you want to deaccession? The selected version(s) will no longer be viewable by the public.

Deaccession Dataset

Are you sure you want to deaccession this dataset? It will no longer be viewable by the public.

Version Differences Details

Please select two versions to view the differences.

Version Differences Details

Version:
Last Updated:

Select File(s)

Please select a file or files for access request.

Select File(s)

Embargoed files cannot be accessed. Please select an unembargoed file or files for your access request.

Edit Tags

Select existing file tags or create new tags to describe your files. Each file can have more than one tag.

Request Access

You need to Log In to request access.

Dataset Terms

This dataset is made available under the following terms. Please confirm and/or complete the information needed below in order to continue.

License/Data Use Agreement

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation shown on the dataset page.

Custom terms specific to this dataset Custom Dataset Terms — the following Custom Dataset Terms have been defined for this dataset.

Terms of Use This dataset may be reused according to the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license as described here: https://creativecommons.org/licenses/by-nc/4.0/.

Preview Guestbook

Upon downloading files the guestbook asks for the following information.

Guestbook Name

Collected Data

Account Information

Package File Download

Use the Download URL in a Wget command or a download manager to download this package file. Download via web browser is not recommended. User Guide - Downloading a Dataverse Package via URL

Download URL

https://dataverse.no/api/access/datafile/

Request Access

Please confirm and/or complete the information needed below in order to request access to files in this dataset.

Compute Batch

Clear Batch

Dataset	Persistent Identifier	Change Compute Batch

Compute Batch

Submit for Review

You will not be able to make changes to this dataset while it is in review.

Publish Dataset

Are you sure you want to republish this dataset?

Select if this is a minor or major version update.

Minor Release (1.1)

Major Release (2.0)

Publish Dataset

This dataset cannot be published until TROLLing is published by its administrator.

Publish Dataset

This dataset cannot be published until TROLLing and DataverseNO are published.

Return to Author

Return this dataset to contributor for modification.