Dataset

The MIXPAR project aims to compile all available data into a standardized tabular format that can be easily exported (e.g., as CSV files) and processed in widely used statistical and linguistic software such as R.

The first version of the dataset is now publicly available:

Štichauer, P. & Ripamonti, F. (2025). MIXPAR Database: Version 1.0 (September 2025). LINDAT/CLARIAH-CZ Digital Library, Institute of Formal and Applied Linguistics (ÚFAL), Charles University. http://hdl.handle.net/11234/1-5982

There is also an interactive online database:

Čapka, T., Ripamonti, F., & Štichauer, P. (2025). MIXPAR: Mixed Paradigms [Computer software]. Retrievable from https://korpus.cz/mixpar/