This README.txt file was generated on <20211126> by ------------------- GENERAL INFORMATION ------------------- 1. Title of Dataset: Dataset and model for the project "Investigating contact languages with agent-based modelling" 2. Author Information First Author Contact Information Name: CHEUNG Ching Yat Faculty: Faculty of Arts Email: chingyat@connect.hku.hk Corresponding Author Contact Information Name: CHEUNG Ching Yat Faculty: Faculty of Arts Email: chingyat@connect.hku.hk Author Contact Information (if applicable) Name: N/A Faculty: N/A Email: N/A --------------------- DATA & FILE OVERVIEW --------------------- Directory of Files: A. Filename: Meta Short description: The folder contains 2 R file which were used to generate the illustrative figures used in the thesis. MetaImages: The generated images that has been used in the thesis for illustrative purposes. lexicon: An exploratory of the lexical etymology in Nigerian Pidgin and Chinese Pidgin English, ranked by frequency, with the available data. B. Filename: Emergence_of_Creoles: Short description: The folder contains the R files and results of the fourth chapter of the thesis. replica.R: A replication of the results in Tria et al. (2015) paper. replication_data: The results of the replication in .RData format and plot visualization. Files are named after the places from which the demographic data are obtained. main.R: The code that is used to link the other separate R files and conduct the simulation for-loop. initialization.R: The code sets up the agents; initial conditions are set in this file. network.R: The code constructs the computational network. Some relevant parameters are set here. interaction.R: The codecontains the executable interaction rules among agents. data: The results of the simulation with social networks (the Rdata and plots used) are inside this folder. Files are named after the conditions. C. Filename: Social_Interaction_Lexicon Short description: The folder contains the R files and results of the fifth chapter of the thesis. main.R: The code that is used to run the simulation (The steps and condition can be set manually). evaluation_data.R: After obtaining the simulation results from main.R, the script can be run directly to generate all the plots and automatically save them, as can be seen in the Appendices. data: These contain the .Rdata files that contain the outputs under different conditions as described in the thesis. data_analysis_plots: The visualization of the results which can be seen in the appdices of the thesis. Additional Notes on File Relationships, Context, or Content (for example, if a user wants to reuse and/or cite your data, what information would you want them to know?): Code used external packages which are cited in the thesis. File Naming Convention: N/A ----------------------------------------- DATA DESCRIPTION FOR: replication_data ----------------------------------------- 1. Number of variables: 1 2. Number of cases/rows: 16 3. Missing data codes: Code/symbol Definition Code/symbol Definition 4. Variable List A. Name: Population proportion Description: The variable is a list that consists of 3 numbers, which sum to 1. The numbers are the proportion of Bozals, Mulattos, and Europeans in a particular region. The proportion is as recorded in the historical demographic data. This independent variable can be used to predict the historical emergence of creoles in the model presented in Tria et al. (2015). This thesis has replicated the model by testing a total of 16 regions. ----------------------------------------- DATA DESCRIPTION FOR: Emergence_of_Creoles/data ----------------------------------------- 1. Number of variables: 2 2. Number of cases/rows: 9 3. Missing data codes: Code/symbol Definition Code/symbol Definition 4. Variable List A. Name: Internal network structure Description: There are 3 possible internal network structures, which are a scale-free network, a dense small-world network, or a sparse small-world network. It is used in combination with the external network structures to form a 3*3 experimental design. It can be edited by setting the Internal parameter {sf, sws, swd} of the function init.network in the script network.R. For details of the parameters, please refer to network.R. B. Name: External network structure Description: There are 3 possible external network structures, which are a mediated network, a sparse network, or a dense world network. It is used in combination with the internal network structures to form a 3*3 experimental design. It can be edited by setting the External parameter {d, m, l} of the function init.network in the script network.R. For details of the parameters, please refer to network.R. ----------------------------------------- DATA DESCRIPTION FOR: Social_Interaction_Lexicon/data ----------------------------------------- 1. Number of variables: 2 2. Number of cases/rows: 4 3. Missing data codes: Code/symbol Definition Code/symbol Definition 4. Variable List A. Name: Instance of E1 marjority departure Description: For each simulation, when a threshold of step has been reached, the E1 may depart. The time of departure tested in this thesis are {never, 100k, 1m}. B. Name: Instance of European groups complete departure Description: If the variable as described in (A) is set to 100k, there are two possible scenario. One is that the all European agents will exit the language ecology in 2-millionth step, or they will not exit the ecology and let the language develop until the 5 millionth step is reached. -------------------------- METHODOLOGICAL INFORMATION -------------------------- 1. Software-specific information: Name: R Version: 4.0.5 System Requirements: N/A Open Source? (Y/N): Y (if available and applicable) Executable URL: N/A Source Repository URL: N/A Developer: R core team Product URL: https://www.r-project.org/ Software source components: N/A Additional Notes(such as, will this software not run on certain operating systems?): N/A 2. Equipment-specific information: N/A Manufacturer: N/A Model: N/A (if applicable) Embedded Software / Firmware Name: N/A Embedded Software / Firmware Version: N/A Additional Notes: N/A