The eBird data for the Crested Myna was filtered and tidied using the AUK package in R. The code used for this process is available in our repository Crested Myna Data Analysis. Once the data was preprocessed, it was further analyzed to understand the distribution of the species across different countries. This involved spatially joining eBird observations with country polygons to calculate the number of bird observations in each country over time. Additionally, each country with Crested Myna sightings was categorized as either part of the species' native range or outside of it, according information from bibliography.
The country polygon layer was initially processed with PostGIS to ensure all geometries were valid. After validation, the polygons were simplified to reduce geometric complexity, optimizing the data for spatial analysis without sacrificing significant detail. The SQL code can be found in our repository, in the django migrations applied over the database.
The world cities layer used in this project was created by merging two distinct datasets: "Cities" and "World Cities". Both shapefiles contained many major cities, but each included unique entries not found in the other. To ensure comprehensive coverage, a union operation was performed to combine the layers into a single dataset. Duplicate entries were then identified and removed based on both attribute values and spatial location.
This species prefers open areas such as agricultural lands, parks and urban spaces. A spatial analysis was conducted to determine whether, outside its native range, this species occupies rural areas or remains within urban zones like other exotic species such as the house sparrow (Passer domesticus).
Urban areas worldwide were determined by creating variable-radius buffers around the city layer. Since the dataset includes only the major cities of the world, a minimum radius of 20 km and a maximum radius of 200 km were set based on the population size of each city.
The species observations were then classified based on whether they were located within urban areas or not.
Bibliography: