Skip to content

Liyubov/citizen_science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Citizen science open data analysis

Here we do data analysis for citizen science projects with the open data https://www.inaturalist.org/projects/city-nature-challenge-2019 The project is in progress and discussions between @Liubov Tupikina, Muki Haklay, CRI, UCL, Bastian, CRI, Open Humans and team from CorrelAid.

The project is in process. See the correlaid team github for the names of team members.

The main repository is posted here https://github.com/correlaid-paris/citizen_science_inaturalist Preliminary notebook for analysis is "citizen_science_analysis".

The result of the project between City Interaction lab, CorrelAid, UCL, CRI is the paper [Liubov Tupikina, Frank Schlosser, Vadim Voskresenskii, Katharina Kloppenborg, Florence Lopez, Albrecht Mariz, Anna Mogilevskaja, Muki Haklay, Bastian Greshake Tzovaras] "iNaturalist citizen science community during City Nature Challenge: new computational approach for analysis of user activity" https://arxiv.org/abs/2112.02693

Reseach Questions (RQs)

RQ1: Can we identify macroscopic laws of citizen science projects? The main goal of the project to identify short-term and long-term dynamics in citizen science projects. (general RQ)

RQ2: How citizen science communities are growing in time, what drives their growth if not preferential attachment or exogenous factors?

RQ3: If we are looking at City Nature Challenge data - we have now data for San Francisco, LA, and London, which now have 4 or 3 years time series. How the power law like distributions of human activities are emerging? It's interesting that it is similar for SF and LA and the pattern that they have in year 4, London has in year 3.

RQ4: (a) Are there differences in data collection patterns in each year and between years? Are there also emerging pattern over time in a location? (b) Are there also emerging pattern over time in a location?

RQ5: To what degree the heavy contributors influence the total pattern and distribution of data? Is there any observed impact from major contributors (e.g. someone who is heavy contributors dropping off because they can't keep up with the competition). There is also the question about repeat recruitment - are the pattern of one timers vs people who come back stay the same from year to year? Are there any patterns or places that are behaving differently in terms of patterns of contribution?

RQ6: Since there are specific patterns of participation for different people from various countries we would like to understand the reason of power-law of participation (some participate a lot, many participate only little). We would like to understand how this pattern emerge dynamically (M.Pocco).

RQ7: What is the dfference between citizen science projects evolution in different cities around the globe? London vs. San Francisco? What are parameters which may lead to differences?

Some of the problems for citizen science community are: heterogeneity of collected data, missing data, non-controlled growth of community of observers.

Participate

The project is now discussed together with CorrelAid X Paris x CRI https://github.com/correlaid-paris/

Contact person: Liubov Tupikina (CRI, Bell labs), Muki Haklay (UCL), Frie (Correlaid).

About

Data analysis for citizen science projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published