Trend AnalysisInterdisciplinaryMixed Methods

Citizen Science Platforms and Public Participation in Research

Citizen science platforms turn millions of volunteers into data collectors, expanding the geographic and temporal reach of scientific observation far beyond what professional researchers can achieve alone. But data quality remains the central challenge—and AI is emerging as the bridge between volunteer enthusiasm and scientific rigor.

By Sean K.S. Shin

This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

Why It Matters

Professional scientists cannot be everywhere. Biodiversity monitoring requires observations across vast geographic areas, at all times of day and year, in habitats that may be remote or politically inaccessible. Citizen science platforms—eBird, iNaturalist, Zooniverse, and hundreds of specialized alternatives—solve this scale problem by recruiting volunteers who contribute observations from their backyards, hiking trails, and neighborhoods.

The numbers are staggering: eBird alone collects over 200 million bird observations annually from millions of contributors worldwide. iNaturalist has documented over 150 million observations of life on Earth. These datasets now rival or exceed the scale of professional monitoring networks, and they are increasingly used in conservation policy, environmental impact assessments, and climate change research.

But a fundamental tension persists: volunteers are not trained scientists, and their observations carry biases—spatial bias (concentrated near roads and population centers), temporal bias (concentrated on weekends and holidays), taxonomic bias (charismatic species overrepresented), and identification errors. Resolving this tension between scale and quality is the defining challenge of modern citizen science.

The Science

Evaluating Dataset Adequacy for Monitoring

Backstrom et al. (2024), with 11 citations, develop a systematic framework for assessing whether citizen science datasets are adequate for biodiversity monitoring purposes. Rather than asking "is citizen science data good?" in the abstract, they ask the operationally useful question: "for a specific species, in a specific region, does the available citizen science data support statistically valid trend estimation?"

Their framework evaluates spatial coverage, temporal continuity, detection probability, and sample size against the minimum thresholds needed for trend detection. Applied to Australian biodiversity data, the results are nuanced: citizen science data is adequate for monitoring common, widespread species but insufficient for rare, cryptic, or geographically restricted species—precisely the species of greatest conservation concern.

Data Quality Assessment in Practice

Prenda et al. (2024), with 2 citations, conduct a detailed quality assessment of citizen science bird observation data across the Iberian Peninsula. They compare citizen science records against expert surveys to quantify identification accuracy, spatial completeness, and temporal consistency.

Key findings: identification accuracy is high for common and distinctive species (>95%) but drops substantially for look-alike species pairs and juvenile birds. Mobile applications with integrated photo verification significantly improve accuracy compared to checklist-only submissions. The study quantifies what practitioners have long suspected: the quality of citizen science data is highly variable and depends heavily on platform design, verification mechanisms, and the specific taxonomic challenge.

Human-AI Collaboration for Quality Management

Prastowo et al. (2025) propose an architectural framework that integrates AI directly into citizen science biodiversity platforms. Their design addresses the multi-taxon challenge—platforms that handle plants, animals, fungi, and insects simultaneously, each with different identification difficulties and expert communities.

The architecture implements three AI layers: (1) automated species identification using computer vision models trained on verified observations, (2) anomaly detection that flags unusual records for expert review, and (3) adaptive task routing that matches observation verification tasks to volunteers based on their demonstrated expertise level. This represents a shift from post-hoc quality control (cleaning data after collection) to real-time quality assurance (preventing errors at the point of observation).

Community-Centered Approaches

Omar et al. (2025), with 1 citation, document a citizen science initiative in Kalimantan, Indonesia, that takes a fundamentally different approach: rather than recruiting individual volunteers into a global platform, it embeds biodiversity monitoring within existing community governance structures. Local villagers—who already patrol their forests for resource management—add standardized wildlife observations to their routine activities.

This community-centered model produces different data patterns: higher temporal regularity (daily patrols versus recreational birdwatching), coverage of remote areas that global platforms miss, and integration of local ecological knowledge that aids species identification. The trade-off is scale—it produces fewer total observations but with higher contextual richness per observation.

Citizen Science Data Quality Framework

Quality Dimension	Challenge	Emerging Solution
Species ID Accuracy	Varies by taxa difficulty	AI-assisted identification + photo verification
Spatial Bias	Concentrated near roads/cities	Targeted campaigns for undersampled areas
Temporal Bias	Weekend/holiday peaks	Community-based models with regular patrols
Taxonomic Bias	Charismatic species overrepresented	Gamification for underreported taxa
Adequacy for Monitoring	Sufficient for common species only	Hybrid designs combining citizen + professional data

What To Watch

The convergence of citizen science and AI is accelerating rapidly. Expect platforms to move from passive data collection to active adaptive sampling—AI models that identify the highest-value observations needed to fill knowledge gaps and direct volunteers to specific locations, times, and target species. The community-centered model demonstrated in Indonesia may prove particularly important for the Global South, where smartphone-based platforms face connectivity and cultural barriers. The next frontier is integrating citizen science data into formal regulatory frameworks—using crowdsourced observations as legally admissible evidence in environmental impact assessments and conservation law enforcement.

Explore related work through ORAA ResearchBrain.

면책 조항: 이 게시물은 정보 제공을 목적으로 한 연구 동향 개요이다. 학술 연구에서 인용하기 전에 구체적인 연구 결과, 통계 및 주장을 원본 논문과 대조하여 검증해야 한다.

시민 과학 플랫폼과 연구에 대한 공공 참여

중요성

전문 과학자들이 모든 곳에 있을 수는 없다. 생물다양성 모니터링은 광대한 지리적 영역에 걸쳐, 하루 중 어느 때나 연중 어느 계절에나, 외딴곳이거나 정치적으로 접근하기 어려운 서식지에서도 관측이 이루어져야 한다. eBird, iNaturalist, Zooniverse, 그리고 수백 개의 특화된 플랫폼들로 구성된 시민 과학 플랫폼은 자신의 뒷마당, 등산로, 동네에서 관측 데이터를 제공하는 자원봉사자를 모집함으로써 이러한 규모의 문제를 해결한다.

그 수치는 놀랍다. eBird 하나만 해도 전 세계 수백만 명의 기여자로부터 매년 2억 건 이상의 조류 관측 데이터를 수집한다. iNaturalist는 지구상의 생물에 대한 1억 5천만 건 이상의 관측 데이터를 기록하였다. 이러한 데이터셋은 이제 전문 모니터링 네트워크의 규모에 필적하거나 이를 초과하며, 보전 정책, 환경 영향 평가, 기후변화 연구에서 점차 활용되고 있다.

그러나 근본적인 긴장 관계는 여전히 존재한다. 자원봉사자는 훈련된 과학자가 아니며, 그들의 관측에는 편향이 내재되어 있다. 즉, 도로와 인구 밀집 지역 근처에 집중되는 공간적 편향, 주말과 휴일에 집중되는 시간적 편향, 카리스마 있는 종이 과대 표현되는 분류학적 편향, 그리고 동정 오류 등이 존재한다. 규모와 품질 사이의 이 긴장 관계를 해소하는 것이 현대 시민 과학의 핵심 과제이다.

연구 내용

모니터링을 위한 데이터셋 적합성 평가

Backstrom et al. (2024)은 11회 인용을 기록하며, 시민 과학 데이터셋이 생물다양성 모니터링 목적에 적합한지를 평가하기 위한 체계적인 프레임워크를 개발하였다. 이들은 추상적인 수준에서 "시민 과학 데이터는 우수한가?"라고 묻는 대신, 실용적으로 유용한 질문인 "특정 종에 대해, 특정 지역에서, 이용 가능한 시민 과학 데이터가 통계적으로 유효한 추세 추정을 지원하는가?"를 제기한다.

이들의 프레임워크는 추세 탐지에 필요한 최소 임계값에 대비하여 공간적 커버리지, 시간적 연속성, 탐지 확률, 표본 크기를 평가한다. 호주 생물다양성 데이터에 적용한 결과는 복합적으로 나타났다. 즉, 시민 과학 데이터는 흔하고 광범위하게 분포하는 종의 모니터링에는 적합하지만, 희귀하거나 은밀하거나 지리적으로 제한된 종—바로 보전 측면에서 가장 중요한 종들—에 대해서는 불충분하다.

실제 데이터 품질 평가

Prenda et al. (2024)은 2회 인용을 기록하며, 이베리아반도 전역의 시민 과학 조류 관측 데이터에 대한 상세한 품질 평가를 수행하였다. 이들은 동정 정확도, 공간적 완전성, 시간적 일관성을 정량화하기 위해 시민 과학 기록과 전문가 조사를 비교하였다.

주요 연구 결과는 다음과 같다. 흔하고 식별이 쉬운 종에 대한 동정 정확도는 높지만(>95%), 형태가 유사한 종 쌍과 어린 새에 대해서는 상당히 낮아진다. 사진 검증이 통합된 모바일 애플리케이션은 체크리스트 전용 제출 방식에 비해 정확도를 크게 향상시킨다. 이 연구는 실무자들이 오랫동안 의심해 온 사실을 정량화하였다. 즉, 시민 과학 데이터의 품질은 매우 가변적이며, 플랫폼 설계, 검증 메커니즘, 그리고 특정 분류학적 난이도에 크게 의존한다.

품질 관리를 위한 인간-AI 협력

Prastowo et al. (2025)은 AI를 시민 과학 생물다양성 플랫폼에 직접 통합하는 아키텍처 프레임워크를 제안한다. 이들의 설계는 다중 분류군 문제, 즉 각기 다른 동정 난이도와 전문가 커뮤니티를 가진 식물, 동물, 균류, 곤충을 동시에 처리하는 플랫폼의 문제를 다룬다. 이 아키텍처는 세 가지 AI 레이어를 구현한다: (1) 검증된 관찰 데이터로 학습된 컴퓨터 비전 모델을 활용한 자동화된 종 동정, (2) 비정상적인 기록을 전문가 검토를 위해 표시하는 이상 탐지, (3) 자원봉사자의 입증된 전문성 수준에 따라 관찰 검증 작업을 배정하는 적응형 과제 라우팅. 이는 사후 품질 관리(데이터 수집 후 정제)에서 실시간 품질 보증(관찰 시점에서의 오류 방지)으로의 전환을 의미한다.

커뮤니티 중심 접근법

1회 피인용을 기록한 Omar et al. (2025)은 인도네시아 칼리만탄에서 진행된 시민 과학 이니셔티브를 기술하는데, 이 이니셔티브는 근본적으로 다른 접근법을 취한다. 개별 자원봉사자를 글로벌 플랫폼에 모집하는 대신, 기존 지역사회 거버넌스 구조 내에 생물다양성 모니터링을 내재화한다. 자원 관리를 위해 이미 산림을 순찰하는 지역 마을 주민들이 일상적인 활동에 표준화된 야생동물 관찰을 추가하는 방식이다.

이 커뮤니티 중심 모델은 서로 다른 데이터 패턴을 생성한다. 더 높은 시간적 규칙성(레크리에이션 목적의 탐조 활동과 비교되는 일일 순찰), 글로벌 플랫폼이 놓치는 오지 지역의 커버리지, 그리고 종 동정을 보조하는 지역 생태 지식의 통합이 그것이다. 그 트레이드오프는 규모이다—총 관찰 건수는 더 적지만 관찰 건당 맥락적 풍부성은 더 높다.

시민 과학 데이터 품질 프레임워크

품질 차원	과제	새로운 해결책
종 동정 정확도	분류군 난이도에 따라 상이함	AI 보조 동정 + 사진 검증
공간적 편향	도로/도시 근처에 집중	표본추출 부족 지역을 위한 목표 캠페인
시간적 편향	주말/공휴일 피크	정기 순찰이 있는 커뮤니티 기반 모델
분류학적 편향	매력적인 종의 과잉 대표	보고 부족 분류군을 위한 게임화
모니터링 적합성	일반 종에만 충분	시민 + 전문가 데이터를 결합한 하이브리드 설계

주목해야 할 동향

시민 과학과 AI의 수렴은 빠르게 가속화되고 있다. 플랫폼들이 수동적 데이터 수집에서 능동적 적응형 샘플링으로 전환할 것으로 예상된다. 즉, 지식 공백을 채우기 위해 가장 가치 있는 관찰이 무엇인지 파악하고 자원봉사자를 특정 위치, 시간, 목표 종으로 안내하는 AI 모델이 등장할 것이다. 인도네시아에서 입증된 커뮤니티 중심 모델은 스마트폰 기반 플랫폼이 연결성 및 문화적 장벽에 직면하는 글로벌 사우스에서 특히 중요한 의미를 가질 수 있다. 다음 개척지는 시민 과학 데이터를 공식 규제 프레임워크에 통합하는 것이다—환경 영향 평가 및 환경 보전법 집행에서 크라우드소싱된 관찰을 법적으로 허용되는 증거로 활용하는 것이다.

ORAA ResearchBrain을 통해 관련 연구를 탐색하라.

References (4)

[1] Backstrom, L.J., Callaghan, C., & Leseberg, N.P. (2024). Assessing adequacy of citizen science datasets for biodiversity monitoring. Ecology and Evolution, 14, e10857.

DOI Scholar

[2] Prenda, J., Dominguez-Olmedo, J.L., & Lopez-Lozano, E. (2024). Assessing citizen science data quality for bird monitoring in the Iberian Peninsula. Scientific Reports, 14.

DOI Scholar

[3] Prastowo, R.D., Manik, L., & Kushadiani, S.K. (2025). Participatory Human-AI Mechanisms for Multi-Taxon Citizen Science Biodiversity Platforms. Proc. ICETISI 2025, IEEE.

DOI Scholar

[4] Omar, M., Dennis, R., & Meijaard, E. (2025). Centering Communities in Biodiversity Monitoring and Conservation. Diversity, 17(10), 679.

DOI Scholar

Citizen Science Platforms and Public Participation in Research

Why It Matters

The Science

Evaluating Dataset Adequacy for Monitoring

Data Quality Assessment in Practice

Human-AI Collaboration for Quality Management

Community-Centered Approaches

Citizen Science Data Quality Framework

What To Watch

시민 과학 플랫폼과 연구에 대한 공공 참여

중요성

연구 내용

모니터링을 위한 데이터셋 적합성 평가

실제 데이터 품질 평가

품질 관리를 위한 인간-AI 협력

커뮤니티 중심 접근법

시민 과학 데이터 품질 프레임워크

주목해야 할 동향

References (4)

Explore this topic deeper