BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//talks.osgeo.org//VJPFRN
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-foss4g-2022-academic-track-VJPFRN@talks.osgeo.org
DTSTART;TZID=CET:20220825T120000
DTEND;TZID=CET:20220825T123000
DESCRIPTION:Free and open source software for geospatial analysis (FOSS4G) 
 supports burgeoning possibilities for practicing open and computationally 
 reproducible human-environment and geographical research (Singleton et al 
 2016).\nOpen and reproducible research practices may accelerate the pace o
 f scientific discovery and enhance the scientific community's functions of
  knowledge verification\, correction\, and diffusion (Rey 2009\, Kedron an
 d Holler 2022).\nGeospatial metadata provides the foundation for reproduci
 bility and open science and accordingly\, requires more support in open so
 urce geospatial software.\nFollowing Wilson and others' (2021) five star g
 uide for reproducibility\, researchers can achieve four stars by conductin
 g research with open data and software and documenting metadata according 
 to the standards of the International Organization for Standardization (IS
 O) and OGC (Open Geospatial Consortium).\nFor Tullis and Kar (2021)\, meta
 data is the key to documenting the provenance of research data artifacts\,
  preserving information about every detail of data creation and transforma
 tion.\nWilkinson and others' (2016) FAIR Guiding Principles for scientific
  data management enumerate functions for metadata in each of the principle
 s for research: findable\, accessible\, interoperable\, and reusable.\nHow
 ever\, open source geospatial software platforms generally lack the tools 
 necessary for mainstreaming geospatial metadata into the full research wor
 kflow in support of more efficient research work and enhanced reproducibil
 ity and open science.\nThis research on metadata is part of a larger human
 -environment and geographical sciences reproducibility and replicability (
 github.com/HEGSRR) project aimed at conducting formal reproduction and rep
 lication studies in the geographical sciences and integrating reproducibil
 ity into undergraduate and graduate level curricula in research methods.\n
 \nFollowing the National Academies of Science\, Engineering and Medicine (
 NASEM\, 2019)\, a reproduction study aims find the same results using the 
 same data and methodology as a published study.\nA replication study aims 
 to test the findings of a published study by collecting new data and follo
 wing a similar methodology\, which may intentionally modify one or more re
 search parameters.\nTogether\, reproduction and replication studies offer 
 a deep understanding of the original research\, test its credibility and g
 eneralizability\, and enhance the self-corrective mechanisms of the scient
 ific community.\nMetadata is information about data\, including essential 
 contextual information about the data's spatial structure\, attributes\, c
 reation\, maintenance\, access\, licensing\, and provenance.\n\nA key comp
 onent of reproducible research is an executable research compendium contai
 ning all of the data\, code\, and narrative required to compile a research
  publication from raw data (Nust and Pebesma 2021 and the Opening Reproduc
 ible Research Project).\nComputational notebooks like Jupyter notebooks or
  R Markdown are commonly used to interweave narrative with code in executa
 ble compendia.\nIn order to maximize replicability and inferential power\,
  the research compendium should begin with a pre-registered research plan 
 prior to data collection\, requiring researchers to fully specify metadata
  for all of the research data and analyzes that they intend to create (Nos
 ek et al. 2018).\nIt is recommended to store compendia in version tracking
  systems like Git in order to preserve a full history of changes to the re
 search project.\nFinally\, the compendium should be published parallel to 
 academic publications so that other researchers can independently re-run\,
  check and verify the analysis\, or incorporate the research in future pro
 jects.\nIn order to maximize the findability and legibility of the researc
 h compendium for both humans and machines\, the overall repository and eac
 h of its components must be meticulously documented with metadata accordin
 g to international standards (Wilkinson et al. 2016\, Wilson et al. 2021).
 \n\nIn this three-part research paper\, we focus on metadata in research c
 ompendia and related research products through all phases of the research 
 workflow.\nFirst\, we specify ideal requirements of geospatial metadata in
  support of reproducible research workflows and open science.\nWe consider
  metadata needs at each step of the research process\, including proposal 
 writing\, pre-analysis registration\, ethics review board approval\, data 
 collection and analysis\, publishing\, and reproducing published research.
 \nThe metadata software needs assessment is based on literature review of 
 reproducibility and open science\, and on teaching and practicing reproduc
 ibility with geographic methods.\n\nSecond\, we review the Dublin Core and
  International Organization for Standardization (ISO) geospatial metadata 
 standards and popular open source platforms for geospatial research and th
 eir support for the requirements of geospatial metadata articulated in the
  first part.\nThe scope of the review includes metadata functionality avai
 lable through spatial analysis software platforms\, including R\, Python\,
  QGIS\, GRASS and SAGA\; and it also includes metadata or cataloging tools
 \, including GeoNetwork\, GeoNode\, the USGS Metadata Wizard\, and mdedito
 r.\n\nFinally\, we articulate a vision for open source geospatial metadata
  software development in support of open and reproducible human-environmen
 t and geographical research.\nIn this vision\, metadata software tools sha
 ll integrate with executable research compendium to assist researchers wit
 h their workflow from inception to publishing and archiving.\nThe vision b
 uilds off our HEGSRR project\, in which we independently reproduce and rep
 licate published studies with open source geospatial software\, integrate 
 reproduction and replication studies into project-based geographic informa
 tion science courses\, and develop curricula and infrastructure for reprod
 ucible research.\nEach section of the paper is thus supplemented with expe
 riences and examples drawn from the HEGSRR project.\nOf particular relevan
 ce\, we have already completed seven reproduction or replication studies w
 ith graduate and undergraduate students using open source geospatial softw
 are\, encountering numerous barriers caused by inadequate use or documenta
 tion of metadata in research planning\, execution\, and archiving.\nWe hav
 e also developed a template Git repository compendium for reproducible res
 earch and prototyped its use in our studies and teaching\, discovering sof
 tware barriers to documenting metadata and opportunities to integrate meta
 data into more efficient and reproducible research practices.\n\nWe have s
 elected FOSS4G for this research paper in hopes of reaching both the acade
 mic audience and developer audience at the conference. We hope to raise aw
 areness in the academic audience of the critical importance of geospatial 
 metadata in each stage of the research workflow. We hope to raise interest
  in the open source geospatial software community for collaboration on imp
 roved support for geospatial metadata in research workflows.
DTSTAMP:20260413T203518Z
LOCATION:Room Hall 3A
SUMMARY:Mainstreaming metadata into research workflows to advance reproduci
 bility and open geographic information science - Joseph Holler
URL:https://talks.osgeo.org/foss4g-2022-academic-track/talk/VJPFRN/
END:VEVENT
END:VCALENDAR
