BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//talks.staging.osgeo.org//foss4g-2024-academic-track//spe
 aker//XTCD39
BEGIN:VTIMEZONE
TZID:-03
BEGIN:STANDARD
DTSTART:20000101T000000
RRULE:FREQ=YEARLY;BYMONTH=1
TZNAME:-03
TZOFFSETFROM:-0300
TZOFFSETTO:-0300
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-foss4g-2024-academic-track-VQK7SK@talks.staging.osgeo.org
DTSTART;TZID=-03:20241204T143000
DTEND;TZID=-03:20241204T150000
DESCRIPTION:OGC Standards and OSGeo Projects have been widely applied to di
 fferent kinds of geospatial data and extended for the implementation of ge
 ospatial data science environments. However\, there’s no review comprehe
 nsively summarising and discussing the progress of these open source techn
 ologies for publishing geospatial databases on the Web. The proposed Syste
 matic Technology Review is a stylized version of the Systematic Literature
  Review\, covering the documentation of OGC Standards and OSGeo Projects. 
 The search strategy consisted of screening OGC and OSGeo websites for the 
 latest version of OGC Standards' implementation (or community) specificati
 on and OSGeo Projects' developers manual. This review considered the techn
 ologies published until June 2024. A total of 80 OGC Standards and 52 OSGe
 o Projects were identified. To recognize the main topics of each technolog
 y in detail\, the documentation was analysed by Latent Dirichlet Allocatio
 n - LDA using the Scikit-learn package in Python. Grid-search was used to 
 find the optimal hyperparameters for the number of components and the deca
 y of the learning rate. With the maximum number of iterations set to 100\,
  the best model was obtained with 8 components and 0.1 learning decay. The
 n\, the most probable topic was predicted for each documentation. The netw
 ork of similarities arising from LDA was exported to Gephi for visualisati
 on\, where ForceAtlas2 layout algorithm was used to create a weighted undi
 rected graph\, keeping only edges with weight greater than 0.33. The lates
 t developments in terms of the OGC Standards for data encoding took place 
 in the GeoPackage standard. For accessing\, processing or visualising data
 \, the trend was the development of OGC API related standards. However\, G
 ML is the most implemented OGC Standard for data encoding in OSGeo Project
 s\, along with Web Services like WMS\, WFS\, WCS and WPS for accessing\, p
 rocessing and visualising the data. Community Standards represented less t
 han 10% of the OGC Standards\, while Community Projects represented more t
 han 50% of the OSGeo Projects. The adoption of these technologies were eva
 luated based on the number of Github forks and stars\, as well as Docker p
 ulls. With more than 100 million pulls\, PostGIS is the most downloaded OS
 Geo Project\, followed by GeoNetwork and Open Data Cube\, with more than 5
  million pulls each. But many of the analysed technologies lacked an offic
 ial Docker image. In terms of Github forks and stars\, the most shared and
  favoured OSGeo project is OpenLayers\, followed by QGIS and GDAL. The Lat
 ent Dirichlet Allocation analyses found eight topics underlying the OGC St
 andards and OSGeo Projects. The keywords of the top four topics were confo
 rmance\, layer\, tile and response. Based on the analysis of the Implement
 ation Standard and Community Standard documentations\, the most similar OG
 C Standards were OGC API - Tiles and Two Dimensional Tile Matrix Set. On t
 he other hand\, based on the analysis of developer manuals\, the most simi
 lar OSGeo Projects were GDAL and MDAL. The strongest relationship of an OG
 C Standard and an OSGeo Project occurred between WPS and ZOO-project\, fol
 lowed by WPS and PyWPS. Overall\, the OSGeo Project most closely related t
 o the entire set of OGC Standards was rasdaman\, followed by MapServer and
  deegree. Notably\, a large group of standards and projects showed scarce 
 connections\, mainly those that are domain specific\, like PubSub\, LAS an
 d PipelineML among the OGC Standards and like Giswater and MobilityDB amon
 g the OSGeo Community Projects\, or those that are the basis of the other 
 technologies\, like Simple Features\, WKT and Coordinate Transformation st
 andards and like PROJ and PostGIS projects. The presented Systematic Techn
 ology Review can promote the evolution of the current OGC Standards and OS
 Geo Projects\, as well as the development of new technologies. It can also
  support developers of new solutions in the geospatial community. Specific
 ally\, this review is the basis for the proposal of a new library for the 
 integrated access of INPE’s environmental databases. An important limita
 tion of this systematic review is that it was not possible to find any PDF
  documentation for almost 20% of the existing technologies\, which were ex
 cluded from the analysis.
DTSTAMP:20260514T031720Z
LOCATION:Room V
SUMMARY:Systematic Technology Review of OGC Standards and OSGeo Projects - 
 Luiz Fernando Satolo
URL:https://talks.staging.osgeo.org/foss4g-2024-academic-track/talk/VQK7SK/
END:VEVENT
END:VCALENDAR
