BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//talks.staging.osgeo.org//foss4g-2022//talk//BSY973
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-foss4g-2022-BSY973@talks.staging.osgeo.org
DTSTART;TZID=CET:20220825T100000
DTEND;TZID=CET:20220825T100500
DESCRIPTION:The Apache Arrow (https://arrow.apache.org/) project specifies 
 a standardized language-independent columnar memory format. It enables sha
 red computational libraries\, zero-copy shared memory\, streaming messagin
 g and interprocess communication without serialization overhead\, etc. Now
 adays\, Apache Arrow is supported by many programming languages. \n\nGeosp
 atial data often comes in tabular format\, with one (or multiple) column w
 ith feature geometries and additional columns with feature attributes. Thi
 s is a perfect match for Apache Arrow. Defining a standard and efficient w
 ay to store geospatial data in the Arrow memory layout (https://github.com
 /geopandas/geo-arrow-spec/) can help interoperability between different to
 ols and enables us to tap into the full Apache Arrow ecosystem:\n\n- Effic
 ient\, columnar data formats. Apache Arrow contains an implementation of t
 he Apache Parquet file format\, and thus gives us access to GeoParquet (ht
 tps://github.com/opengeospatial/geoparquet) and functionalities to interac
 t with this format in partitioned and/or cloud datasets.\n- The Apache Arr
 ow project includes several mechanisms for fast data exchange (the IPC mes
 sage format and Arrow Flight for transferring data between processes and m
 achines\; the C Data Interface for zero-copy sharing of data between indep
 endent runtimes running in the same process). Those mechanisms can make it
  easier to efficiently share data between GIS tools such as GDAL and QGIS 
 and bindings in Python\, R\, Rust\, with web-based applications\, etc.\n- 
 Several projects in the Apache Arrow community are working on high-perform
 ance query engines for computing on in-memory and bigger-than-memory data.
  Being able to store geospatial data in Arrow will make it possible to ext
 end those engines with spatial queries.
DTSTAMP:20260405T011334Z
LOCATION:Room 4
SUMMARY:Geospatial and Apache Arrow: accelerating geospatial data exchange 
 and compute - Joris van den Bossche
URL:https://talks.staging.osgeo.org/foss4g-2022/talk/BSY973/
END:VEVENT
END:VCALENDAR
