A contribution REST export view for sessions.

GET /programme/17/export/?format=api
HTTP 200 OK
Allow: GET, HEAD, OPTIONS
Content-Type: application/json
Vary: Accept

[
    {
        "title": "The chat bot that writes poetry can do climate analysis?",
        "url": "/submissions/121",
        "abstract": "<p>\n Climate research often requires substantial technical expertise. This involves managing data standards, various file formats, software engineering, and high-performance computing. Translating scientific questions into code that can answer them demands significant effort. The question is, why? Data analysis platforms like Freva (Kadow et al. 2021, e.g.,\n <a href=\"https://gems.dkrz.de\" rel=\"noopener noreferrer\" target=\"_blank\">\n  gems.dkrz.de\n </a>\n ) aim to enhance user convenience, yet programming expertise is still required. In this context, we introduce a large language model setup and chat bot interface for different core e.g. based on GPT-4/ChatGPT or DeepSeek, which enables climate analysis without technical obstacles, including language barriers. Not yet, we are dealing with climate LLMs for this purpose. Dedicated natural language processing methodologies could bring this to a next level. This approach is tailored to the needs of the broader climate community, which deals with small and fast analysis to massive data sets from kilometer-scale modeling and requires a processing environment utilizing modern technologies, but addressing still society after all, such as those in the Earth Virtualization Engines (EVE -\n <a href=\"https://eve4climate.org\" rel=\"noopener noreferrer\" target=\"_blank\">\n  eve4climate.org\n </a>\n ). Our interface runs on an High Performance Computer with access to PetaBytes of data - everything just a chat away.\n</p>\n",
        "presentation_type": "Keynote",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T13:30:00+02:00",
        "duration": "00:30:00",
        "authors": [
            {
                "author": {
                    "first_name": "Christopher",
                    "last_name": "Kadow",
                    "orcid": "0000-0001-6537-3690"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Gizem",
                    "last_name": "Ekinci",
                    "orcid": null
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Sebastian",
                    "last_name": "Willman",
                    "orcid": null
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Felix",
                    "last_name": "Oertel",
                    "orcid": "0009-0002-4992-723X"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Bianca",
                    "last_name": "Wentzel",
                    "orcid": "0000-0002-9218-5676"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Johannes",
                    "last_name": "Meuer",
                    "orcid": "0009-0000-1998-6699"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Martin",
                    "last_name": "Bergemann",
                    "orcid": "0000-0003-0604-4103"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Étienne",
                    "last_name": "Plésiat",
                    "orcid": "0000-0003-2725-9998"
                },
                "affiliation": [
                    "German Climate Computing Center (DKRZ)"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Philipp Sebastian Sommer",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": false,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "German Climate Computing Center (DKRZ)"
        ]
    },
    {
        "title": "ARGUS – Automated Recognition of Ghost Ships and Underwater Surveillance",
        "url": "/submissions/108",
        "abstract": "<p>\n The protection of critical underwater infrastructure, such as pipelines, data cables, or offshore energy assets, has become an emerging security challenge. Despite its growing importance, maritime infrastructure monitoring remains limited by high costs, insufficient coverage, and fragmented data processing workflows. The ARGUS project addresses these challenges by developing an AI-driven platform to support risk assessment and surveillance at sea.\n</p>\n<p>\n At its core, ARGUS integrates satellite-based Synthetic Aperture Radar (SAR) imagery, AIS vessel tracking data, and spatial information on critical assets into a unified data management system. A key functionality is detecting so-called \"ghost ships\" – vessels that deliberately switch off their AIS transponders – using object detection techniques on SAR imagery.\n</p>\n<p>\n At the same time, we are currently developing methods for underwater anomaly and change detection based on optical imagery. This work is still ongoing and focuses on identifying relevant structural or environmental changes in submerged infrastructure through automated image comparison and temporal analysis.\n</p>\n<p>\n In this talk, we present the architecture and workflows of the ARGUS system, including our use of deep learning (YOLO-based object detection) in the maritime context. We share insights into the current capabilities and limitations of AI models for maritime surveillance, especially in the context of underwater imaging, data scarcity, and domain-specific challenges.\n</p>\n<p>\n ARGUS illustrates how tailored AI applications, grounded in operational needs and real-world sensor data, can contribute to digital infrastructure protection and maritime situational awareness.\n</p>\n",
        "presentation_type": "Talk",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T14:00:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {
                "author": {
                    "first_name": "Judith",
                    "last_name": "Fischer",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            }
        ],
        "submitter": "Judith Fischer",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
        ]
    },
    {
        "title": "Comparative study to handle missing data in a machine learning model of tidal  salinities",
        "url": "/submissions/86",
        "abstract": "<p>\n Current AI cannot function without data, yet this precious resource is often underappreciated. In the context\n <br/>\n of machine learning, dealing with incomplete datasets are a widespread challenge. Large, consistent, and\n <br/>\n error-free data sets are essential for an optimally trained neural network. Complete and well-structured in-\n <br/>\n puts substantially contribute to both training, results and subsequent conclusions. As a result, using high-\n <br/>\n quality data improves the performance and the ability of neural networks to generalize.\n <br/>\n However, real-world datasets from field measurements can contain information leakage. Sensor failures,\n <br/>\n maintenance issues or inconsistent data collection can cause invalid ('NaN', Not a Number) values to appear\n <br/>\n in the neural network input matrices.\n <br/>\n Imputation techniques are an important step in data processing for handling missing values. Estimating\n <br/>\n 'NaN' values or replacing them with plausible values directly affects the quality of the input data and thus\n <br/>\n the effectiveness of the neural network.\n</p>\n<p>\n In this contribution, we present a neural network-based regression model (ANN regression), that explains\n <br/>\n the salt characteristics in the Elbe estuary. In this context, we focus on selecting appropriate imputation\n <br/>\n strategies.\n <br/>\n While traditional methods such as imputation by mean, median, or mode are simple and computationally\n <br/>\n efficient, they sometimes fail to preserve the underlying data distribution and relationships.\n <br/>\n More sophisticated statistical approaches, such as k-nearest neighbors (k-NN), iterative imputation methods\n <br/>\n like multiple imputation by chained equations (MICE), and matrix completion algorithms, help to replace\n <br/>\n 'NaN' values with more accurate estimates. Imputation can also be achieved using integrative machine learn-\n <br/>\n ing models in advance. These approaches make it possible to take complex, non-linear relationships within\n <br/>\n the data into account.\n <br/>\n We here demonstrate which imputation techniques for ANN regression of salt characteristics in the Elbe\n <br/>\n estuary distort the input matrices the least and improve model accuracy the most.\n</p>\n<p>\n Apart from the aforementioned data preprocessing steps for the input matrices, we investigate further strat-\n <br/>\n egies in which neural architectures are adapted without explicit imputation.\n <br/>\n An essential aspect here is the use of masking vectors, which indicate the presence, absence, or substitution\n <br/>\n of data points. This allows the ANN regression model to weight real and substituted values appropriately\n <br/>\n during training. In addition, techniques such as partial input masks, feature dropout, or developing models\n <br/>\n that are inherently robust to missing data are possible alternatives.\n <br/>\n Here, we compare what proves most promising when dealing with incomplete data: a complex data prepro-\n <br/>\n cessing workflow or direct adaptation of the neural network.\n</p>\n<p>\n This contribution provides a comprehensive overview of the methods and considerations that play a role in\n <br/>\n the imputation of 'NaN' values in input matrices for our machine learning-based solutions. We have learned\n <br/>\n that imputation strategies depend on specific data types and missing value patterns. This study emphasizes\n <br/>\n the importance of selecting an appropriate imputation strategy while taking computational efficiency, data\n <br/>\n integrity, and model robustness into account. This can improve the robustness and reliability of the ANN\n <br/>\n analyses when dealing with incomplete data.\n</p>\n",
        "presentation_type": "Talk",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T14:15:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {
                "author": {
                    "first_name": "Franziska",
                    "last_name": "Lauer",
                    "orcid": "0009-0001-4780-735X"
                },
                "affiliation": [
                    "Bundesanstalt für Wasserbau - Hamburg"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Georgii",
                    "last_name": "Gusak",
                    "orcid": null
                },
                "affiliation": [
                    "Ocean and Climate Physics, Universität Hamburg"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Frank",
                    "last_name": "Kösters",
                    "orcid": null
                },
                "affiliation": [
                    "Bundesanstalt für Wasserbau - Hamburg"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Franziska Lauer",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Bundesanstalt für Wasserbau - Hamburg",
            "Ocean and Climate Physics, Universität Hamburg"
        ]
    },
    {
        "title": "Spatial Prediction Framework: From Point Measurements to Grid-Scale Estimates using Machine Learning models",
        "url": "/submissions/66",
        "abstract": "<p>\n We present a comprehensive machine learning framework for predicting spatially distributed geographical data from point measurements. The framework takes as input a set of geographical features at a specified grid resolution (e.g., 5 arc-minute scale) and corresponding point measurements with their spatial coordinates and target values. The framework trains and evaluates multiple machine learning models, including both tree-based methods (Random Forest, XGBoost, CatBoost) and deep learning architectures (feed forward neural networks, TabPFN[1]), to identify the optimal predictive model for the given dataset.\n <br/>\n The framework incorporates hyperparameter search(depth and width) for deep learning models and systematic parameter search for tree-based models (e.g., number of estimators). This ensures robust model selection and performance optimization across different geographical contexts and data characteristics. The framework outputs the best-performing model along with comprehensive performance metrics and uncertainty estimates.\n <br/>\n As a non-trivial application, we demonstrate the framework's effectiveness in predicting total organic carbon (TOC) concentrations[2] and sedimentation rates in the ocean. This involves integrating features from both the sea surface and seafloor, encompassing a diverse array of oceanographic, geological, geographic, biological, and biogeochemical parameters. The framework successfully identifies the most suitable model architecture and hyperparameters for this complex spatial prediction task, providing both high accuracy and reliable uncertainty quantification.\n</p>\n<p>\n <strong>\n  References\n </strong>\n</p>\n<p>\n [1] Hollmann, N., Müller, S., Purucker, L.\n <i>\n  et al.\n </i>\n Accurate predictions on small data with a tabular foundation model.\n <i>\n  Nature\n </i>\n <strong>\n  637\n </strong>\n , 319–326 (2025).\n <a href=\"https://doi.org/10.1038/s41586-024-08328-6\" rel=\"noopener noreferrer\" target=\"_blank\">\n  https://doi.org/10.1038/s41586-024-08328-6\n </a>\n</p>\n<p>\n [2] Parameswaran, N., Gonzalez, E., Burwicz-Galerne, E., Braack, M., &amp; Wallmann, K. (2025). NN-TOC v1: global prediction of total organic carbon in marine sediments using deep neural networks. Geoscientific Model Development, 18(9), 2521–2544.\n</p>\n",
        "presentation_type": "Talk",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T14:30:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {
                "author": {
                    "first_name": "Malte",
                    "last_name": "Braack",
                    "orcid": "0000-0003-1730-6316"
                },
                "affiliation": [
                    "Christian Albrecht University of Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Naveen Kumar",
                    "last_name": "Parameswaran",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Klaus",
                    "last_name": "Wallmann",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Malte Braack",
        "event": "Data Science Symposium 2025",
        "activity": "MareHub",
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Christian Albrecht University of Kiel",
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
        ]
    },
    {
        "title": "Downscaling Regional CAMS Reanalysis to the Urban Scale with XGBoost and Gaussian Processes: A Transferable Machine Learning Framework",
        "url": "/submissions/68",
        "abstract": "<p>\n Urban-scale air quality data is crucial for exposure assessment and decision-making in cities. However, high-resolution Eulerian Chemistry Transport Models (CTMs) with street-scale resolutions (100 m x 100 m), while process-based and scenario-capable, are computationally expensive and require city-specific emission inventories, meteorological fields and boundary concentrations. In contrast, machine learning (ML) offers a scalable and efficient alternative to enhance spatial resolution using existing regional-scale (1 km - 10 km grid resolutions) reanalysis datasets.\n</p>\n<p>\n We present a reproducible ML framework that downscales hourly NO\n <sub>\n  2\n </sub>\n data from the CAMS Europe ensemble (~10 km resolution) to 100 × 100 m\n <sup>\n  2\n </sup>\n resolution, using 11 years of data (2013–2023) for Hamburg. The framework integrates satellite-based and modelled inputs (CAMS, ERA5-Land), spatial predictors (CORINE, GHSL, OSM), and time indicators. Two ML approaches are employed: XGBoost for robust prediction and interpretability (via SHAP values), and Gaussian Processes for quantifying spatial and temporal uncertainty.\n</p>\n<p>\n The downscaling is evaluated through random, time-based and leave-site-out validation approaches. Results demonstrate good reproduction of observed spatial and temporal NO\n <sub>\n  2\n </sub>\n patterns, including traffic peaks and diurnal/seasonal trends. The trained models generate over 160 million hourly predictions for Hamburg with associated uncertainty fields. Although developed for Hamburg, the framework has been successfully tested in multiple European cities without reconfiguration, highlighting its portability and robustness. All code and workflows are openly available, contributing to reproducibility and reusability in urban air quality research.\n</p>\n<p>\n This work bridges Earth system science, infrastructure (via open datasets and standards), and data methods (XGBoost, GPs, SHAP), exemplifying how transparent, transferable ML can support urban digital twins and data-driven environmental policy.\n</p>\n",
        "presentation_type": "Talk",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T14:45:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {
                "author": {
                    "first_name": "Martin Otto Paul",
                    "last_name": "Ramacher",
                    "orcid": "0000-0001-5813-2258"
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Paul",
                    "last_name": "Keil",
                    "orcid": "0000-0002-6502-4148"
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Martin Otto Paul Ramacher",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Helmholtz-Zentrum Hereon"
        ]
    },
    {
        "title": "Reconstructing Coastal Sediment Dynamics with 4DVarNet: Neural Data Assimilation Leveraging Models and Satellite Data",
        "url": "/submissions/74",
        "abstract": "<p>\n This study presents an end-to-end deep learning framework, 4DVarNet, for reconstructing high-resolution spatiotemporal fields of suspended particulate matter (SPM) in the German Bight under realistic satellite data gaps. Using a two-phase approach, the network is first pretrained on gap-free numerical model outputs masked with synthetic cloud patterns, then fine-tuned against sparse CMEMS observations with an additional independent validation mask. The framework architecture embeds a trainable dynamical prior and a convolutional LSTM solver to iteratively minimize a cost function that balances data agreement with physical consistency. The framework is applied for one year data (2020) of real observations (CMEMS) and co-located model simulations, demonstrating robust performance under operational conditions. Reconstructions capture major spatial patterns with correlation R2 = 0.977 and 50% of errors within ± 0.2 mg/L, even when 27% of days lack any observations. Sensitivity experiments reveal that removing 60% of available data doubles RMSE and smooths fine-scale SPM spatial features. Moreover, increasing the assimilation window reduces edge discontinuities between the data-void area and the adjacent data-rich region, whereas degrades sub-daily variability. Extending 4DVarNet to higher temporal resolution (hourly) reconstruction will require incorporating tidal dynamics to account for SPM resuspension, enabling real-time sediment transport forecasting in coastal environments.\n</p>\n",
        "presentation_type": "Talk",
        "session": "Artificial Intelligence/ Machine Learning methods in Earth System Sciences",
        "start": "2025-09-03T15:00:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {
                "author": {
                    "first_name": "Wei",
                    "last_name": "Chen",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Nga",
                    "last_name": "Nguyen",
                    "orcid": null
                },
                "affiliation": [
                    "IMT Atlantique Bretagne-Pays de la Loire Campus de Brest Technople"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Johannes",
                    "last_name": "Pein",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Frédéric",
                    "last_name": "Jourdin",
                    "orcid": null
                },
                "affiliation": [
                    "Service hydrographique et oceanographique de la marine (Shom)"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Joanna",
                    "last_name": "Staneva",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Ronan",
                    "last_name": "Fablet",
                    "orcid": null
                },
                "affiliation": [
                    "IMT Atlantique Bretagne-Pays de la Loire Campus de Brest Technople"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Wei Chen",
        "event": "Data Science Symposium 2025",
        "activity": "MareHub",
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Helmholtz-Zentrum Hereon",
            "IMT Atlantique Bretagne-Pays de la Loire Campus de Brest Technople",
            "Service hydrographique et oceanographique de la marine (Shom)"
        ]
    }
]
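
The response above is a flat JSON list of contributions, each with an ISO 8601 "start", an "HH:MM:SS" "duration", and a nested "authors" list. A minimal sketch of consuming it in Python follows. The function names (parse_duration, summarize) and the SAMPLE payload are hypothetical helpers for illustration only. SAMPLE is a trimmed copy of two entries from the response, with the abstract and several other fields omitted. Fetching the data over HTTP is not shown, since the host and any authentication are not part of this export.

```python
import json
from datetime import datetime, timedelta


def parse_duration(value: str) -> timedelta:
    """Parse an "HH:MM:SS" duration string as it appears in the export."""
    hours, minutes, seconds = (int(part) for part in value.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def summarize(contributions: list) -> list:
    """Return one compact row per contribution: title, start, end, presenters."""
    rows = []
    for item in contributions:
        # fromisoformat keeps the UTC offset given in the "start" field.
        start = datetime.fromisoformat(item["start"])
        end = start + parse_duration(item["duration"])
        presenters = [
            f'{entry["author"]["first_name"]} {entry["author"]["last_name"]}'
            for entry in item["authors"]
            if entry["is_presenter"]
        ]
        rows.append(
            {"title": item["title"], "start": start, "end": end, "presenters": presenters}
        )
    return rows


# Hypothetical trimmed payload; values copied from the response above.
SAMPLE = json.loads("""
[
    {
        "title": "ARGUS",
        "start": "2025-09-03T14:00:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {"author": {"first_name": "Judith", "last_name": "Fischer", "orcid": null},
             "is_presenter": true}
        ]
    },
    {
        "title": "Comparative study to handle missing data",
        "start": "2025-09-03T14:15:00+02:00",
        "duration": "00:15:00",
        "authors": [
            {"author": {"first_name": "Franziska", "last_name": "Lauer", "orcid": "0009-0001-4780-735X"},
             "is_presenter": true},
            {"author": {"first_name": "Georgii", "last_name": "Gusak", "orcid": null},
             "is_presenter": true}
        ]
    }
]
""")

if __name__ == "__main__":
    for row in summarize(SAMPLE):
        print(row["start"].time(), "-", row["end"].time(), row["title"], row["presenters"])
```

Note that "orcid" may be null and that a contribution can list more than one presenter, so consumers should not assume exactly one presenter per entry.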