A contribution rest export view for sessions.

GET /programme/30/export/?format=api
HTTP 200 OK
Allow: GET, HEAD, OPTIONS
Content-Type: application/json
Vary: Accept

[
    {
        "title": "Towards an AI enabled forest systems platform",
        "url": "/submissions/117",
        "abstract": "<p>\n In forestry, having consistently relevant and correct information is critical towards environmentally conscionable decision making. Consulting commercially available or open-source Large Language Models (LLMs) in this decision-making process can be an effective way towards informed decision making. However, currently LLMs have demonstrated to be deficient in three critical areas: reliable information sources, lack of access to real world data, and ambiguity in their scientific reasoning ability.\n</p>\n<p>\n To overcome these shortcomings we opted to instantiate a curated knowledgebase containing information from relevant CC-0 research articles. With clearly defined constraints applied to each research article intended for ingestion in the knowledgebase, it becomes possible for the underlying LLM to produce feedback that is correct, concise and accurate. Some of the constraints of the knowledgebase are in place to adhere to European laws regarding the ethical use of AI as well as comply with copyright laws.\n</p>\n<p>\n Aside from having access to relevant research papers, having access to real world data is one of the cornerstones of the proposed framework. By utilizing calibrated level 1 data from multiple sensors, platforms and measuring devices, we can implement agentic RAG functionality to retrieve information about our area of interest.\n</p>\n<p>\n Lastly, reasoning abilities in Large Language Models (LLMs) are paramount and are considered an area of continuous research. While in the current state of the art LLMs convey a form of thinking or reasoning process, its efficacy is still hotly debated. In this research, the focus is on a more comprehensive and explainable scientific reasoning framework involving a decision-making AI agent. Wherein the AI agent will have to determine the goal of an input query, devise a logical methodology to resolve that goal, and execute the resolution using an array of pre-defined actions.\n</p>\n<p>\n This system has the potential to not only provide the end user with real world data but also provide deeper insights from scientific articles, derive scientific reasoning, as well as assist the user in decision making processes.\n</p>\n",
        "presentation_type": "Digital Poster/Demo",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "stephan",
                    "last_name": "playfair",
                    "orcid": null
                },
                "affiliation": [
                    "Hochschule für Nachhaltiche Entwicklung Eberswalde",
                    "Leibniz-Zentrum für Agrarlandschaftsforschung (ZALF)"
                ],
                "is_presenter": true
            }
        ],
        "submitter": "stephan playfair",
        "event": "Data Science Symposium 2025",
        "activity": "AK Portal & Viewer Technologies (Viewer)",
        "accepted": true,
        "license": "Creative Commons Zero v1.0 Universal (CC0-1.0)",
        "affiliations": [
            "Hochschule für Nachhaltiche Entwicklung Eberswalde",
            "Leibniz-Zentrum für Agrarlandschaftsforschung (ZALF)"
        ]
    },
    {
        "title": "Different Portals - Different Views on the Same Content",
        "url": "/submissions/106",
        "abstract": "<p>\n The proliferation of digital portals for accessing environmental and geoscientific data has significantly enhanced the ability of researchers, policymakers, and the public to retrieve and utilize critical information.\n</p>\n<p>\n Furthermore, metadata content can be harvested, which brings the added value that information is collected only once and then presented in different web presences. In this way, it is possible to tailor the presentation of metadata attributes to the relevant user groups - optimally presented according to their priorities.\n</p>\n<p>\n The Earth Data Portal (https://earth-data.de/), as a collaborative effort of the Helmholtz centers of the research field Earth and Environment enables querying data from multiple repositories, particularly from the Helmholtz research centers.\n</p>\n<p>\n In addition, umwelt.info (https://umwelt.info/de), operated by the German Environment Agency, offers a user-friendly interface of openly available environmental and nature protection data tailored for seamless access by the general public.\n</p>\n<p>\n The respective metadata content, on the other hand, is certainly best presented on the website of the source repository.\n</p>\n<p>\n This poster provides a general overview of the highlighted portals and delves into the specifics of their implementations, with a focus on the use of Persistent Identifiers (PIDs). PIDs play a crucial role in ensuring the long-term accessibility and citability of data, and the discussion will cover how each portal integrates PIDs to enhance data management and retrieval processes.\n</p>\n<p>\n Ultimately, the goal is to weigh how extensively the content depth should be represented, i.e., how granular the metadata attributes should be displayed, and what advantages an overarching perspective offers.\n</p>\n<p>\n This poster might be a suitable addition to the collaboration coffee “Don’t let your data be a needle in a haystack – efficient metadata curation for optimal findability”, see:\n <a href=\"https://dss2025.hereon.de/submissions/78\" rel=\"noopener noreferrer\" target=\"_blank\">\n  https://dss2025.hereon.de/submissions/78\n </a>\n <br/>\n The best way to do this is to run a parallel demo of the portals on a laptop or tablet.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Andrea",
                    "last_name": "Pörsch",
                    "orcid": "0000-0003-4502-6223"
                },
                "affiliation": [
                    "GFZ German Research Centre for Geosciences"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Emanuel",
                    "last_name": "Söding",
                    "orcid": "0000-0002-4467-642X"
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Maximilian",
                    "last_name": "Berthold",
                    "orcid": "0000-0003-1985-6426"
                },
                "affiliation": [
                    "Umweltbundesamt"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Roland",
                    "last_name": "Koppe",
                    "orcid": null
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Dorothee",
                    "last_name": "Kottmeier",
                    "orcid": null
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Sören",
                    "last_name": "Lorenz",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Andrea Pörsch",
        "event": "Data Science Symposium 2025",
        "activity": "AK Metadaten PIDs (Metadaten-PIDs)",
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GFZ German Research Centre for Geosciences",
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel",
            "Umweltbundesamt",
            "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
        ]
    },
    {
        "title": "Enabling Stakeholders to Maintain PID Metadata: Which Tools are Required?",
        "url": "/submissions/94",
        "abstract": "<p>\n At the Helmholtz Association, we aim to establish a well-structured and harmonized data space that connects information across distributed data infrastructures. Achieving this goal requires the standardization of dataset descriptions using appropriate metadata and the definition of a single source of truth for much of this metadata, from which different systems can draw. Persistent Identifiers (PIDs) in metadata enable the reuse of common information from shared sources. Broad adoption of PID types enhances interoperability and supports machine-actionable data. As a first step, we recommend implementing ROR, ORCID, IGSN, PIDINST, DataCite DOI, and Crossref DOI in our data systems.\n</p>\n<p>\n However, to practically record and integrate this information into our repositories, we must first identify the specific locations and stakeholders within institutions where this data is generated and maintained. We must also assess what kinds of tools and services the Association needs to provide to support seamless data management for its users.\n</p>\n<p>\n In this presentation, we highlight several tools we propose to implement across the organization, based on envisioned workflows. These include, for example, repository software, electronic lab notebooks (ELNs), terminology services, and other infrastructure components. Implementing these tools will support the various stakeholder groups in fulfilling their roles and will contribute to a fully integrated, FAIR-compliant data collection and publication system within the Helmholtz Association.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Emanuel",
                    "last_name": "Söding",
                    "orcid": "0000-0002-4467-642X"
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Andrea",
                    "last_name": "Pörsch",
                    "orcid": "0000-0003-4502-6223"
                },
                "affiliation": [
                    "GFZ German Research Centre for Geosciences"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Dorothee",
                    "last_name": "Kottmeier",
                    "orcid": null
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung",
                    "MARUM - Center for Marine Environmental Sciences, University Bremen"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Stanislav",
                    "last_name": "Malinovschii",
                    "orcid": "0009-0002-9792-6768"
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Sören",
                    "last_name": "Lorenz",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Emanuel Söding",
        "event": "Data Science Symposium 2025",
        "activity": "AK Metadaten PIDs (Metadaten-PIDs)",
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel",
            "GFZ German Research Centre for Geosciences",
            "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung",
            "MARUM - Center for Marine Environmental Sciences, University Bremen"
        ]
    },
    {
        "title": "iFDOs: A Key to Unlocking the Potential of Marine Image Datasets",
        "url": "/submissions/99",
        "abstract": "<p>\n Images and videos are usually a more vivid data source than raw scalar data. However, even in the era of analog photo albums, metadata was added to images to preserve their context for the future. Today, the marine community wants to analyze far larger datasets of videos and images using computers, which generally cannot easily understand the image content on their own. Therefore, researchers have to record the content and context of images in a structured format to enable automated, systematic and quantitative image analysis.\n</p>\n<p>\n The metadata file format FAIR Digital Objects for images (iFDOs) provides this structure for describing individual images and hole datasets. iFDOs primarily structure the answers to the five W's and H questions: Where were the images taken, by whom, why, when, how, and what is actually shown in the images or videos. Together, these pieces of information provide FAIRness (findability, accessibility, interoperability and reusability) to datasets.\n</p>\n<p>\n Researchers benefit from iFDO enhanced datasets, as they already provide the information necessary for data homogenization, enabling machine learning applications and mass-data-analysis. Data viewers and portals, such as\n <a href=\"https://marine-data.de\" rel=\"noopener noreferrer\" target=\"_blank\">\n  marine-data.de\n </a>\n , can increase the reach and impact of datasets by visualizing the datasets and making them findable using the context and content information given by iFDOs. For this to work, the data must first be published in FAIR data repositories such as\n <a href=\"https://pangaea.de/\" rel=\"noopener noreferrer\" target=\"_blank\">\n  PANGAEA\n </a>\n . However, the data repositories also benefit from iFDOs, as they can use metadata from the iFDO to simplify or even automate parts of the submission process saving time for the curators and researchers.\n</p>\n<p>\n Although the image datasets and their iFDO files could be created manually, our aim is to automate the process as much as possible, while also integrating existing metadata sources, such as the\n <a href=\"https://registry.o2a-data.de/\" rel=\"noopener noreferrer\" target=\"_blank\">\n  O2A Registry\n </a>\n and the\n <a href=\"https://extract.dship.awi.de/insightics/\" rel=\"noopener noreferrer\" target=\"_blank\">\n  DSHIP land system\n </a>\n . The standardized nature of iFDOs enables interoperability between curation and annotation tools, allowing researchers to choose the tools that fit their workflow. To achieve this, we are developing the\n <a href=\"https://o2a-data.de/tools/ifdo-creator/\" rel=\"noopener noreferrer\" target=\"_blank\">\n  iFDO Creator\n </a>\n , which allows researchers to create and edit iFDOs with a focus on usability. We are also contributing to the\n <a href=\"https://github.com/csiro-fair/marimba\" rel=\"noopener noreferrer\" target=\"_blank\">\n  marimba\n </a>\n framework, which offers a higher degree of automation for dataset and iFDO creation.\n</p>\n<p>\n We demonstrate that the metadata file standard iFDO benefits researchers at every step of the data lifecycle: measurement, dataset curation and creation, publication and analysis.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Karsten",
                    "last_name": "Schimpf",
                    "orcid": null
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Christopher",
                    "last_name": "Krämmer",
                    "orcid": "0009-0001-1994-0399"
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Karsten Schimpf",
        "event": "Data Science Symposium 2025",
        "activity": "MareHub",
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
        ]
    },
    {
        "title": "A Large-scale FAIR Research Data Infrastructure for Environmental Time Series Data",
        "url": "/submissions/109",
        "abstract": "<p>\n In Environmental Sciences, Time-series data is key to, for example, monitoring environmental processes and validating Earth system models. A major issue is the lack of a consistent data availability standard aligned with the FAIR principles, but the Helmholtz Earth and Environment DataHub is working with the Helmholtz Metadata Collaboration project STAMPLATE to address this.\n</p>\n<p>\n The seven participating research centers are building a large-scale infrastructure using the Open Geospatial Consortium's SensorThings API (STA) as the central data interface. It is linked to other community-driven tools, such as sensor and device management systems, data ingestion systems, and the Earth Data Portal (www.earth-data.de) with highly customizable viewers.\n</p>\n<p>\n Our custom, semantic metadata profile augments STA’s core data model with domain-specific information. This ensures that metadata entered in any user self-service is also displayed in the Earth Data Portal along with the ingested data.\n</p>\n<p>\n The operationalization of the framework and its subsequent integration into research data workflows is imminent, thereby rendering long-term, nationwide measurements spanning decades available. Concurrently, our RDM processes are undergoing a transformative shift, moving from manual, person-based to self-organized, digital-supported workflows.\n</p>\n<p>\n This poster presents the fundamental elements of our initiative and the associated challenges. It also encourages new domains to get involved.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Claas",
                    "last_name": "Faber",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Ulrich",
                    "last_name": "Loup",
                    "orcid": "0009-0005-1370-6226"
                },
                "affiliation": [
                    "Forschungszentrum Jülich"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Roland",
                    "last_name": "Koppe",
                    "orcid": null
                },
                "affiliation": [
                    "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Ralf",
                    "last_name": "Kunkel",
                    "orcid": null
                },
                "affiliation": [
                    "Forschungszentrum Jülich"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Nils",
                    "last_name": "Brinckmann",
                    "orcid": null
                },
                "affiliation": [
                    "GFZ German Research Centre for Geosciences"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Marc",
                    "last_name": "Hanisch",
                    "orcid": null
                },
                "affiliation": [
                    "GFZ German Research Centre for Geosciences"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Mihir",
                    "last_name": "Rambhia",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "David",
                    "last_name": "Schäfer",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz Centre for Environmental Research"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Christof",
                    "last_name": "Lorenz",
                    "orcid": "0000-0001-5590-5470"
                },
                "affiliation": [
                    "Karlsruhe Institute of Technology"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Claas Faber",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel",
            "Forschungszentrum Jülich",
            "Alfred-Wegener-Institut - Helmholtz-Zentrum für Polar- und Meeresforschung",
            "GFZ German Research Centre for Geosciences",
            "Helmholtz-Zentrum Hereon",
            "Helmholtz Centre for Environmental Research",
            "Karlsruhe Institute of Technology"
        ]
    },
    {
        "title": "Sharing is caring – how umwelt.info aims for easier access to environmental and nature protection data and information in Germany",
        "url": "/submissions/77",
        "abstract": "<p>\n We, the National Centre for Environmental and Nature Conservation Information, develop the portal\n <a href=\"https://umwelt.info\" rel=\"noopener noreferrer\" target=\"_blank\">\n  <i>\n   umwelt.info\n  </i>\n </a>\n which acts as central access point to all of Germany’s knowledge on the environment and nature protection. We integrate all openly accessible sources from municipalities, to federal states, civil society, economy and sciences into one flexible catalogue. This catalogue at its core will make it easier to find and share all kinds of data and information, like web applications, research data, or editorials. Here, we want to present our approach on how to combine this diverse data ecosystem into one searchable catalogue. Our approach is to develop an open-source software, where everybody can contribute to the development. We want to give insights into our development process, both in our front- and back-end.\n</p>\n<p>\n To support the open data community, we offer a native API, as well as an emulated CKAN interface. Furthermore, we create editorials and scripts about data availability with a current focus on water-related data sets in Germany. These products aim to help scientists to gain easier access, as well as information on reusability. Our current product can be found at\n <a href=\"https://umwelt.info\" rel=\"noopener noreferrer\" target=\"_blank\">\n  https://umwelt.info\n </a>\n and our current development stage at\n <a href=\"https://gitlab.opencode.de/umwelt-info\" rel=\"noopener noreferrer\" target=\"_blank\">\n  https://gitlab.opencode.de/umwelt-info\n </a>\n .\n</p>\n",
        "presentation_type": "Digital Poster/Demo",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Maximilian",
                    "last_name": "Berthold",
                    "orcid": "0000-0003-1985-6426"
                },
                "affiliation": [
                    "Umweltbundesamt"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Johannes",
                    "last_name": "Vogel",
                    "orcid": "0000-0002-0654-9673"
                },
                "affiliation": [
                    "Umweltbundesamt"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Anja",
                    "last_name": "Reineke",
                    "orcid": null
                },
                "affiliation": [
                    "Umweltbundesamt"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Jakob",
                    "last_name": "Deller",
                    "orcid": "0000-0001-8341-007X"
                },
                "affiliation": [
                    "Umweltbundesamt"
                ],
                "is_presenter": true
            }
        ],
        "submitter": "Maximilian Berthold",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Umweltbundesamt"
        ]
    },
    {
        "title": "Can you Predict the Data? A Workshop on Reproducibility for Language Modelling",
        "url": "/submissions/85",
        "abstract": "<p>\n The scientific landscape is continually shifting towards increasing amounts of data, demanding a greater investment of (time-) resources into the management, and (pre-) processing of these data. As a result, data literacy has become a key element for researchers from all domains. Additionally, interdisciplinary, multidisciplinary, and collaborative approaches are more essential now than ever before. The Rhine-Ruhr Center for Scientific Data Literacy (DKZ.2R) focuses on a combined methodological data literacy, integrating data science and machine learning skills, high performance computing and research data management competencies. Our main objective is to promote a holistic data literacy offering support for researchers in the form of trainings, consultings, data challenges and tools for data analysis and management.\n</p>\n<p>\n The availability of ever larger and more complex amounts of data requires comprehensive and methodological skills that researchers must often learn independently. These skills begin with the consideration of how scientific data should be collected, extending to questions about data processing applications, methods, infrastructure, and finally, publishing. The DKZ.2R focuses on offering support for researchers to break through data related hurdles in order to find cross-domain solutions and synergies.\n</p>\n<p>\n In our contribution we are presenting our workflow on the filtering of training data for Foundation Models which is to be applied as a use case in data challenges.\n <br/>\n Foundation Model performance depends directly on training data quality. Public data sources like Common Crawl contain significant noise, creating a bottleneck for developing specialized models. We address this by presenting a complete HPC workflow that enables researchers to curate web-scale text data and measure the impact on model performance.\n</p>\n<p>\n Our workflow equips users to process a 230 GB, 89-million-document German dataset from Common Crawl. Users analyze pre-computed quality signals, design custom filters to remove low-quality content, and execute a pipeline that automates data preparation. This pipeline handles metadata removal, data merging, indexing, and tokenization. It integrates the modalities framework for distributed training on GPUs and uses poetry for environment management, making the HPC operations reproducible.\n</p>\n<p>\n We will apply this workflow as an interactive data challenge in which the participants are presented with a specific task:\n <br/>\n - Start with the 89 million raw documents and their quality signals.\n <br/>\n - Design a filter that selects a high-quality subset of approximately 20 million documents, aiming to remove around 80% of the original data.\n <br/>\n - Execute the automated HPC pipeline to process the filtered subset.\n <br/>\n - Train a large language model (LLM) on the curated data and evaluate it to achieve a performance score on the Hellaswag benchmark that exceeds our random-sampling baseline.\n</p>\n<p>\n This demonstration offers a tangible blueprint for data-centric AI projects. We invite attendees to interact with the system, discuss strategies for large-scale data QA/QC (Quality Assurance/Quality Control), and explore how to adapt the framework for other scientific domains. The project directly embodies the symposium's goal of building bridges between an application (LLMs), a method (data filtering), and the infrastructure (HPC) required to connect them. The DKZ.2R is therefore aiming to make a meaningful and lasting impact by supporting researchers in promoting their methodological Data Literacy.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Nicolo",
                    "last_name": "Brandizzi",
                    "orcid": "0000-0002-3191-6623"
                },
                "affiliation": [
                    "Fraunhofer IAIS"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Qasid",
                    "last_name": "Saleem",
                    "orcid": null
                },
                "affiliation": [
                    "Fraunhofer IAIS"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Alicia",
                    "last_name": "Janz",
                    "orcid": "0000-0002-8540-7744"
                },
                "affiliation": [
                    "Forschungszentrum Jülich - IAS 9"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Stefan",
                    "last_name": "Sandfeld",
                    "orcid": "0000-0001-9560-4728"
                },
                "affiliation": [
                    "Forschungszentrum Jülich - IAS 9"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Johannes",
                    "last_name": "Leveling",
                    "orcid": "0000-0003-0603-4191"
                },
                "affiliation": [
                    "Fraunhofer IAIS"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Alicia Janz",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Fraunhofer IAIS",
            "Forschungszentrum Jülich - IAS 9"
        ]
    },
    {
        "title": "Building explorative Data Analysis Web-Frontends with DASF and the ESRI Experience Builder",
        "url": "/submissions/91",
        "abstract": "<p>\n Building web applications for the exploration of scientific data presents several challenges. These include the difficulty of accessing large volumes of data, the need for high-performance computing (HPC) resources to process and analyze such data, and the complexity of developing intuitive web frontends—especially for scientists who are not trained web developers. The Data Analysis Software Framework (DASF) addresses these challenges by enabling scientists to focus on Python-based backend development while seamlessly integrating HPC resources, even when these are not directly exposed to the internet. DASF also provides an automated mechanism to generate web frontends, significantly lowering the barrier to entry for scientific web application development (DOI:10.5194/egusphere-egu25-3120).\n</p>\n<p>\n Complementing this, the ESRI Experience Builder empowers users to create multi-page web applications and dashboards through a content management system, without requiring expertise in JavaScript-based frontend frameworks. This makes it an ideal platform for scientists to build rich, interactive data exploration tools. The newly developed DASF plugin for the Experience Builder (available at https://codebase.helmholtz.cloud/dasf/dasf-experiencebuilder-plugin) bridges these two ecosystems. It enables seamless access to data and computational resources from within Experience Builder applications, facilitating the creation of powerful, user-friendly scientific web portals.\n</p>\n",
        "presentation_type": "Digital Poster/Demo",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Philipp Sebastian",
                    "last_name": "Sommer",
                    "orcid": "0000-0001-6171-7716"
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Markus",
                    "last_name": "Benninghoff",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Philipp Sebastian Sommer",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Helmholtz-Zentrum Hereon"
        ]
    },
    {
        "title": "Advancing bathymetric reconstruction and forecasting using deep learning",
        "url": "/submissions/73",
        "abstract": "<p>\n Bathymetric evolution in coastal environments is driven by complex interactions between hydrodynamics, sediment transport, and morphodynamics. Traditional morphodynamic models often face challenges in capturing these dynamics, particularly in regions like the Wadden Sea, where the feedback mechanisms between physical processes and seabed changes are highly intricate. In this study, we explore the potential of deep learning techniques to address these limitations, using convolutional neural networks (CNN) for bathymetric reconstruction and convolutional long short-term memory (ConvLSTM) networks for forecasting. We applied these models to a dataset of bathymetric observations from the German Bight, which provides detailed coverage of the seabed from 1983 to 2012. First, we demonstrated that CNN effectively reconstructs spatial bathymetric patterns based on incomplete data inputs, achieving accurate reproductions of observed bathymetry with minimal reconstruction error, particularly in regions with active dynamics like tidal channels. Second, we used ConvLSTM for forecasting, training the model with past observations to predict bathymetry. The ConvLSTM model performed well, with an area-averaged root mean square error of 0.139 m. Our results indicate that deep learning techniques offer promising alternatives to traditional methods for both spatial reconstruction and forecasting of bathymetric changes. These models can improve predictions of seabed dynamics, which are critical for effective coastal management, navigation, and environmental protection.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Irem",
                    "last_name": "Yildiz",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Emil V.",
                    "last_name": "Stanev",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Joanna",
                    "last_name": "Staneva",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Irem Yildiz",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "Helmholtz-Zentrum Hereon"
        ]
    },
    {
        "title": "AqQua The Aquatic Life Foundation Project:   Quantifying Life at Scale in a Changing World",
        "url": "/submissions/84",
        "abstract": "<p>\n Aquatic life is crucial for human well-being, playing a key role in carbon sequestration, climate regulation, biodiversity conservation and nutrition. Plankton are the basis of aquatic food webs and sustainably sequester vast amounts of carbon from the atmosphere to the ocean’s interior. Impacts of climate change and pollution on plankton functioning and diversity not only impact fish resources that play a major role in human nutrition, but also the efficiency of the biological carbon pump. The critical role of aquatic life in biogeochemical cycles, climate regulation, conservation of aquatic biodiversity and human nutrition mandates precise mapping and monitoring. Distributed pelagic imaging techniques enable comprehensive global observation, providing critical insights for decision-making and carbon dioxide removal strategies. To this end, each day, millions of images of plankton and particles are taken by researchers around the globe, using a variety of imaging systems. Each individual image and its associated metadata can provide crucial information not only about the individual organism or particle, but also on the biodiversity and functioning of aquatic food webs, ecosystem status of the related water body, and its role in carbon sequestration. The Aquatic Life Foundation Project will, for the first time, combine billions of images acquired with a variety of devices across the globe. We will leverage this dataset for large-scale training of a foundational pelagic imaging model, which we will fine-tune for the highly relevant down-stream tasks of species classification, trait extraction and particulate organic carbon estimation. We will exploit model predictions to establish global maps of species biodiversity, ecosystem status, and carbon flux at unprecedented accuracy and granularity, thereby generating\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Ankita",
                    "last_name": "Vaswani",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Christina",
                    "last_name": "Hübers",
                    "orcid": null
                },
                "affiliation": [
                    "Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Helmholtz Imaging"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Dagmar",
                    "last_name": "Kainmüller",
                    "orcid": null
                },
                "affiliation": [
                    "Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Helmholtz Imaging"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Klas Ove",
                    "last_name": "Möller",
                    "orcid": null
                },
                "affiliation": [
                    "Helmholtz-Zentrum Hereon"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Lisa",
                    "last_name": "Mais",
                    "orcid": null
                },
                "affiliation": [
                    "Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Helmholtz Imaging"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Martin",
                    "last_name": "Schröder",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Peter",
                    "last_name": "Hirsch",
                    "orcid": null
                },
                "affiliation": [
                    "Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Helmholtz Imaging"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Rainer",
                    "last_name": "Kiko",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Ankita Vaswani",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution Non Commercial 4.0 International (CC-BY-NC-4.0)",
        "affiliations": [
            "Helmholtz-Zentrum Hereon",
            "Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Helmholtz Imaging",
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
        ]
    },
    {
        "title": "YOLO at Your Fingertips: Object Detection without Coding",
        "url": "/submissions/113",
        "abstract": "<p>\n Image-based data analysis is becoming increasingly important in Earth and environmental sciences – for example, in marine biodiversity monitoring, drone image evaluation, or automated habitat classification. Deep learning approaches such as YOLO (You Only Look Once) offer powerful tools for object detection, but their application is often limited by technical complexity and the need for programming skills.\n</p>\n<p>\n In this demo, I present a user-friendly graphical interface that allows researchers to upload their own image datasets and annotations, and then configure and run a complete object detection workflow – including data preparation, model training, validation, and testing – all without writing any code. This tool is particularly aimed at scientists working with image data who want to apply deep learning methods without needing expertise in machine learning frameworks.\n</p>\n<p>\n The demo will showcase typical use cases from marine biology, but the workflow is domain-agnostic and easily transferable to a wide range of Earth and environmental science applications. In the future, the tool will be available via BinderHub, allowing users to run the entire workflow directly in their web browser without any local installation.\n</p>\n",
        "presentation_type": "Digital Poster/Demo",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Judith",
                    "last_name": "Fischer",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            }
        ],
        "submitter": "Judith Fischer",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
        ]
    },
    {
        "title": "OSIS – Modernizing the Ocean Science Information System for marine Expedition Management",
        "url": "/submissions/107",
        "abstract": "<p>\n The\n <strong>\n  O\n </strong>\n cean\n <strong>\n  S\n </strong>\n cience\n <strong>\n  I\n </strong>\n nformation\n <strong>\n  S\n </strong>\n ystem (OSIS), developed at GEOMAR Helmholtz Centre for Ocean Research Kiel, is a central platform for managing and publishing metadata related to marine research expeditions, experiments and simulations. In response to evolving national needs and broader integration efforts, OSIS is currently undergoing a major transformation.\n</p>\n<p>\n A key driver of this development is its adoption by the Deutsche Allianz Meeresforschung (DAM) as the primary system for recording German research cruises across partner institutions. This expansion has necessitated enhanced interoperability, standardized metadata workflows, and scalable infrastructure.\n</p>\n<p>\n In parallel, OSIS is building stronger integration with the O2ARegistry, developed by AWI as a cross-institutional metadata registry for sensors and platforms. These efforts aim to support the reuse of expedition and instrument metadata in broader national and international contexts.\n</p>\n<p>\n As part of its modernization, OSIS will support single sign-on (SSO) via the Helmholtz AAI, enabling seamless and secure access for users across participating institutions.\n</p>\n<p>\n Another major focus is the automated import of planned expedition data from upstream expedition planning and logistics systems such as MFP (Marine Facilities Planning) and EIS (Expeditions-Informationssystem). These enhancements are designed to streamline data entry, reduce redundancy, and improve data consistency across the research lifecycle.\n</p>\n<p>\n This poster presents the architecture, current development roadmap, and emerging collaborations that position OSIS as a key component in Germany’s national marine data infrastructure.\n</p>\n",
        "presentation_type": "Poster",
        "session": "Postersession No. 1",
        "start": "2025-09-03T15:45:00+02:00",
        "duration": "00:45:00",
        "authors": [
            {
                "author": {
                    "first_name": "Gregor",
                    "last_name": "Börner",
                    "orcid": "0000-0002-2652-5863"
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": true
            },
            {
                "author": {
                    "first_name": "Stefan",
                    "last_name": "Lingner",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Carsten",
                    "last_name": "Schirnick",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            },
            {
                "author": {
                    "first_name": "Claas",
                    "last_name": "Faber",
                    "orcid": null
                },
                "affiliation": [
                    "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
                ],
                "is_presenter": false
            }
        ],
        "submitter": "Gregor Börner",
        "event": "Data Science Symposium 2025",
        "activity": null,
        "accepted": true,
        "license": "Creative Commons Attribution 4.0 International (CC-BY-4.0)",
        "affiliations": [
            "GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel"
        ]
    }
]