Type of resources
Available actions
Topics
Keywords
Contact for the resource
Provided by
Years
Formats
Representation types
Update frequencies
status
Service types
Scale
Resolution
-
Overview: ERA5-Land is a reanalysis dataset providing a consistent view of the evolution of land variables over several decades at an enhanced resolution compared to ERA5. ERA5-Land has been produced by replaying the land component of the ECMWF ERA5 climate reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. Reanalysis produces data that goes several decades back in time, providing an accurate description of the climate of the past. Processing steps: The original hourly ERA5-Land air temperature 2 m above ground and dewpoint temperature 2 m data has been spatially enhanced from 0.1 degree to 30 arc seconds (approx. 1000 m) spatial resolution by image fusion with CHELSA data (https://chelsa-climate.org/). Subsequently, the temperature time series have been aggregated on a daily basis. From these, daily relative humidity has been calculated for the time period 01/2000 - 07/2021. Relative humidity (rh2m) has been calculated from air temperature 2 m above ground (Ta) and dewpoint temperature 2 m above ground (Td) using the formula for saturated water pressure from Wright (1997): maximum water pressure = 611.21 * exp(17.502 * Ta / (240.97 + Ta)) actual water pressure = 611.21 * exp(17.502 * Td / (240.97 + Td)) relative humidity = actual water pressure / maximum water pressure The resulting relative humidity has been aggregated to decadal averages. Each month is divided into three decades: the first decade of a month covers days 1-10, the second decade covers days 11-20, and the third decade covers days 21-last day of the month. Resultant values have been converted to represent percent * 10, thus covering a theoretical range of [0, 1000]. File naming scheme (YYYY = year; MM = month; dD = number of decade): ERA5_land_rh2m_avg_decadal_YYYY_MM_dD.tif Projection + EPSG code: Latitude-Longitude/WGS84 (EPSG: 4326) Spatial extent: north: 82:00:30N south: 18N west: 32:00:30W east: 70E Spatial resolution: 30 arc seconds (approx. 1000 m) Temporal resolution: Decadal Pixel values: Percent * 10 (scaled to Integer; example: value 738 = 73.8 %) Software used: GDAL 3.2.2 and GRASS GIS 8.0.0 Original dataset license: https://apps.ecmwf.int/datasets/licences/copernicus/ Processed by: mundialis GmbH & Co. KG, Germany (https://www.mundialis.de/) Reference: Wright, J.M. (1997): Federal meteorological handbook no. 3 (FCM-H3-1997). Office of Federal Coordinator for Meteorological Services and Supporting Research. Washington, DC Acknowledgements: This study was partially funded by EU grant 874850 MOOD. The contents of this publication are the sole responsibility of the authors and don't necessarily reflect the views of the European Commission.
-
Overview: 313: Vegetation formation composed principally of trees, including shrub and bush understory,where neither broad-leaved nor coniferous species predominate. Traceability (lineage): This dataset was produced with a machine learning framework with several input datasets, specified in detail in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ) Scientific methodology: The single-class probability layers were generated with a spatiotemporal ensemble machine learning framework detailed in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ). The single-class uncertainty layers were calculated by taking the standard deviation of the three single-class probabilities predicted by the three components of the ensemble. The HCL (hard class) layers represents the class with the highest probability as predicted by the ensemble. Usability: The HCL layers have a decreasing average accuracy (weighted F1-score) at each subsequent level in the CLC hierarchy. These metrics are 0.83 at level 1 (5 classes):, 0.63 at level 2 (14 classes), and 0.49 at level 3 (43 classes). This means that the hard-class maps are more reliable when aggregating classes to a higher level in the hierarchy (e.g. 'Discontinuous Urban Fabric' and 'Continuous Urban Fabric' to 'Urban Fabric'). Some single-class probabilities may more closely represent actual patterns for some classes that were overshadowed by unequal sample point distributions. Users are encouraged to set their own thresholds when postprocessing these datasets to optimize the accuracy for their specific use case. Uncertainty quantification: Uncertainty is quantified by taking the standard deviation of the probabilities predicted by the three components of the spatiotemporal ensemble model. Data validation approaches: The LULC classification was validated through spatial 5-fold cross-validation as detailed in the accompanying publication. Completeness: The dataset has chunks of empty predictions in regions with complex coast lines (e.g. the Zeeland province in the Netherlands and the Mar da Palha bay area in Portugal). These are artifacts that will be avoided in subsequent versions of the LULC product. Consistency: The accuracy of the predictions was compared per year and per 30km*30km tile across europe to derive temporal and spatial consistency by calculating the standard deviation. The standard deviation of annual weighted F1-score was 0.135, while the standard deviation of weighted F1-score per tile was 0.150. This means the dataset is more consistent through time than through space: Predictions are notably less accurate along the Mediterrranean coast. The accompanying publication contains additional information and visualisations. Positional accuracy: The raster layers have a resolution of 30m, identical to that of the Landsat data cube used as input features for the machine learning framework that predicted it. Temporal accuracy: The dataset contains predictions and uncertainty layers for each year between 2000 and 2019. Thematic accuracy: The maps reproduce the Corine Land Cover classification system, a hierarchical legend that consists of 5 classes at the highest level, 14 classes at the second level, and 44 classes at the third level. Class 523: Oceans was omitted due to computational constraints.
-
osm: Farmyard rasterized from OSM landuse polygons, first to 10m spatial resolution and after downsampled to 30m by spatial average.
-
Overview: 121: Land units that are under industrial or commercial use or serve for public service facilities. Traceability (lineage): This dataset was produced with a machine learning framework with several input datasets, specified in detail in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ) Scientific methodology: The single-class probability layers were generated with a spatiotemporal ensemble machine learning framework detailed in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ). The single-class uncertainty layers were calculated by taking the standard deviation of the three single-class probabilities predicted by the three components of the ensemble. The HCL (hard class) layers represents the class with the highest probability as predicted by the ensemble. Usability: The HCL layers have a decreasing average accuracy (weighted F1-score) at each subsequent level in the CLC hierarchy. These metrics are 0.83 at level 1 (5 classes):, 0.63 at level 2 (14 classes), and 0.49 at level 3 (43 classes). This means that the hard-class maps are more reliable when aggregating classes to a higher level in the hierarchy (e.g. 'Discontinuous Urban Fabric' and 'Continuous Urban Fabric' to 'Urban Fabric'). Some single-class probabilities may more closely represent actual patterns for some classes that were overshadowed by unequal sample point distributions. Users are encouraged to set their own thresholds when postprocessing these datasets to optimize the accuracy for their specific use case. Uncertainty quantification: Uncertainty is quantified by taking the standard deviation of the probabilities predicted by the three components of the spatiotemporal ensemble model. Data validation approaches: The LULC classification was validated through spatial 5-fold cross-validation as detailed in the accompanying publication. Completeness: The dataset has chunks of empty predictions in regions with complex coast lines (e.g. the Zeeland province in the Netherlands and the Mar da Palha bay area in Portugal). These are artifacts that will be avoided in subsequent versions of the LULC product. Consistency: The accuracy of the predictions was compared per year and per 30km*30km tile across europe to derive temporal and spatial consistency by calculating the standard deviation. The standard deviation of annual weighted F1-score was 0.135, while the standard deviation of weighted F1-score per tile was 0.150. This means the dataset is more consistent through time than through space: Predictions are notably less accurate along the Mediterrranean coast. The accompanying publication contains additional information and visualisations. Positional accuracy: The raster layers have a resolution of 30m, identical to that of the Landsat data cube used as input features for the machine learning framework that predicted it. Temporal accuracy: The dataset contains predictions and uncertainty layers for each year between 2000 and 2019. Thematic accuracy: The maps reproduce the Corine Land Cover classification system, a hierarchical legend that consists of 5 classes at the highest level, 14 classes at the second level, and 44 classes at the third level. Class 523: Oceans was omitted due to computational constraints.
-
123: Infrastructure of port areas (land and water surface), including quays, dockyards and marinas.
-
This landcover map was produced with a classification method developed in the project incora (Inwertsetzung von Copernicus-Daten für die Raumbeobachtung, mFUND Förderkennzeichen: 19F2079C) in cooperation with ILS (Institut für Landes- und Stadtentwicklungsforschung gGmbH) and BBSR (Bundesinstitut für Bau-, Stadt- und Raumforschung) funded by BMVI (Federal Ministry of Transport and Digital Infrastructure). The goal of incora is an analysis of settlement and infrastructure dynamics in Germany based on Copernicus Sentinel data. Even though the project is concluded, the annual land cover classification product is continuously generated. This classification is based on a time-series of monthly averaged, atmospherically corrected Sentinel-2 tiles (MAJA L3A-WASP: https://geoservice.dlr.de/web/maps/sentinel2:l3a:wasp; DLR (2019): Sentinel-2 MSI - Level 2A (MAJA-Tiles)- Germany). It consists of the following landcover classes: 10: forest 20: low vegetation 30: water 40: built-up 50: bare soil 60: agriculture Potential training and validation areas were automatically extracted using spectral indices and their temporal variability from the Sentinel-2 data itself as well as the following auxiliary datasets: - OpenStreetMap (Map data copyrighted OpenStreetMap contributors and available from htttps://www.openstreetmap.org) - Copernicus HRL Imperviousness Status Map 2018 (© European Union, Copernicus Land Monitoring Service 2018, European Environment Agency (EEA)) - S2GLC Land Cover Map of Europe 2017 (Malinowski et al. 2020: Automated Production of Land Cover/Use Map of Europe Based on Sentinel-2 Imagery. Remote Sens. 2020, 12(21), 3523; https://doi.org/10.3390/rs12213523) - Germany NUTS administrative areas 1:250000 (© GeoBasis-DE / BKG 2020 / dl-de/by-2-0 / https://gdz.bkg.bund.de/index.php/default/nuts-gebiete-1-250-000-stand-31-12-nuts250-31-12.html) - Contains modified Copernicus Sentinel data (2020), processed by mundialis Processing was performed for blocks of federal states and individual maps were mosaicked afterwards. For each class 100,000 pixels from the potential training areas were extracted as training data. An exemplary validation of the classification results was perfomed for the federal state of North Rhine-Westphalia as its open data policy allows for direct access to official data to be used as reference. Rules to convert relevant ATKIS Basis-DLM object classes to the incora nomenclature were defined. Subsequently, 5.000 reference points were randomly sampled and their classification in each case visually examined and, if necessary, revised to obtain a robust reference data set. The comparison of this reference data set with the incora classification yielded the following results: overall accuracy: 83.5% class: user's accuracy / producer's accuracy (number of reference points n) forest: 90.6% / 91.9% (1410) low vegetation: 69.2% / 82.8% (844) water: 97.0% / 94.2% (69) built-up: 96.5% / 97.4% (983) bare soil: 8.5% / 68.3% (41) agriculture: 96.6% / 68.4% (1653) Compared to the previous years, the overall accuracy and accuracies of some classes is reduced. 2021 was a rather cloudy year in Germany, which means that the detection of agricultural areas is hampered as it is based on the variance of the NDVI throughout the year. With fewer cloud-free images available, the NDVI variance is not fully covered and as no adaptations have been applied to the algorithm, agricultural fields may get classified as low vegetation or bare soil more often. Another reason for lower classification accuracy is the significant damage that occured to forest areas due to storm and bark beetle. The validation dataset was generated based on aerial imagery from the years 2018/2019 which and is slowly becoming obsolete. An up-to-date validation dataset will be applied. Incora report with details on methods and results: pending
-
Modified Normalized Difference Water Index (MNDWI) from MODIS data for Europe at 1 km resolution. Source data: - MODIS/Terra Surface Reflectance 8-Day L3 Global 500 m SIN Grid (MOD09A1 v006): https://lpdaac.usgs.gov/products/mod09a1v006/ The corresponding MODIS/Aqua product (MYD09A1 v006) could not be used due to the fact that the Aqua satellite has a number of broken detectors resulting in unreliable data for band 6 (SWIR) measurements. The Moderate Resolution Imaging Spectroradiometer (MODIS) Terra MOD09A1 Version 6 product provides an estimate of the surface spectral reflectance of Terra MODIS Bands 1 through 7 corrected for atmospheric conditions such as gasses, aerosols, and Rayleigh scattering. Along with the seven 500 meter (m) reflectance bands are two quality layers and four observation bands. For each pixel, a value is selected from all the acquisitions within the 8-day composite period. The criteria for the pixel choice include cloud and solar zenith. When several acquisitions meet the criteria the pixel with the minimum channel 3 (blue) value is used. For the time periods October 2016 - March 2017 and August 2020 - April 2021, the original data has been reprojected to ETRS89-extended / LAEA Europe and aggregated to a 1 km grid. The temporal resolution is 8 days. Bad quality pixels (cloud, cloud shadow, dead detector, solar zenith angle too large, etc.) have been masked using the provided quality assurance (QA) layers and appear as "no data". File naming: productCode.acquisitionDate[A (YYYYDDD)]_mosaic_spatialResolution_frequency_VI.tif example: MOD09A1.A2016353_mosaic_1000m_8_days_MNDWI.tif The date is Year and Day of Year. Values are MNDWI * 10000. Example: Value -5099 = -0.5099
-
rgb: Landsat RGB time-series, derived by the median pixel values obtained between June 25 and September 12 on a specific year.
-
osm: Forest rasterized from OSM landuse polygons, first to 10m spatial resolution and after downsampled to 30m by spatial average.
-
Overview: 321: Grasslands under no or moderate human influence. Low productivity grasslands. Often situated in areas of rough, uneven ground, steep slopes; frequently including rocky areas or patches of other (semi-)natural vegetation. Natural grasslands are areas with herbaceous vegetation (maximum height is 150 cm and gramineous species are prevailing) covering at least 50 % of the surface. Besides herbaceous vegetation, areas of shrub formations, of scattered trees and of mineral outcrops also occur. Often under nature conservation. In this context the term ”natural” indicates that vegetation is developed under a minimum human interference,(not mowed, drained, irrigated, sown, fertilized or stimulated by chemicals, which might influence production of biomass). Even though the human interference cannot be completely discarded in quoted areas, it does not suppress the natural development or species composition of the meadows. Maintenance mowing and shrub clearance for prevention of woody overgrowth due to natural succession is tolerated. Sporadic extensive grazing with low livestock unit/ha is possible. Typical visible characteristics: large extent, irregular shape, usually in distant location from larger settlements. Traceability (lineage): This dataset was produced with a machine learning framework with several input datasets, specified in detail in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ) Scientific methodology: The single-class probability layers were generated with a spatiotemporal ensemble machine learning framework detailed in Witjes et al., 2022 (in review, preprint available at https://doi.org/10.21203/rs.3.rs-561383/v3 ). The single-class uncertainty layers were calculated by taking the standard deviation of the three single-class probabilities predicted by the three components of the ensemble. The HCL (hard class) layers represents the class with the highest probability as predicted by the ensemble. Usability: The HCL layers have a decreasing average accuracy (weighted F1-score) at each subsequent level in the CLC hierarchy. These metrics are 0.83 at level 1 (5 classes):, 0.63 at level 2 (14 classes), and 0.49 at level 3 (43 classes). This means that the hard-class maps are more reliable when aggregating classes to a higher level in the hierarchy (e.g. 'Discontinuous Urban Fabric' and 'Continuous Urban Fabric' to 'Urban Fabric'). Some single-class probabilities may more closely represent actual patterns for some classes that were overshadowed by unequal sample point distributions. Users are encouraged to set their own thresholds when postprocessing these datasets to optimize the accuracy for their specific use case. Uncertainty quantification: Uncertainty is quantified by taking the standard deviation of the probabilities predicted by the three components of the spatiotemporal ensemble model. Data validation approaches: The LULC classification was validated through spatial 5-fold cross-validation as detailed in the accompanying publication. Completeness: The dataset has chunks of empty predictions in regions with complex coast lines (e.g. the Zeeland province in the Netherlands and the Mar da Palha bay area in Portugal). These are artifacts that will be avoided in subsequent versions of the LULC product. Consistency: The accuracy of the predictions was compared per year and per 30km*30km tile across europe to derive temporal and spatial consistency by calculating the standard deviation. The standard deviation of annual weighted F1-score was 0.135, while the standard deviation of weighted F1-score per tile was 0.150. This means the dataset is more consistent through time than through space: Predictions are notably less accurate along the Mediterrranean coast. The accompanying publication contains additional information and visualisations. Positional accuracy: The raster layers have a resolution of 30m, identical to that of the Landsat data cube used as input features for the machine learning framework that predicted it. Temporal accuracy: The dataset contains predictions and uncertainty layers for each year between 2000 and 2019. Thematic accuracy: The maps reproduce the Corine Land Cover classification system, a hierarchical legend that consists of 5 classes at the highest level, 14 classes at the second level, and 44 classes at the third level. Class 523: Oceans was omitted due to computational constraints.
Open Data Science Europe Metadata Catalog