Geology Datasets 

The geology domain, encompassing fields such as geospatial, weather, climate, ocean, earth, and environmental science, presents a rich source of information for machine learning applications. With data from sources such as satellite imagery, weather sensors, and oceanographic measurements, models can make predictions and uncover insights related to coastal management, agriculture, disaster response, and climate change. The emerging field of “geospatial analytics” leverages these data sources to study topics such as crop yield forecasting, flood prediction, and environmental impact analysis. With the right tools and algorithms, geology data has the potential to inform decision-making and drive progress in fields related to the environment, agriculture, and disaster response. So, explore the power of machine learning in the geology domain and discover new trends and insights that drive innovation and progress.

Sentinel-2 Cloud-Optimized GeoTIFFs

CMIP6 GCMs downscaled using WRF

ESA WorldCover

NOAA National Water Model Short-Range Forecast

Radiant MLHub

Community Earth System Model Large Ensemble (CESM LENS)

NOAA Geostationary Operational Environmental Satellites (GOES) 16, 17 & 18

NOAA Real-Time Mesoscale Analysis (RTMA)

Virginia Coastal Resilience Master Plan, Phase 1 – December 2021

NOAA Global Ensemble Forecast System (GEFS)

NOAA Global Extratropical Surge and Tide Operational Forecast System (Global ESTOFS)

Prefeitura Municipal de São Paulo (PMSP) LiDAR Point Cloud

NOAA North American Mesoscale Forecast System (NAM)

Defense Meteorology Satellite Program (DMSP) Auroral Particle Flux

HIRLAM Weather Model

NOAA Space Weather Forecast and Observation Data

Sentinel-1 SLC dataset for South and Southeast Asia, Taiwan, Korea and Japan

AWS iGenomes

SILAM Air Quality

Corn Kernel Counting Dataset

OpenStreetMap on AWS

NREL National Solar Radiation Database

NOAA S-102 Bathymetric Surface Data

CMAS Data Warehouse

NOAA Global Ensemble Forecast System (GEFS) Re-forecast

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

SpaceNet

RADARSAT-1

Indiana Statewide Digital Aerial Imagery Catalog

Pacific Ocean Sound Recordings

NOAA High-Resolution Rapid Refresh (HRRR) Model

Copernicus Digital Elevation Model (DEM)

CCAFS-Climate Data

EPA Risk-Screening Environmental Indicators

NOAA Global Real-Time Ocean Forecast System (Global RTOFS)

NOAA Unified Forecast System Weather Model (UFS-WM) Regression Tests

NASA Prediction of Worldwide Energy Resources (POWER)

10m Annual Land Use Land Cover (9-class)

Multi-Scale Ultra High Resolution (MUR) Sea Surface Temperature (SST)

Geosnap Data, Center for Geospatial Sciences

Normalized Difference Urban Index (NDUI)

Coupled Model Intercomparison Project 6

NOAA Water-Column Sonar Data Archive

NOAA National Digital Forecast Database (NDFD)

NOAA Global Forecast System (GFS)

NOAA National Blend of Models (NBM)

NOAA U.S. Climate Normals

Cloud to Street – Microsoft Flood and Clouds Dataset

NOAA Coastal Lidar Data

ASTER L1T Cloud-Optimized GeoTIFFs

High Resolution Population Density Maps + Demographic Estimates by CIESIN and Meta

High Resolution Downscaled Climate Data for Southeast Alaska

Terrain Tiles

NOAA U.S. Climate Gridded Dataset (NClimGrid)

NOAA Global Historical Climatology Network Daily (GHCN-D)

NOAA Atmospheric Climate Data Records

RAPID NRT Flood Maps

Storm EVent ImageRy (SEVIR)

Sentinel-2 L2A 120m Mosaic

NOAA/PMEL Ocean Climate Stations Moorings

Daylight Map Distribution of OpenStreetMap

NOAA Global Surface Summary of Day

Low Altitude Disaster Imagery (LADI) Dataset

Capella Space Synthetic Aperture Radar (SAR) Open Dataset

District of Columbia – Classified Point Cloud LiDAR

Sea Surface Temperature Daily Analysis: European Space Agency Climate Change Initiative product version 2.1

ArcticDEM

Coupled Model Intercomparison Project Phase 5 (CMIP5) University of Wisconsin-Madison Probabilistic Downscaling Dataset

Crowdsourced Bathymetry

Legal Entity Identifier (LEI) and Legal Entity Reference Data (LE-RD)

DOE’s Water Power Technology Office’s (WPTO) US Wave dataset

Sentinel-5P Level 2

Co-Produced Climate Data to Support California’s Resilience Investments

3000 Rice Genomes Project

Public Utility Data Liberation Project

NOAA Severe Weather Data Inventory (SWDI)

SMN Hi-Res Weather Forecast over Argentina

Global Database of Events, Language and Tone (GDELT)

Southern California Earthquake Data

PoroTomo

NOAA Emergency Response Imagery

NOAA 3-D Surge and Tide Operational Forecast System for the Atlantic Basin (STOFS-3D-Atlantic)

NOAA Global Mosaic of Geostationary Satellite Imagery (GMGSI)

Retired – UK Met Office Atmospheric Deterministic and Probabilistic Forecasts

NOAA Climate Forecast System (CFS)

NOAA Operational Forecast System (OFS)

Sounds of Central African landscapes

Ozone Monitoring Instrument (OMI) / Aura NO2 Tropospheric Column Density

iNaturalist Licensed Observation Images

Speedtest by Ookla Global Fixed and Mobile Network Performance Maps

Sentinel Near Real-time Canada Mirror

University of British Columbia Sunflower Genome Dataset

CAM6 Data Assimilation Research Testbed (DART) Reanalysis: Cloud-Optimized Dataset

New Jersey Statewide LiDAR

NOAA Unified Forecast System Subseasonal to Seasonal Prototypes

NOAA – hourly position, current, and sea surface temperature from drifters

ISERV

NOAA Unified Forecast System (UFS) Marine Reanalysis: 1979-2019

Africa Soil Information Service (AfSIS) Soil Chemistry

Earth Observation Data Cubes for Brazil

2021 Amazon Last Mile Routing Research Challenge Dataset

Amazonia EO satellite on AWS

NOAA Wave Ensemble Reforecast

NA-CORDEX – North American component of the Coordinated Regional Downscaling Experiment

A Global Drought and Flood Catalogue from 1950 to 2016

iSDAsoil

ECMWF ERA5 Reanalysis

NASA Earth Exchange Global Daily Downscaled Projections (NEX-GDDP-CMIP6)

NOAA Rapid Refresh Forecast System (RRFS) [Prototype]

Safecast

OpenEEW

World Bank – Light Every Night

NOAA Unified Forecast System Short-Range Weather (UFS SRW) Application

Central Weather Bureau OpenData

CAFE60 reanalysis

Ford Multi-AV Seasonal Dataset

NOAA S-111 Surface Water Currents Data

NOAA Fundamental Climate Data Records (FCDR)

NOAA Joint Polar Satellite System (JPSS)

National Herbarium of NSW

SILO climate data on AWS

NREL Wind Integration National Dataset

NASA Earth Exchange (NEX) Data Collection

Sentinel-1 SLC dataset for Germany

Downscaled Climate Data for Alaska

JMA Himawari-8/9

Department of Energy’s Open Energy Data Initiative (OEDI)

Reference Elevation Model of Antarctica (REMA)

New Jersey Statewide Digital Aerial Imagery Catalog

Community Earth System Model v2 Large Ensemble (CESM2 LENS)

Orcasound – bioacoustic data for marine conservation

Natural Earth

AgricultureVision

PALSAR-2 ScanSAR CARD4L (L2.2)

NOAA World Ocean Database (WOD)

GeoNet Aotearoa New Zealand Data

OpenAQ

NOAA National Water Model CONUS Retrospective Dataset

RarePlanes

Atmospheric Models from Météo-France

Analysis Ready Sentinel-1 Backscatter Imagery

Sentinel-3

Scottish Public Sector LiDAR Dataset

NOAA Oceanic Climate Data Records

Finnish Meteorological Institute Weather Radar Data

NOAA Terrestrial Climate Data Records

Earth Radio Occultation

QIIME 2 User Tutorial Datasets

Global Seasonal Sentinel-1 Interferometric Coherence and Backscatter Data Set

Community Earth System Model v2 ARISE (CESM2 ARISE)

NOAA National Bathymetric Source Data

ARPA-E PERFORM Forecast data

Open City Model (OCM)

NOAA Continuously Operating Reference Stations (CORS) Network (NCN)

NOAA Global Hydro Estimator (GHE)

IDEAM – Colombian Radar Network

Terra Fusion Data Sampler

NOAA Integrated Surface Database (ISD)

NEXRAD on AWS

Longitudinal Nutrient Deficiency

SondeHub Radiosonde Telemetry

NOAA Rapid Refresh (RAP)

Transform your ML development with DagsHub –
Try it now!

More categories

Computer Vision

Audio

NLP

Tabular

Biology

Urban

Back to top