


Open Source Data Science Models

Dean / BioBERT-DAGsHub

Updated 5 months ago

Path: output/NCBI-disease

A DagsHub implementation of BioBERT: a pre-trained biomedical language representation model for biomedical text mining

dataset model nlp named entity recognition dvc git

Path: model

In this project we're going to create a simple stock prediction model and use it to predict Yahoo BSD stock. Once the model has been created, we're going to monitor it using an ML monitoring tool like Evidently.

model tabular scikit-learn time series forecasting dvc git mlflow

morrisalp / unikud

Updated 7 months ago

Path: models

UNIKUD is an open-source tool for adding vowel signs (nikud) to Hebrew text with deep learning, using absolutely no rule-based logic.

dataset model nlp dvc git mlflow github

Path: src/models

Open Source Data Science (OSDS) Monocular Depth Estimation – Turn 2D photos into 3D photos – show your grandma the awesome results.

dataset model computer vision depth estimation dvc git mlflow google cloud storage

Path: outputs

A repo for the tutorial explaining the benefits of DVC and DAGsHub, using the classification of questions for the Cross Validated statistics Stack Exchange as an example problem

dataset model dvc git mlflow

Path: .

This repository holds open-source machine learning models for various domains ready to download and use

model git github

Path: .

This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses two fixed, pretrained text encoders (OpenCLIP-ViT/G and CLIP-ViT/L).

model dvc git arxiv

Path: .

This model is a fine-tuned checkpoint of DistilBERT-base-uncased, fine-tuned on SST-2.

model dvc git

Rutam21 / bart-large-cnn

Updated 8 months ago

Path: .

BART is a transformer encoder-decoder (seq2seq) model with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder.

model dvc git

Path: .

This is a fine-tuned version of the multi-modal LayoutLM model for the task of question answering on invoices and other documents.

model dvc git

Path: .

A Vision-and-Language Transformer (ViLT) model fine-tuned on VQAv2. ViLT is up to tens of times faster than previous VLP models.

model dvc git

Path: .

Pix2Struct is an image encoder - text decoder model that is trained on image-text pairs for various tasks, including image captioning and visual question answering.

model dvc git

Path: .

This model is based on a multi-stage text-to-video generation diffusion model, which inputs a description text and returns a video that matches the text description.

model dvc git

Path: .

BLIP is a new VLP framework which transfers flexibly to both vision-language understanding and generation tasks.

model dvc git

Path: .

This is an image captioning model trained by @ydshieh in Flax.

model dvc git

Rutam21 / bert-base-cased

Updated 8 months ago

Path: .

It is a model pretrained on English-language text using a masked language modeling (MLM) objective.

model dvc git

Path: .

ncar-cesm-lens-dataset originates from the Registry of Open Data on AWS.

dataset model git aws s3

Path: .

wrf-cmip6-dataset originates from the Registry of Open Data on AWS.

dataset model git aws s3