Dean Pleban
Dean
Dean
The MolBART project aims to pre-train a BART transformer language model on molecular SMILES strings by optimising a de-noising objective. We hypothesised that pre-training will lead to improved generalisation, performance, training speed and validity on downstream fine-tuned tasks. We tested the pre-trained model on downstream tasks such as reaction prediction, retrosynthetic prediction, molecular optimisation and molecular property prediction.
pytorch git github
This repository contains the code to import and integrate the book and rating data that we work with. It imports and integrates data from several sources in a homogenous tabular outputs; import scripts are primarily Rust, with Python implement analyses.
dataset nlp dvc git github
A self contained Active Learning repo, using local inference with DagsHub client to annotate objects with a YOLO model
computer vision object detection active learning dvc git mlflow ultralytics yolo
A deep learning model for small molecule drug discovery and cheminformatics based on SMILES
pytorch biomedical information retrieval git github
Updated 4 months ago
Showcasing how to add an externally created LS annotation to Data Engine
No topics have been added
A demonstration of LLM prompt engineering and tracing with DagsHub and MLflow
nlp dvc git mlflow langchain
A hello world project that covers dataset curation, annotation, experiment tracking and model management. All in one place
dataset model computer vision dvc git mlflow ultralytics yolo
Updated 7 months ago
A truly multimodal dataset containing images, audio, text, documents and video
No topics have been added
Dvc + Streamlit = ❤️ An example of working with DVC and Streamlit
dvc git github