Spark NLP Workshop

Notebooks and code examples showing how to use Spark NLP in Python and Scala.

Docker setup

If you want to try Spark NLP and run the Jupyter examples without installing anything, you can use our Docker image:

1- Get the Docker image for spark-nlp-workshop:

docker pull johnsnowlabs/spark-nlp-workshop

2- Run the image locally, binding ports 8888 (Jupyter) and 4040 (Spark UI):

docker run -it --rm -p 8888:8888 -p 4040:4040 johnsnowlabs/spark-nlp-workshop

3- Open the Jupyter notebooks in your browser, using the token printed on the console.
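
If step 2 fails to bind a port, something else on the machine may already be listening on it. The following is a minimal sketch in Python for checking this beforehand, assuming the ports are bound on localhost. The helper name `port_is_free` and the constant `WORKSHOP_PORTS` are hypothetical and not part of Spark NLP or the workshop image.

```python
import socket

# Host ports published by the `docker run` command in step 2.
WORKSHOP_PORTS = (8888, 4040)


def port_is_free(port, host="127.0.0.1"):
    """Return True if nothing accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(0.5)
        # connect_ex returns 0 only when a listener accepted the connection.
        return sock.connect_ex((host, port)) != 0


if __name__ == "__main__":
    for port in WORKSHOP_PORTS:
        state = "free" if port_is_free(port) else "already in use"
        print(f"port {port}: {state}")
```

If a port is reported as already in use, one option is to map a different host port in step 2 (for example `-p 8889:8888`) and open that port in the browser instead.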


Main repository

Project's website

Take a look at the official Spark NLP page for user documentation and examples.

Slack community channel

Join Slack


If you find an example that no longer works, please create an issue.


Apache License 2.0