Spark NLP Workshop

Notebooks and code examples showing how to use Spark NLP in Python and Scala.

Docker setup

If you want to try Spark NLP and run the Jupyter examples without installing anything, you can use our Docker image:

1- Get the Docker image for spark-nlp-workshop:

docker pull johnsnowlabs/spark-nlp-workshop

2- Run the image locally, binding ports 8888 (Jupyter) and 4040 (Spark UI):

docker run -it --rm -p 8888:8888 -p 4040:4040 johnsnowlabs/spark-nlp-workshop

3- Open Jupyter notebooks inside your browser by using the token printed on the console.
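
If the console output is long, the token can be pulled out of it programmatically. The following is a minimal sketch, not part of Spark NLP or Jupyter: the helper name `jupyter_url_from_log` is hypothetical, and it assumes the container prints a URL containing `?token=<hex string>` and that port 8888 is bound to localhost as in step 2.

```python
import re
from typing import Optional

# Assumption: the token appears in the console output as "?token=<hex digits>".
TOKEN_PATTERN = re.compile(r"[?&]token=([0-9a-f]+)")


def jupyter_url_from_log(console_output: str,
                         host: str = "localhost",
                         port: int = 8888) -> Optional[str]:
    """Build a browser URL from the first token found in the console output.

    Hypothetical helper for illustration. Returns None when no token is found.
    """
    match = TOKEN_PATTERN.search(console_output)
    if match is None:
        return None
    return f"http://{host}:{port}/?token={match.group(1)}"


if __name__ == "__main__":
    # Made-up sample line; a real token will differ.
    sample = "    http://127.0.0.1:8888/?token=0123abcd\n"
    print(jupyter_url_from_log(sample))
```

Paste the text printed by `docker run` into `jupyter_url_from_log` and open the returned URL in your browser.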


Main repository

Project's website

Take a look at our official spark-nlp page for user documentation and examples.

Slack community channel

Join Slack


If you find any example that is no longer working, please create an issue.


Apache License 2.0