def frameIt( u:String, h:Int ) : String = { """<iframe src=""""+ u+"""" width="95%" height="""" + h + """" sandbox> <p> <a href="http://spark.apache.org/docs/latest/index.html"> Fallback link for browsers that, unlikely, don't support frames </a> </p> </iframe>""" } displayHTML(frameIt("https://en.wikipedia.org/wiki/Natural_language_processing",550))
Last refresh: Never
Command took 0.16 seconds
%md ## SparkML What does SparkML Pipelines already provide for NLP? - http://spark.apache.org/docs/latest/ml-features.html - http://spark.apache.org/docs/latest/ml-features.html#extracting-transforming-and-selecting-features - http://spark.apache.org/docs/latest/ml-features.html#word2vec - ... - https://spark.apache.org/docs/latest/mllib-feature-extraction.html - https://spark.apache.org/docs/latest/mllib-feature-extraction.html#word2vec - ... ## SparkNLP How does John Snow Labs' [sparkNLP](https://nlp.johnsnowlabs.com/) build on SparkML? - https://databricks.com/blog/2017/10/19/introducing-natural-language-processing-library-apache-spark.html #### Why SparkNLP? - https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-training-spark-nlp-and-spacy-pipelines - https://blog.dominodatalab.com/comparing-the-functionality-of-open-source-natural-language-processing-libraries/ ### SparkNLP Pointers - https://nlp.johnsnowlabs.com/docs/en/quickstart - https://github.com/JohnSnowLabs/spark-nlp - The databricks notebooks in this directory are minor augnmentations/updates of: - https://johnsnowlabs.github.io/spark-nlp-workshop/databricks/index.html - https://nlp.johnsnowlabs.com/docs/en/models
SparkML
What does SparkML Pipelines already provide for NLP?
- http://spark.apache.org/docs/latest/ml-features.html
- http://spark.apache.org/docs/latest/ml-features.html#extracting-transforming-and-selecting-features
- https://spark.apache.org/docs/latest/mllib-feature-extraction.html
SparkNLP
How does John Snow Labs' sparkNLP build on SparkML?
Why SparkNLP?
- https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-training-spark-nlp-and-spacy-pipelines
- https://blog.dominodatalab.com/comparing-the-functionality-of-open-source-natural-language-processing-libraries/
SparkNLP Pointers
- https://nlp.johnsnowlabs.com/docs/en/quickstart
- https://github.com/JohnSnowLabs/spark-nlp
- The databricks notebooks in this directory are minor augnmentations/updates of:
- https://nlp.johnsnowlabs.com/docs/en/models
Last refresh: Never
displayHTML(frameIt("https://databricks.com/blog/2017/10/19/introducing-natural-language-processing-library-apache-spark.html",500))
Last refresh: Never
Command took 0.16 seconds
%md ## Spark NLP Latest ### 2019 Spark Summit [](https://www.youtube.com/embed/ziUrLnJ_Y8A) ### 2018 Data Science Summit [](https://www.youtube.com/embed/qG_bdbpOM-Y) #### 2018 Spark Summit - https://databricks.com/session/apache-spark-nlp-extending-spark-ml-to-deliver-fast-scalable-unified-natural-language-processing #### 2017 Spark Summit [](https://www.youtube.com/embed/k5X12mdEvb8) More videos, podcasts, articles, docs, etc. here: - https://nlp.johnsnowlabs.com/videos - https://nlp.johnsnowlabs.com/articles - https://nlp.johnsnowlabs.com/docs/en/quickstart
Spark NLP Latest
2019 Spark Summit
2018 Data Science Summit
2018 Spark Summit
2017 Spark Summit
More videos, podcasts, articles, docs, etc. here:
Last refresh: Never
displayHTML(frameIt("https://nlp.johnsnowlabs.com/articles", 500))
Last refresh: Never
Command took 0.14 seconds
SDS-2.x, Scalable Data Engineering Science
Last refresh: Never