spark-nlp
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment.
spark-nlp has been downloaded 170,365,710 times in total on PyPI, including 1,134,112 in the last 30 days. The latest version is 6.4.1rc3, released May 21, 2026.
Version: 6.4.1rc3
Downloads: 170.37M
License: (not listed)
Author: John Snow Labs
Updated: May 21, 2026
Downloads
Weekly, last 90d.
Includes CI traffic.
Selected total: 5.11M
All-time: 170.4M
Last 30 days: 1.1M
Last 24 h: 52.8k
Per second: 0.45/s
Version distribution
Share of downloads by released version. Computed over the last quarter.
- 01. 3.3.2: 671.7k downloads (20.6%)
- 02. 6.4.0: 660.7k downloads (20.3%)
- 03. 6.3.3: 338.8k downloads (10.4%)
- 04. 6.4.1: 219.2k downloads (6.7%)
- 05. 4.4.4: 206.9k downloads (6.4%)
- 06. 3.4.2: 193.2k downloads (5.9%)
- 07. 4.4.2: 143.8k downloads (4.4%)
- 08. 5.3.3: 140.1k downloads (4.3%)
- 09. Other: 679.4k downloads (20.9%)
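The listed percentages appear to be each version's count divided by the sum of all listed counts, including "Other". The sketch below checks that reading against the figures on this page. The names `downloads_k` and `shares` are illustrative only, and the counts are the rounded values shown here (in thousands), not raw data from the statistics service.

```python
# Download counts in thousands, copied from the version distribution list.
# These are the rounded figures shown on the page, not raw counts.
downloads_k = {
    "3.3.2": 671.7,
    "6.4.0": 660.7,
    "6.3.3": 338.8,
    "6.4.1": 219.2,
    "4.4.4": 206.9,
    "3.4.2": 193.2,
    "4.4.2": 143.8,
    "5.3.3": 140.1,
    "Other": 679.4,
}


def shares(counts):
    """Return each entry's share of the total, as a percentage rounded to 0.1."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


total_k = sum(downloads_k.values())  # quarter total, in thousands
version_shares = shares(downloads_k)

for name, pct in version_shares.items():
    print(f"{name:>6}  {downloads_k[name]:>7.1f}k  {pct:>5.1f}%")
```

Under this assumption the computed shares agree with the percentages listed for every row, which suggests the "Other" bucket is part of the denominator.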