spark-nlp

John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.

spark-nlp has been downloaded 170,365,710 times in total on PyPI, including 1,134,112 in the last 30 days. The latest version is 6.4.1rc3, released May 21, 2026.

Version6.4.1rc3
Downloads
170.37M
License
AuthorJohn Snow Labs
UpdatedMay 21, 2026

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal6.*5.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 189 series
Selected total5.11M
170.4MAll-time
1.1MLast 30 days
52.8kLast 24 h
0.45/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    3.3.2

    671.7k downloads
    20.6%
  • 02

    6.4.0

    660.7k downloads
    20.3%
  • 03

    6.3.3

    338.8k downloads
    10.4%
  • 04

    6.4.1

    219.2k downloads
    6.7%
  • 05

    4.4.4

    206.9k downloads
    6.4%
  • 06

    3.4.2

    193.2k downloads
    5.9%
  • 07

    4.4.2

    143.8k downloads
    4.4%
  • 08

    5.3.3

    140.1k downloads
    4.3%
  • 09

    Other

    679.4k downloads
    20.9%

Guess the next day

Thirteen recent days of spark-nlp downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH52.8k37.2kTHUFRISATSUNMONTUEWEDTHUFRISATSUNMONTUE
    spark-nlp · 170.4M downloads on PyPI