spark-nlp

John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.

spark-nlp has been downloaded 171,253,592 times in total on PyPI, including 1,162,033 in the last 30 days. The latest version is 6.4.1rc3, released May 21, 2026.

Version6.4.1rc3
Downloads
171.25M
License
AuthorJohn Snow Labs
UpdatedMay 21, 2026

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal6.*5.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 191 series
Selected total5.27M
171.3MAll-time
1.2MLast 30 days
38.8kLast 24 h
0.43/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    6.4.0

    685.1k downloads
    20.6%
  • 02

    3.3.2

    668.7k downloads
    20.1%
  • 03

    6.4.1

    419.8k downloads
    12.6%
  • 04

    4.4.4

    225.0k downloads
    6.8%
  • 05

    3.4.2

    182.4k downloads
    5.5%
  • 06

    4.4.2

    142.6k downloads
    4.3%
  • 07

    5.3.3

    142.0k downloads
    4.3%
  • 08

    6.4.2

    135.0k downloads
    4.1%
  • 09

    Other

    726.0k downloads
    21.8%

Guess the next day

Thirteen recent days of spark-nlp downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH38.8k47.7kSATSUNMONTUEWEDTHUFRISATSUNMONTUEWEDTHU