spark-nlp

John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.

spark-nlp has been downloaded 170,091,559 times in total on PyPI, including 1,116,311 in the last 30 days. The latest version is 6.4.1rc3, released May 21, 2026.

Version6.4.1rc3
Downloads
170.09M
License
AuthorJohn Snow Labs
UpdatedMay 21, 2026

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal6.*5.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 188 series
Selected total5.15M
170.1MAll-time
1.1MLast 30 days
39.5kLast 24 h
0.42/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    3.3.2

    672.4k downloads
    20.8%
  • 02

    6.4.0

    651.9k downloads
    20.2%
  • 03

    6.3.3

    380.6k downloads
    11.8%
  • 04

    4.4.4

    214.1k downloads
    6.6%
  • 05

    3.4.2

    188.9k downloads
    5.8%
  • 06

    4.4.2

    143.3k downloads
    4.4%
  • 07

    5.3.3

    139.3k downloads
    4.3%
  • 08

    6.4.1

    118.0k downloads
    3.6%
  • 09

    Other

    726.4k downloads
    22.5%

Guess the next day

Thirteen recent days of spark-nlp downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH39.5k49.2kTHUFRISATSUNMONTUEWEDTHUFRISATSUNMONTUE
    spark-nlp · 170.1M downloads on PyPI