NLTK Lookup Error
Question:
While running a Python script using NLTK I got this:
Traceback (most recent call last):
File "cpicklesave.py", line 56, in <module>
pos = nltk.pos_tag(words)
File "/usr/lib/python2.7/site-packages/nltk/tag/__init__.py", line 110, in pos_tag
tagger = PerceptronTagger()
File "/usr/lib/python2.7/site-packages/nltk/tag/perceptron.py", line 140, in __init__
AP_MODEL_LOC = str(find('taggers/averaged_perceptron_tagger/'+PICKLE))
File "/usr/lib/python2.7/site-packages/nltk/data.py", line 641, in find
raise LookupError(resource_not_found)
LookupError:
**********************************************************************
Resource u'taggers/averaged_perceptron_tagger/averaged_perceptron_tagger.pickle'
not found. Please use the NLTK Downloader to obtain the resource:
>>> nltk.download()
Searched in:
- '/root/nltk_data'
- '/usr/share/nltk_data'
- '/usr/local/share/nltk_data'
- '/usr/lib/nltk_data'
- '/usr/local/lib/nltk_data'
**********************************************************************
Can anyone explain the problem?
Answers:
Use
>>> nltk.download()
to install the missing module (the Perceptron Tagger).
(check also the answers to Failed loading english.pickle with nltk.data.load)
The first answer says the missing module is "the Perceptron Tagger"; its actual name in nltk.download() is 'averaged_perceptron_tagger'.
You can use this to fix the error
nltk.download('averaged_perceptron_tagger')
TL;DR
import nltk
nltk.download('averaged_perceptron_tagger')
Or to download all packages + data + docs:
import nltk
nltk.download('all')
Problem:
Lookup error when fitting a CountVectorizer from scikit-learn. Below is the code snippet.
from sklearn.feature_extraction.text import CountVectorizer
bow_transformer = CountVectorizer(analyzer=text_process).fit(X)
Solution:
Run the code below, then use the downloader to install the stopwords corpus from the Natural Language Toolkit's corpora collection:
import nltk
nltk.download()
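The CountVectorizer failure above comes from the analyzer, not from scikit-learn itself: text_process (not shown in the question) typically calls nltk.corpus.stopwords, which is what raises the LookupError when the 'stopwords' corpus is missing. A minimal sketch of such an analyzer, with a hard-coded stand-in stopword set so it runs without any download:

```python
import string

# Stand-in for nltk.corpus.stopwords.words('english'); the real list is
# what triggers the LookupError when the 'stopwords' corpus is absent.
STOPWORDS = {'the', 'a', 'an', 'is', 'and'}

def text_process(text):
    # Strip punctuation, lowercase, split on whitespace, drop stopwords.
    no_punct = ''.join(ch for ch in text if ch not in string.punctuation)
    return [w for w in no_punct.lower().split() if w not in STOPWORDS]
```

Any callable with this shape (string in, list of tokens out) can be passed as CountVectorizer(analyzer=text_process).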
Install all nltk resources in one line:
python3 -c "import nltk; nltk.download('all')"
The data will be saved to ~/nltk_data.
Install only specific resource:
Replace "all" with "averaged_perceptron_tagger" to install only that module.
python3 -c "import nltk; nltk.download('averaged_perceptron_tagger')"
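The same one-liners can also go through NLTK's downloader CLI, which accepts a target directory via -d, useful for system-wide installs. The path below is just an example; pick any directory from the "Searched in:" list in the error message:

```shell
# Download one resource into a custom nltk_data directory (example path)
python3 -m nltk.downloader -d /usr/local/share/nltk_data averaged_perceptron_tagger
```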
You can download the missing NLTK module just by running
import nltk
nltk.download()
This opens the NLTK download screen.
If it fails with an "SSL: CERTIFICATE_VERIFY_FAILED" error, disabling the SSL check with the code below should work (note that this skips certificate verification, so treat it as a last resort):
import nltk
import ssl
try:
    _create_unverified_https_context = ssl._create_unverified_context
except AttributeError:
    pass
else:
    ssl._create_default_https_context = _create_unverified_https_context
nltk.download()
Sometimes nltk.download('module_name') does not download the resource when run from inside a script. In those cases, open Python in interactive mode and run nltk.download('module_name') there.
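To make scripts robust against this, a common pattern (a sketch built on nltk.data.find and nltk.download; the helper name is my own) is to probe for the resource first and download only when it is missing:

```python
import nltk

def ensure_resource(path, name):
    """Download an NLTK resource only if nltk.data.find cannot locate it."""
    try:
        nltk.data.find(path)       # raises LookupError when the resource is absent
    except LookupError:
        nltk.download(name)

# Usage, e.g. for the tagger from the traceback above:
# ensure_resource('taggers/averaged_perceptron_tagger', 'averaged_perceptron_tagger')
```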
If you have not installed NLTK yet, install it first, then run nltk.download('punkt'); that should resolve the error.
import nltk
nltk.download('vader_lexicon')
Try this; it may work. I can't speak for other environments, but it works fine in Google Colab.
import nltk
nltk.download('all')
You just need to download that module for NLTK.
The simplest way is to open a Python prompt and type
import nltk
nltk.download('all')
That’s all.
If you have already executed python -m textblob.download_corpora, you can skip the download step. If not, first run
import nltk
nltk.download('all')
or nltk.download('all-corpora')
If the issue remains after that, it may be because some packages were not unzipped. In my case I had to unzip wordnet, as my error was
Resource wordnet not found. Please use the NLTK Downloader to obtain the resource:
Solution:
cd /home/app/nltk_data/corpora
then unzip wordnet.zip
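If the unzip command is not available, the same manual step can be done with Python's standard zipfile module. The nltk_data path below is just an example; adjust it to whichever directory the error message says was searched:

```python
import os
import zipfile

# Manually extract an NLTK corpus archive that the downloader left zipped.
corpora_dir = os.path.expanduser('~/nltk_data/corpora')  # example location
archive = os.path.join(corpora_dir, 'wordnet.zip')
if os.path.exists(archive):
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(corpora_dir)  # yields corpora/wordnet/ next to the zip
```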