ModuleNotFoundError: No module named 'scrapy_user_agents'

Question:

I tried to use scrapy_user_agents with scrapy-proxy-pool.

I added these lines in my settings.py:

    DOWNLOADER_MIDDLEWARES = {
        'scrapy_proxy_pool.middlewares.ProxyPoolMiddleware': 610,
        'scrapy_proxy_pool.middlewares.BanDetectionMiddleware': 620,
        'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware': None,
        'scrapy_user_agents.middlewares.RandomUserAgentMiddleware': 700,
    }

When I run my spider, I get this error message:

    ModuleNotFoundError: No module named 'scrapy_user_agents'

I removed the proxy lines from the middleware settings, but I still get the same error message.

Here is the complete error log:

2019-08-13 16:05:28 [scrapy.utils.log] INFO: Scrapy 1.7.3 started (bot: scraping_entreprises)
2019-08-13 16:05:28 [scrapy.utils.log] INFO: Versions: lxml 4.4.1.0, libxml2 2.9.5, cssselect 1.1.0, parsel 1.5.2, w3lib 1.21.0, Twisted 19.7.0, Python 3.7.4 (tags/v3.7.4:e09359112e, Jul  8 2019, 20:34:20) [MSC v.1916 64 bit (AMD64)], pyOpenSSL 19.0.0 (OpenSSL 1.1.1c 28 May 2019), cryptography 2.7, Platform Windows-10-10.0.17134-SP0
2019-08-13 16:05:28 [scrapy.crawler] INFO: Overridden settings: {'BOT_NAME': 'scraping_entreprises', 'NEWSPIDER_MODULE': 'scraping_entreprises.spiders', 'ROBOTSTXT_OBEY': True, 'SPIDER_MODULES': ['scraping_entreprises.spiders']}
2019-08-13 16:05:28 [scrapy.extensions.telnet] INFO: Telnet Password: 0a7932c1a3ce188f
2019-08-13 16:05:28 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
 'scrapy.extensions.telnet.TelnetConsole',
 'scrapy.extensions.logstats.LogStats']
Unhandled error in Deferred:
2019-08-13 16:05:29 [twisted] CRITICAL: Unhandled error in Deferred:
Traceback (most recent call last):
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 184, in crawl
    return self._crawl(crawler, *args, **kwargs)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 188, in _crawl
    d = crawler.crawl(*args, **kwargs)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\twisted\internet\defer.py", line 1613, in unwindGenerator
    return _cancellableInlineCallbacks(gen)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\twisted\internet\defer.py", line 1529, in _cancellableInlineCallbacks
    _inlineCallbacks(None, g, status)
--- <exception caught here> ---
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\twisted\internet\defer.py", line 1418, in _inlineCallbacks
    result = g.send(result)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 86, in crawl
    self.engine = self._create_engine()
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 111, in _create_engine
    return ExecutionEngine(self, lambda _: self.stop())
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\core\engine.py", line 69, in __init__
    self.downloader = downloader_cls(crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\core\downloader\__init__.py", line 86, in __init__
    self.middleware = DownloaderMiddlewareManager.from_crawler(crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\middleware.py", line 53, in from_crawler
    return cls.from_settings(crawler.settings, crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\middleware.py", line 34, in from_settings
    mwcls = load_object(clspath)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\utils\misc.py", line 46, in load_object
    mod = import_module(module)
  File "C:\Users\Nino\AppData\Local\Programs\Python\Python37\lib\importlib\__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
  File "<frozen importlib._bootstrap>", line 983, in _find_and_load
  File "<frozen importlib._bootstrap>", line 953, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
  File "<frozen importlib._bootstrap>", line 983, in _find_and_load
  File "<frozen importlib._bootstrap>", line 965, in _find_and_load_unlocked
builtins.ModuleNotFoundError: No module named 'scrapy_user_agents'
2019-08-13 16:05:29 [twisted] CRITICAL:
Traceback (most recent call last):
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\twisted\internet\defer.py", line 1418, in _inlineCallbacks
    result = g.send(result)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 86, in crawl
    self.engine = self._create_engine()
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\crawler.py", line 111, in _create_engine
    return ExecutionEngine(self, lambda _: self.stop())
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\core\engine.py", line 69, in __init__
    self.downloader = downloader_cls(crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\core\downloader\__init__.py", line 86, in __init__
    self.middleware = DownloaderMiddlewareManager.from_crawler(crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\middleware.py", line 53, in from_crawler
    return cls.from_settings(crawler.settings, crawler)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\middleware.py", line 34, in from_settings
    mwcls = load_object(clspath)
  File "c:\users\nino\pycharmprojects\scraping\venv\lib\site-packages\scrapy\utils\misc.py", line 46, in load_object
    mod = import_module(module)
  File "C:\Users\Nino\AppData\Local\Programs\Python\Python37\lib\importlib\__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
  File "<frozen importlib._bootstrap>", line 983, in _find_and_load
  File "<frozen importlib._bootstrap>", line 953, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
  File "<frozen importlib._bootstrap>", line 983, in _find_and_load
  File "<frozen importlib._bootstrap>", line 965, in _find_and_load_unlocked
ModuleNotFoundError: No module named 'scrapy_user_agents'

Asked By: Gauthier Buttez

Answers:

Try uninstalling and reinstalling the module to make sure it's installed for your version of Python.

    pip uninstall scrapy-user-agents
    pip install scrapy-user-agents
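
Before reinstalling, it may help to check which interpreter is in use and whether it can actually see the package. The following is a minimal sketch using only the standard library; the helper name `describe_module` is made up for illustration, and the script has to be run with the same `python` that runs Scrapy for the result to mean anything.

```python
import importlib.util
import sys


def describe_module(name):
    """Return the file a top-level module would be imported from,
    or None if this interpreter cannot find it."""
    spec = importlib.util.find_spec(name)
    return None if spec is None else spec.origin


if __name__ == "__main__":
    # The interpreter executing this script; pip must install into this one.
    print("interpreter:", sys.executable)
    for module in ("scrapy", "scrapy_user_agents"):
        print(module, "->", describe_module(module) or "NOT FOUND")
```

If `scrapy` is found but `scrapy_user_agents` is not, the package was probably installed into a different environment. Installing with `python -m pip install scrapy-user-agents`, using that same `python`, is one way to target the right one.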

Answered By: Rumi

This has not worked for me. Scrapy works with a fixed USER_AGENT (googlebot), but not with scrapy-user-agents. I installed scrapy-user-agents with pip (it appears in the PyCharm project interpreter), commented out USER_AGENT, and pasted the required lines:

DOWNLOADER_MIDDLEWARES = {
    'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware': None,
    'scrapy_user_agents.middlewares.RandomUserAgentMiddleware': 400,
}

However, I still receive this error message when trying to scrape quotes.toscrape.com: ModuleNotFoundError: No module named 'scrapy_user_agents'.

Here is the full error message:

C:\Users\ariel\PycharmProjects\mmi_spiders\mmi_spiders> scrapy crawl quotes
2023-03-08 09:14:50 [scrapy.utils.log] INFO: Scrapy 2.6.1 started (bot: mmi_spiders)
2023-03-08 09:14:50 [scrapy.utils.log] INFO: Versions: lxml 4.8.0.0, libxml2 2.9.12, cssselect 1.1.0, parsel 1.6.0, w3lib 1.21.0, Twisted 22.2.0, Python
 3.9.12 (main, Apr  4 2022, 05:22:27) [MSC v.1916 64 bit (AMD64)], pyOpenSSL 21.0.0 (OpenSSL 1.1.1n  15 Mar 2022), cryptography 3.4.8, Platform Windows-10-10.0.22621-SP0
2023-03-08 09:14:50 [scrapy.crawler] INFO: Overridden settings:
{'AUTOTHROTTLE_ENABLED': True,
 'BOT_NAME': 'mmi_spiders',
 'NEWSPIDER_MODULE': 'mmi_spiders.spiders',
 'ROBOTSTXT_OBEY': True,
 'SPIDER_MODULES': ['mmi_spiders.spiders']}
2023-03-08 09:14:50 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.selectreactor.SelectReactor
2023-03-08 09:14:50 [scrapy.extensions.telnet] INFO: Telnet Password: 5f2e0c97136c31d5
2023-03-08 09:14:50 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
 'scrapy.extensions.telnet.TelnetConsole',
 'scrapy.extensions.logstats.LogStats',
 'scrapy.extensions.throttle.AutoThrottle']
Unhandled error in Deferred:
2023-03-08 09:14:51 [twisted] CRITICAL: Unhandled error in Deferred:

Traceback (most recent call last):
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 206, in crawl
    return self._crawl(crawler, *args, **kwargs)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 210, in _crawl
    d = crawler.crawl(*args, **kwargs)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\twisted\internet\defer.py", line 1905, in unwindGenerator
    return _cancellableInlineCallbacks(gen)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\twisted\internet\defer.py", line 1815, in _cancellableInlineCallbacks
    _inlineCallbacks(None, gen, status)
--- <exception caught here> ---
  File "C:\Users\ariel\Anaconda3\lib\site-packages\twisted\internet\defer.py", line 1660, in _inlineCallbacks
    result = current_context.run(gen.send, result)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 102, in crawl
    self.engine = self._create_engine()
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 116, in _create_engine
    return ExecutionEngine(self, lambda _: self.stop())
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\core\engine.py", line 83, in __init__
    self.downloader = downloader_cls(crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\core\downloader\__init__.py", line 83, in __init__
    self.middleware = DownloaderMiddlewareManager.from_crawler(crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\middleware.py", line 59, in from_crawler
    return cls.from_settings(crawler.settings, crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\middleware.py", line 40, in from_settings
    mwcls = load_object(clspath)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\utils\misc.py", line 61, in load_object
    mod = import_module(module)
  File "C:\Users\ariel\Anaconda3\lib\importlib\__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 972, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 228, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 984, in _find_and_load_unlocked
builtins.ModuleNotFoundError: No module named 'scrapy_user_agents'

2023-03-08 09:14:51 [twisted] CRITICAL: 
Traceback (most recent call last):
  File "C:\Users\ariel\Anaconda3\lib\site-packages\twisted\internet\defer.py", line 1660, in _inlineCallbacks
    result = current_context.run(gen.send, result)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 102, in crawl
    self.engine = self._create_engine()
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 116, in _create_engine
    return ExecutionEngine(self, lambda _: self.stop())
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\core\engine.py", line 83, in __init__
    self.downloader = downloader_cls(crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\core\downloader\__init__.py", line 83, in __init__
    self.middleware = DownloaderMiddlewareManager.from_crawler(crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\middleware.py", line 59, in from_crawler
    return cls.from_settings(crawler.settings, crawler)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\middleware.py", line 40, in from_settings
    mwcls = load_object(clspath)
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\utils\misc.py", line 61, in load_object
    mod = import_module(module)
  File "C:\Users\ariel\Anaconda3\lib\importlib\__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 972, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 228, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 984, in _find_and_load_unlocked
ModuleNotFoundError: No module named 'scrapy_user_agents'
PS C:\Users\ariel\PycharmProjects\mmi_spiders\mmi_spiders>

Answered By: Ariel Paz
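
One detail in the log above may be worth checking: the `File` lines of the traceback point into an `Anaconda3` install, while the package is described as installed through the PyCharm project interpreter, so these could be two different environments. The sketch below pulls the environment directory out of traceback lines so the two can be compared. The function name `environment_roots` and the sample lines are illustrative only, and the sample assumes the paths use ordinary Windows backslashes.

```python
import re

# Captures the path in lines shaped like:  File "<path>", line N, in func
FILE_LINE = re.compile(r'File "([^"]+)", line \d+')


def environment_roots(traceback_text):
    """Collect the directories that contain `site-packages` in a traceback."""
    roots = set()
    for path in FILE_LINE.findall(traceback_text):
        # Normalise Windows separators so one marker works for both styles.
        normalised = path.replace("\\", "/")
        marker = "/site-packages/"
        if marker in normalised:
            roots.add(normalised.split(marker)[0])
    return sorted(roots)


# Sample lines modelled on the log above (backslashes are an assumption).
sample = r'''
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\crawler.py", line 206, in crawl
  File "C:\Users\ariel\Anaconda3\lib\site-packages\scrapy\utils\misc.py", line 61, in load_object
'''
print(environment_roots(sample))  # ['C:/Users/ariel/Anaconda3/lib']
```

If the directory printed here differs from the one where pip reports `scrapy-user-agents` (for example via `pip show scrapy-user-agents`), the `scrapy` command is likely running from an environment that does not have the package.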