您好,登錄后才能下訂單哦!
如何在win7 x64系統中安裝Scrapy?相信很多沒有經驗的人對此束手無策,為此本文總結了問題出現的原因和解決方法,通過這篇文章希望你能解決這個問題。
首先安裝python,裝完以后就可以安裝scrapy了,推薦使用pip方式安裝,因為scrapy需要調用很多額外的庫,pip會全部幫你安裝好,不需要你在到處翻找了。
pip在python安裝完后就已經有了,不需要額外安裝,下面只要按照scrapy官網推薦的方法在命令提示符中輸入pip installscrapy
圖1
裝完以后可以敲入命令pip list看看已安裝的庫(圖2),出來很多啊,pip真是好東西。
圖2
現在試下看看建個爬蟲項目,按照說明文檔鍵入命令scrapy startproject tutorial,目錄已經出來(圖3),看來是沒問題了。但為了驗證是否安裝成功,還得跑一下看看,第一次創建項目的時候,系統會提示可以跑個例子看看(圖4)。按照提示鍵入命令
圖3
圖4
scrapy genspider example example.com創建一個爬蟲,再鍵入命令scrapy crawl example
運行爬蟲,結果如下(圖5),報錯了,貌似是缺少win32api,立即上網下了一個(http://sourceforge.net/projects/pywin32/files/pywin32/Build%20219/),
圖5
下的時候注意對應的python版本。win32api裝好以后再運行一次爬蟲(圖6),這次成功了,應該是沒問題了。
圖6
總結一下,其實剛開始網上找資料的時候看到上面寫的要先裝這個庫那個庫的時候心中很忐忑,結果發現不是很復雜,大多數問題pip都給解決了,剩下的就是具體問題具體研究,不過也沒碰到很復雜解決不了的問題。另外吐下槽就是網上的教程互抄的太厲害,看著一搜一堆,其實多數都大同小異,真正有價值的沒幾個,沒大腿抱就是辛苦呀。
最后說一下,scrapy目前還不支持python3.x版本,我用的是python2.7,如果你碰到莫名其妙的問題時請先看看自己有沒有裝錯python版本。
下面是其他網友補充的文章
環境
Windows7 64位
Python2.7.6 64位
Python的安裝:
打開http://www.python.org/getit/releases/2.7.6/頁面,下載Python-2.7.6.amd64.msi 進行安裝,安裝完成后,需要配置環境變量,環境變量的配置可以參考該文章
測試python是否安裝成功,如果python成功安裝并且配置好環境變量,那么在cmd中輸入python,就能得到python版本的詳細信息(如32位或64位)
C:\Users\Administrator>python Python 2.7.6 (default, Nov 10 2013, 19:24:24) [MSC v.1500 64 bit (AMD64)] on win 32
easy_install的安裝
保存ez_setup.py至本地,如D盤(如果失效了,可以參考下https://www.jb51.net/article/151027.htm
#!/usr/bin/env python """ Setuptools bootstrapping installer. Maintained at https://github.com/pypa/setuptools/tree/bootstrap. Run this script to install or upgrade setuptools. This method is DEPRECATED. Check https://github.com/pypa/setuptools/issues/581 for more details. """ import os import shutil import sys import tempfile import zipfile import optparse import subprocess import platform import textwrap import contextlib from distutils import log try: from urllib.request import urlopen except ImportError: from urllib2 import urlopen try: from site import USER_SITE except ImportError: USER_SITE = None # 33.1.1 is the last version that supports setuptools self upgrade/installation. DEFAULT_VERSION = "33.1.1" DEFAULT_URL = "https://pypi.io/packages/source/s/setuptools/" DEFAULT_SAVE_DIR = os.curdir DEFAULT_DEPRECATION_MESSAGE = "ez_setup.py is deprecated and when using it setuptools will be pinned to {0} since it's the last version that supports setuptools self upgrade/installation, check https://github.com/pypa/setuptools/issues/581 for more info; use pip to install setuptools" MEANINGFUL_INVALID_ZIP_ERR_MSG = 'Maybe {0} is corrupted, delete it and try again.' log.warn(DEFAULT_DEPRECATION_MESSAGE.format(DEFAULT_VERSION)) def _python_cmd(*args): """ Execute a command. Return True if the command succeeded. """ args = (sys.executable,) + args return subprocess.call(args) == 0 def _install(archive_filename, install_args=()): """Install Setuptools.""" with archive_context(archive_filename): # installing log.warn('Installing Setuptools') if not _python_cmd('setup.py', 'install', *install_args): log.warn('Something went wrong during the installation.') log.warn('See the error message above.') # exitcode will be 2 return 2 def _build_egg(egg, archive_filename, to_dir): """Build Setuptools egg.""" with archive_context(archive_filename): # building an egg log.warn('Building a Setuptools egg in %s', to_dir) _python_cmd('setup.py', '-q', 'bdist_egg', '--dist-dir', to_dir) # returning the result log.warn(egg) if not os.path.exists(egg): raise IOError('Could not build the egg.') class ContextualZipFile(zipfile.ZipFile): """Supplement ZipFile class to support context manager for Python 2.6.""" def __enter__(self): return self def __exit__(self, type, value, traceback): self.close() def __new__(cls, *args, **kwargs): """Construct a ZipFile or ContextualZipFile as appropriate.""" if hasattr(zipfile.ZipFile, '__exit__'): return zipfile.ZipFile(*args, **kwargs) return super(ContextualZipFile, cls).__new__(cls) @contextlib.contextmanager def archive_context(filename): """ Unzip filename to a temporary directory, set to the cwd. The unzipped target is cleaned up after. """ tmpdir = tempfile.mkdtemp() log.warn('Extracting in %s', tmpdir) old_wd = os.getcwd() try: os.chdir(tmpdir) try: with ContextualZipFile(filename) as archive: archive.extractall() except zipfile.BadZipfile as err: if not err.args: err.args = ('', ) err.args = err.args + ( MEANINGFUL_INVALID_ZIP_ERR_MSG.format(filename), ) raise # going in the directory subdir = os.path.join(tmpdir, os.listdir(tmpdir)[0]) os.chdir(subdir) log.warn('Now working in %s', subdir) yield finally: os.chdir(old_wd) shutil.rmtree(tmpdir) def _do_download(version, download_base, to_dir, download_delay): """Download Setuptools.""" py_desig = 'py{sys.version_info[0]}.{sys.version_info[1]}'.format(sys=sys) tp = 'setuptools-{version}-{py_desig}.egg' egg = os.path.join(to_dir, tp.format(**locals())) if not os.path.exists(egg): archive = download_setuptools(version, download_base, to_dir, download_delay) _build_egg(egg, archive, to_dir) sys.path.insert(0, egg) # Remove previously-imported pkg_resources if present (see # https://bitbucket.org/pypa/setuptools/pull-request/7/ for details). if 'pkg_resources' in sys.modules: _unload_pkg_resources() import setuptools setuptools.bootstrap_install_from = egg def use_setuptools( version=DEFAULT_VERSION, download_base=DEFAULT_URL, to_dir=DEFAULT_SAVE_DIR, download_delay=15): """ Ensure that a setuptools version is installed. Return None. Raise SystemExit if the requested version or later cannot be installed. """ to_dir = os.path.abspath(to_dir) # prior to importing, capture the module state for # representative modules. rep_modules = 'pkg_resources', 'setuptools' imported = set(sys.modules).intersection(rep_modules) try: import pkg_resources pkg_resources.require("setuptools>=" + version) # a suitable version is already installed return except ImportError: # pkg_resources not available; setuptools is not installed; download pass except pkg_resources.DistributionNotFound: # no version of setuptools was found; allow download pass except pkg_resources.VersionConflict as VC_err: if imported: _conflict_bail(VC_err, version) # otherwise, unload pkg_resources to allow the downloaded version to # take precedence. del pkg_resources _unload_pkg_resources() return _do_download(version, download_base, to_dir, download_delay) def _conflict_bail(VC_err, version): """ Setuptools was imported prior to invocation, so it is unsafe to unload it. Bail out. """ conflict_tmpl = textwrap.dedent(""" The required version of setuptools (>={version}) is not available, and can't be installed while this script is running. Please install a more recent version first, using 'easy_install -U setuptools'. (Currently using {VC_err.args[0]!r}) """) msg = conflict_tmpl.format(**locals()) sys.stderr.write(msg) sys.exit(2) def _unload_pkg_resources(): sys.meta_path = [ importer for importer in sys.meta_path if importer.__class__.__module__ != 'pkg_resources.extern' ] del_modules = [ name for name in sys.modules if name.startswith('pkg_resources') ] for mod_name in del_modules: del sys.modules[mod_name] def _clean_check(cmd, target): """ Run the command to download target. If the command fails, clean up before re-raising the error. """ try: subprocess.check_call(cmd) except subprocess.CalledProcessError: if os.access(target, os.F_OK): os.unlink(target) raise def download_file_powershell(url, target): """ Download the file at url to target using Powershell. Powershell will validate trust. Raise an exception if the command cannot complete. """ target = os.path.abspath(target) ps_cmd = ( "[System.Net.WebRequest]::DefaultWebProxy.Credentials = " "[System.Net.CredentialCache]::DefaultCredentials; " '(new-object System.Net.WebClient).DownloadFile("%(url)s", "%(target)s")' % locals() ) cmd = [ 'powershell', '-Command', ps_cmd, ] _clean_check(cmd, target) def has_powershell(): """Determine if Powershell is available.""" if platform.system() != 'Windows': return False cmd = ['powershell', '-Command', 'echo test'] with open(os.path.devnull, 'wb') as devnull: try: subprocess.check_call(cmd, stdout=devnull, stderr=devnull) except Exception: return False return True download_file_powershell.viable = has_powershell def download_file_curl(url, target): cmd = ['curl', url, '--location', '--silent', '--output', target] _clean_check(cmd, target) def has_curl(): cmd = ['curl', '--version'] with open(os.path.devnull, 'wb') as devnull: try: subprocess.check_call(cmd, stdout=devnull, stderr=devnull) except Exception: return False return True download_file_curl.viable = has_curl def download_file_wget(url, target): cmd = ['wget', url, '--quiet', '--output-document', target] _clean_check(cmd, target) def has_wget(): cmd = ['wget', '--version'] with open(os.path.devnull, 'wb') as devnull: try: subprocess.check_call(cmd, stdout=devnull, stderr=devnull) except Exception: return False return True download_file_wget.viable = has_wget def download_file_insecure(url, target): """Use Python to download the file, without connection authentication.""" src = urlopen(url) try: # Read all the data in one block. data = src.read() finally: src.close() # Write all the data in one block to avoid creating a partial file. with open(target, "wb") as dst: dst.write(data) download_file_insecure.viable = lambda: True def get_best_downloader(): downloaders = ( download_file_powershell, download_file_curl, download_file_wget, download_file_insecure, ) viable_downloaders = (dl for dl in downloaders if dl.viable()) return next(viable_downloaders, None) def download_setuptools( version=DEFAULT_VERSION, download_base=DEFAULT_URL, to_dir=DEFAULT_SAVE_DIR, delay=15, downloader_factory=get_best_downloader): """ Download setuptools from a specified location and return its filename. `version` should be a valid setuptools version number that is available as an sdist for download under the `download_base` URL (which should end with a '/'). `to_dir` is the directory where the egg will be downloaded. `delay` is the number of seconds to pause before an actual download attempt. ``downloader_factory`` should be a function taking no arguments and returning a function for downloading a URL to a target. """ # making sure we use the absolute path to_dir = os.path.abspath(to_dir) zip_name = "setuptools-%s.zip" % version url = download_base + zip_name saveto = os.path.join(to_dir, zip_name) if not os.path.exists(saveto): # Avoid repeated downloads log.warn("Downloading %s", url) downloader = downloader_factory() downloader(url, saveto) return os.path.realpath(saveto) def _build_install_args(options): """ Build the arguments to 'python setup.py install' on the setuptools package. Returns list of command line arguments. """ return ['--user'] if options.user_install else [] def _parse_args(): """Parse the command line for options.""" parser = optparse.OptionParser() parser.add_option( '--user', dest='user_install', action='store_true', default=False, help='install in user site package') parser.add_option( '--download-base', dest='download_base', metavar="URL", default=DEFAULT_URL, help='alternative URL from where to download the setuptools package') parser.add_option( '--insecure', dest='downloader_factory', action='store_const', const=lambda: download_file_insecure, default=get_best_downloader, help='Use internal, non-validating downloader' ) parser.add_option( '--version', help="Specify which version to download", default=DEFAULT_VERSION, ) parser.add_option( '--to-dir', help="Directory to save (and re-use) package", default=DEFAULT_SAVE_DIR, ) options, args = parser.parse_args() # positional arguments are ignored return options def _download_args(options): """Return args for download_setuptools function from cmdline args.""" return dict( version=options.version, download_base=options.download_base, downloader_factory=options.downloader_factory, to_dir=options.to_dir, ) def main(): """Install or upgrade setuptools and EasyInstall.""" options = _parse_args() archive = download_setuptools(**_download_args(options)) return _install(archive, _build_install_args(options)) if __name__ == '__main__': sys.exit(main())
在cmd中運行:
d:\>python ez_setup.py
進行SetupTools的安裝
在運行的時候會發生一個錯誤,該錯誤為"ascii codec can't decode byte 0xe8 in position 0:ordinal not in range(128)",大意為ascii編碼不能解析byte 0xe8。
解決方法:找到并打開python根目錄/Lib/mimetypes.py文件,在import urllib后,添加代碼:
reload(sys) sys.setdefaultencoding('gbk')
把默認編碼方式改為gbk(網上有寫用utf8的,在這個腳本中是無效的,需要改成gbk格式)。重新執行python ez_setup.py,如果出現刷屏的安裝信息,則說明安裝成功了。此時,在python目錄下多了一個Script文件夾,easy_install就在里面
Scrapy依賴項的安裝
Scrapy的依賴項
安裝lxml-3.2.4.win32-py2.7.exe(64位系統需要安裝lxml-3.2.4.win-amd64-py2.7.exe)
安裝pywin32-218.win32-py2.7.exe(64位系統需要安裝pywin32-218.win-amd64-py2.7.exe)
安裝Twisted-13.2.0.win32-py2.7.exe(64位系統需要安裝Twisted-13.2.0.win-amd64-py2.7.exe)
安裝pyOpenSSL-0.13.1.win32-py2.7.exe(64位系統需要安裝pyOpenSSL-0.13.1.win-amd64-py2.7.exe)
將zope.interface-4.0.5-py2.7-win32.egg拷貝到C:\Python27\Scripts目錄下,執行$ easy_install.exe zope.interface-4.0.5-py2.7-win32.egg
驗證scrapy依賴項是否安裝成功的方法:
cmd執行$ python進入python控制臺
執行import lxml,如果沒報錯,則說明lxml安裝成功
執行import twisted,如果沒報錯,則說明twisted安裝成功
執行import OpenSSL,如果沒報錯,則說明OpenSSL安裝成功
執行import zope.interface,如果沒報錯,則說明zope.interface安裝成功
如果安裝成功,那么在cmd中執行& python,然后執行import lxml,如果沒有報錯,則說明lxml安裝成功。
安裝Scrapy
方法1: 控制臺輸入:easy_install scrapy
方法2:解壓縮Scrapy-0.22.2.tar.gz,在其目錄下執行$ python setup.py install進行Scrapy的安裝。
檢查Scrapy是否安裝成功的方法:可以在cmd控制臺執行 $ scrapy ,如果沒有報錯,說明安裝成功。
看完上述內容,你們掌握如何在win7 x64系統中安裝Scrapy的方法了嗎?如果還想學到更多技能或想了解更多相關內容,歡迎關注億速云行業資訊頻道,感謝各位的閱讀!
免責聲明:本站發布的內容(圖片、視頻和文字)以原創、轉載和分享為主,文章觀點不代表本網站立場,如果涉及侵權請聯系站長郵箱:is@yisu.com進行舉報,并提供相關證據,一經查實,將立刻刪除涉嫌侵權內容。