国产探花免费观看_亚洲丰满少妇自慰呻吟_97日韩有码在线_资源在线日韩欧美_一区二区精品毛片,辰东完美世界有声小说,欢乐颂第一季,yy玄幻小说排行榜完本

首頁 > 編程 > Python > 正文

Python實現在線程里運行scrapy的方法

2019-11-25 17:49:17
字體:
來源:轉載
供稿:網友

本文實例講述了Python實現在線程里運行scrapy的方法。分享給大家供大家參考。具體如下:

如果你希望在一個寫好的程序里調用scrapy,就可以通過下面的代碼,讓scrapy運行在一個線程里。

"""Code to run Scrapy crawler in a thread - works on Scrapy 0.8"""import threading, Queuefrom twisted.internet import reactorfrom scrapy.xlib.pydispatch import dispatcherfrom scrapy.core.manager import scrapymanagerfrom scrapy.core.engine import scrapyenginefrom scrapy.core import signalsclass CrawlerThread(threading.Thread):  def __init__(self):    threading.Thread.__init__(self)    self.running = False  def run(self):    self.running = True    scrapymanager.configure(control_reactor=False)    scrapymanager.start()    reactor.run(installSignalHandlers=False)  def crawl(self, *args):    if not self.running:      raise RuntimeError("CrawlerThread not running")    self._call_and_block_until_signal(signals.spider_closed, /      scrapymanager.crawl, *args)  def stop(self):    reactor.callFromThread(scrapyengine.stop)  def _call_and_block_until_signal(self, signal, f, *a, **kw):    q = Queue.Queue()    def unblock():      q.put(None)    dispatcher.connect(unblock, signal=signal)    reactor.callFromThread(f, *a, **kw)    q.get()# Usage example below: import osos.environ.setdefault('SCRAPY_SETTINGS_MODULE', 'myproject.settings')from scrapy.xlib.pydispatch import dispatcherfrom scrapy.core import signalsfrom scrapy.conf import settingsfrom scrapy.crawler import CrawlerThreadsettings.overrides['LOG_ENABLED'] = False # avoid log noisedef item_passed(item):  print "Just scraped item:", itemdispatcher.connect(item_passed, signal=signals.item_passed)crawler = CrawlerThread()print "Starting crawler thread..."crawler.start()print "Crawling somedomain.com...."crawler.crawl('somedomain.com) # blocking callprint "Crawling anotherdomain.com..."crawler.crawl('anotherdomain.com') # blocking callprint "Stopping crawler thread..."crawler.stop()

希望本文所述對大家的Python程序設計有所幫助。

發表評論 共有條評論
用戶名: 密碼:
驗證碼: 匿名發表
主站蜘蛛池模板: 衡阳县| 周口市| 永平县| 右玉县| 新田县| 黔东| 客服| 宝清县| 商南县| 辛集市| 岳阳县| 洛南县| 河北省| 健康| 察隅县| 玉溪市| 麟游县| 竹溪县| 夏河县| 通江县| 泰来县| 志丹县| 确山县| 双柏县| 寻甸| 油尖旺区| 太仆寺旗| 庄浪县| 二连浩特市| 嘉兴市| 昌邑市| 遂昌县| 卓资县| 静乐县| 六安市| 杂多县| 延安市| 基隆市| 墨玉县| 基隆市| 石渠县|