楼主: trans
2305 0

[问答] python爬虫运行过程报错,求救 [推广有奖]

  • 1关注
  • 0粉丝

本科生

38%

还不是VIP/贵宾

-

威望
0
论坛币
205 个
通用积分
1.0059
学术水平
2 点
热心指数
0 点
信用等级
0 点
经验
679 点
帖子
48
精华
0
在线时间
75 小时
注册时间
2008-3-12
最后登录
2020-12-15

楼主
trans 发表于 2017-6-8 20:29:16 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

d:\Python\ajk>scrapy crawl fangjia
2017-06-08 20:17:26 [scrapy] INFO: Scrapy 1.0.3 started (bot: ajk)
2017-06-08 20:17:26 [scrapy] INFO: Optional features available: ssl, http11
2017-06-08 20:17:26 [scrapy] INFO: Overridden settings: {'NEWSPIDER_MODULE': 'ajk.spiders', 'SPIDER_MODULES': ['ajk.spiders'], 'BOT_NAME': 'ajk'}
2017-06-08 20:17:26 [scrapy] INFO: Enabled extensions: CloseSpider, TelnetConsole, LogStats, CoreStats, SpiderState
2017-06-08 20:17:27 [scrapy] INFO: Enabled downloader middlewares: HttpAuthMiddleware, DownloadTimeoutMiddleware, UserAgentMiddleware, RetryMiddleware, DefaultHeadersMiddleware, MetaRefreshMiddleware, HttpCompressionMiddleware, RedirectMiddleware, CookiesMiddleware, ChunkedTransferMiddleware, DownloaderStats
2017-06-08 20:17:27 [scrapy] INFO: Enabled spider middlewares: HttpErrorMiddleware, OffsiteMiddleware, RefererMiddleware, UrlLengthMiddleware, DepthMiddleware
2017-06-08 20:17:27 [scrapy] INFO: Enabled item pipelines: AjkPipeline
2017-06-08 20:17:27 [scrapy] INFO: Spider opened
2017-06-08 20:17:27 [scrapy] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2017-06-08 20:17:27 [scrapy] DEBUG: Telnet console listening on 127.0.0.1:6023
2017-06-08 20:17:27 [scrapy] DEBUG: Redirecting (301) to <GET http://hangzhou.anjuke.com/community/> from <GET http://hangzhou.anjuke.com/community>[/url]
2017-06-08 20:17:27 [scrapy] DEBUG: Redirecting (302) to <GET http://hangzhou.anjuke.com/404/?from=antispam> from <GET http://hangzhou.anjuke.com/community/>
2017-06-08 20:17:27 [scrapy] DEBUG: Crawled (404) <GET http://hangzhou.anjuke.com/404/?from=antispam> (referer: None)
2017-06-08 20:17:27 [scrapy] DEBUG: Ignoring response <404 [url]http://hangzhou.anjuke.com/404/?from=antispam>: HTTP status code is not handled or not allowed
2017-06-08 20:17:27 [scrapy] INFO: Closing spider (finished)
2017-06-08 20:17:27 [scrapy] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 856,
'downloader/request_count': 3,
'downloader/request_method_count/GET': 3,
'downloader/response_bytes': 10394,
'downloader/response_count': 3,
'downloader/response_status_count/301': 1,
'downloader/response_status_count/302': 1,
'downloader/response_status_count/404': 1,
'finish_reason': 'finished',
'finish_time': datetime.datetime(2017, 6, 8, 12, 17, 27, 808000),
'log_count/DEBUG': 5,
'log_count/INFO': 7,
'response_received_count': 1,
'scheduler/dequeued': 3,
'scheduler/dequeued/memory': 3,
'scheduler/enqueued': 3,
'scheduler/enqueued/memory': 3,
'start_time': datetime.datetime(2017, 6, 8, 12, 17, 27, 415000)}
2017-06-08 20:17:27 [scrapy] INFO: Spider closed (finished)


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:python爬虫 python compression downloader Extensions python

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-5 15:57