Python Web Crawler -- Hands-On Project -- Scraping Honor of Kings (王者荣耀) Hero Information with the Scrapy Framework

Author: 运维做开发 | 2024-08-14 22:26:55

wzry.py

import scrapy

from LearnScrapy.items import HeroItem


class WzrySpider(scrapy.Spider):
    name = 'wzry'
    allowed_domains = ['pvp.qq.com']
    start_urls = ['https://pvp.qq.com/web201605/herolist.shtml']

    hero_detail_base_url = "https://pvp.qq.com/web201605/"

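    def parse(self, response):
        # Collect the link to each hero's detail page from the hero list.
        hero_list = response.xpath("//div[contains(@class, 'herolist-content')]/ul[contains(@class, 'herolist')]/li/a/@href").extract()

        # Equivalent ways to build the detail-page requests one link at a time:
        # for hero_detail in hero_list:
            # yield scrapy.Request(url=self.hero_detail_base_url + hero_detail, callback=self.parse_hero_detail, meta={"msg": "ok"})
            # yield scrapy.Request(url=response.urljoin(hero_detail), callback=self.parse_hero_detail, meta={"msg": "ok"})
            # yield response.follow(url=hero_detail, callback=self.parse_hero_detail, meta={"msg": "ok"})

        # follow_all resolves every link against the current page and builds one request per link.
        # parse_hero_detail (the detail-page callback) is not included in this excerpt.
        requests = response.follow_all(urls=hero_list, callback=self.parse_hero_detail, meta={"msg": "ok"})
        # Assumed continuation: hand the requests back to Scrapy for scheduling.
        yield from requests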
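The commented-out alternatives in parse build the detail-page URL in two ways: by concatenating hero_detail_base_url with the href, and by resolving the href against the current page with response.urljoin. The sketch below compares the two using only the standard library. urllib.parse.urljoin stands in for response.urljoin here, as both resolve a link against a base URL. The two href values are hypothetical examples, not links taken from the live page.

```python
from urllib.parse import urljoin

# URL of the hero list page, taken from start_urls in the spider.
LIST_PAGE_URL = "https://pvp.qq.com/web201605/herolist.shtml"
# Base used for plain string concatenation, as in hero_detail_base_url.
BASE_URL = "https://pvp.qq.com/web201605/"

# Hypothetical hrefs for illustration only.
relative_href = "herodetail/105.shtml"            # relative to the list page
rooted_href = "/web201605/herodetail/105.shtml"   # relative to the site root


def join_by_concat(href):
    """Build the URL by string concatenation; only correct for page-relative hrefs."""
    return BASE_URL + href


def join_by_urljoin(href):
    """Resolve the href against the list page URL using standard URL resolution rules."""
    return urljoin(LIST_PAGE_URL, href)


if __name__ == "__main__":
    for href in (relative_href, rooted_href):
        print("concat :", join_by_concat(href))
        print("urljoin:", join_by_urljoin(href))
```

For the page-relative href both functions return the same URL. For the root-relative one, concatenation yields a doubled path while urljoin still resolves to the intended page, which is one reason to prefer response.urljoin, response.follow, or response.follow_all over manual string building.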

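Disclaimer: This article was contributed by a user and does not represent the position of [wpsshop博客]. Copyright belongs to the original author, and this site assumes no legal liability for the content. If you find infringing content, please contact us. When reposting, please cite the source: https://www.wpsshop.cn/w/运维做开发/article/detail/980914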