Want to know who owns a website? Web crawler tools: using the WHOIS protocol

  Published:       2017-06-18
  Last modified:   2020-05-02

More posts: http://www.catbro.cn

I. Introduction

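  • When crawling a website, we may want to know who owns it. For sites run by companies such as Google, it is best to throttle the download rate more conservatively; otherwise the crawler can easily get blocked. To find a site's owner, we can use the WHOIS protocol to look up who registered the domain. Python has a wrapper library for this protocol, documented at https://pypi.python.org/pypi/python-whois, and it can be installed with pip.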
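For context, the WHOIS protocol itself is very small: a client opens a TCP connection to port 43, sends the query followed by CRLF, and reads text until the server closes the connection. The following is a minimal sketch of that exchange, not the way the library above is implemented. The function names `build_query` and `raw_whois` are hypothetical, and the default server `whois.verisign-grs.com` is an assumption that only suits `.com` and `.net` domains.

```python
import socket

WHOIS_PORT = 43  # TCP port assigned to WHOIS (RFC 3912)


def build_query(domain: str) -> bytes:
    """Return the bytes a WHOIS client sends: the query text followed by CRLF."""
    return domain.strip().encode("ascii") + b"\r\n"


def raw_whois(domain: str,
              server: str = "whois.verisign-grs.com",  # assumed: registry server for .com/.net
              timeout: float = 10.0) -> str:
    """Send one query and read the reply until the server closes the connection."""
    chunks = []
    with socket.create_connection((server, WHOIS_PORT), timeout=timeout) as conn:
        conn.sendall(build_query(domain))
        while True:
            data = conn.recv(4096)
            if not data:  # an empty read means the server has closed the connection
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")
```

Calling `raw_whois('baidu.com')` needs network access and returns unparsed text. A registry server often only refers to the registrar's own WHOIS server, so a second query may be needed to see registrant details, which is part of what a wrapper library takes care of.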

II. Installation and usage

  • 1. Run pip3 install python-whois. If you are working in a Python 2 environment, run pip install python-whois instead.

  • 2. Once it is installed, let's verify that it works:

      import whois
    
      print(whois.whois('baidu.com'))
    
      # Output
      {
        "domain_name": [
          "BAIDU.COM",
          "baidu.com"
        ],
        "registrar": "MarkMonitor, Inc.",
        "whois_server": "whois.markmonitor.com",
        "referral_url": "http://www.markmonitor.com",
        "updated_date": [
          "2015-09-10 00:00:00",
          "2017-02-13 01:04:21-08:00"
        ],
        "creation_date": [
          "1999-10-11 00:00:00",
          "1999-10-11 04:05:17-07:00"
        ],
        "expiration_date": [
          "2017-10-11 00:00:00",
          "2017-10-11 00:00:00-07:00"
        ],
        "name_servers": [
          "DNS.BAIDU.COM",
          "NS2.BAIDU.COM",
          "NS3.BAIDU.COM",
          "NS4.BAIDU.COM",
          "NS7.BAIDU.COM",
          "ns3.baidu.com",
          "ns2.baidu.com",
          "dns.baidu.com",
          "ns4.baidu.com",
          "ns7.baidu.com"
        ],
        "status": [
          "clientDeleteProhibited https://icann.org/epp#clientDeleteProhibited",
          "clientTransferProhibited https://icann.org/epp#clientTransferProhibited",
          "clientUpdateProhibited https://icann.org/epp#clientUpdateProhibited",
          "serverDeleteProhibited https://icann.org/epp#serverDeleteProhibited",
          "serverTransferProhibited https://icann.org/epp#serverTransferProhibited",
          "serverUpdateProhibited https://icann.org/epp#serverUpdateProhibited",
          "clientUpdateProhibited (https://www.icann.org/epp#clientUpdateProhibited)",
          "clientTransferProhibited (https://www.icann.org/epp#clientTransferProhibited)",
          "clientDeleteProhibited (https://www.icann.org/epp#clientDeleteProhibited)",
          "serverUpdateProhibited (https://www.icann.org/epp#serverUpdateProhibited)",
          "serverTransferProhibited (https://www.icann.org/epp#serverTransferProhibited)",
          "serverDeleteProhibited (https://www.icann.org/epp#serverDeleteProhibited)"
        ],
        "emails": [
          "abusecomplaints@markmonitor.com",
          "domainmaster@baidu.com"
        ],
        "dnssec": "unsigned",
        "name": "Domain Admin",
        "org": "Beijing Baidu Netcom Science Technology Co., Ltd.",
        "address": "3F Baidu Campus No.10, Shangdi 10th Street Haidian District",
        "city": "Beijing",
        "state": "Beijing",
        "zipcode": "100085",
        "country": "CN"
      }
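The output above shows two things worth handling before acting on a record: some fields come back as lists with duplicates that differ only in letter case (see name_servers), and the org field identifies the owner. As a rough sketch of how the owner could drive the crawl rate mentioned in the introduction, the helpers below work on any dict-like record. The names `as_list`, `unique_name_servers`, `choose_delay`, the owner keywords, and the delay values are all hypothetical choices for illustration, not part of python-whois.

```python
from typing import Any, List, Mapping

# Hypothetical settings: owners to crawl more carefully, and delays in seconds
CAUTIOUS_OWNERS = ("google", "baidu")
DEFAULT_DELAY = 1.0
CAUTIOUS_DELAY = 5.0


def as_list(value: Any) -> List[Any]:
    """Normalize a field that may be missing, a single value, or a list."""
    if value is None:
        return []
    if isinstance(value, (list, tuple, set)):
        return list(value)
    return [value]


def unique_name_servers(record: Mapping[str, Any]) -> List[str]:
    """Return name servers lowercased, without duplicates, in first-seen order."""
    seen = set()
    result = []
    for ns in as_list(record.get("name_servers")):
        key = str(ns).lower()
        if key not in seen:
            seen.add(key)
            result.append(key)
    return result


def choose_delay(record: Mapping[str, Any]) -> float:
    """Pick a longer delay when the registrant org matches a cautious owner."""
    for org in as_list(record.get("org")):
        if any(owner in str(org).lower() for owner in CAUTIOUS_OWNERS):
            return CAUTIOUS_DELAY
    return DEFAULT_DELAY
```

With the library installed, the record returned by whois.whois('baidu.com') can be passed directly, for example choose_delay(whois.whois('baidu.com')), assuming it behaves like the dictionary printed above. Many registrars now redact registrant fields, so org may be missing, in which case this sketch falls back to the default delay.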

   Reposting rules

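"Want to know who owns a website? Web crawler tools: using the WHOIS protocol" by GajAngels is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.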