宜配屋

Scraping and Playing Videos from iQIYI with Python and BeautifulSoup

yipeiwu_com | 2020-03-06 | Python Crawlers

import urllib.request
import webbrowser

from bs4 import BeautifulSoup


class DramaItem:
    """One episode entry: its number label, description, and playback URL."""

    def __init__(self, num, title, url):
        self.num = num
        self.title = title
        self.url = url

    def __str__(self):
        return self.num + '    ' + self.title

    def openDrama(self):
        # os.startfile exists only on Windows; webbrowser.open is portable.
        webbrowser.open(self.url)

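# Fetch the series page and parse it with Python's built-in HTML parser.
response = urllib.request.urlopen('http://www.iqiyi.com/a_19rrgja8xd.html')
html = response.read()
soup = BeautifulSoup(html, 'html.parser')
# The script expects the episode list in a div whose class attribute
# is exactly 'list_block1 align_c'.
dramaList = soup.find_all('div', attrs={'class': 'list_block1 align_c'})
dramaItems = []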

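if dramaList:
    for li in dramaList[0].find_all('li'):
        ps = li.find_all('p')
        # The first <p> holds the episode link, the second (if any) its description.
        link = ps[0].find('a') if ps else None
        if link is None or not link.get('href'):
            continue  # skip list items without an episode link
        description = ps[1].text if len(ps) > 1 else ''
        di = DramaItem(link.text, description, link['href'])
        dramaItems.append(di)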

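for di in dramaItems:
    print(di)

# Episodes are chosen by their 1-based position in the printed list.
diLen = len(dramaItems)
try:
    userChoice = int(input('input number to watch the drama:'))
except ValueError:
    userChoice = 0  # non-numeric input selects nothing
if 1 <= userChoice <= diLen:
    dramaItems[userChoice - 1].openDrama()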
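
The page layout may have changed since this post was written, so the selectors above might no longer match anything on the live site. As a minimal sketch for checking the parsing logic offline, the example below runs the same `div` / `li` / `p` lookup against a small inline HTML sample. The sample markup, the `parse_episodes` helper, and the `v_example` URLs are all made up for illustration; they only mirror the structure the script expects and are not taken from the real iQIYI page.

```python
from bs4 import BeautifulSoup

# Hypothetical markup that mirrors the selectors used in the script above;
# the real iQIYI page may look different.
SAMPLE_HTML = """
<div class="list_block1 align_c">
  <ul>
    <li>
      <p><a href="http://www.iqiyi.com/v_example1.html">Episode 1</a></p>
      <p>First episode</p>
    </li>
    <li>
      <p><a href="http://www.iqiyi.com/v_example2.html">Episode 2</a></p>
    </li>
    <li><p>No link here</p></li>
  </ul>
</div>
"""


def parse_episodes(html):
    """Return a list of (num, description, url) tuples found in html."""
    soup = BeautifulSoup(html, 'html.parser')
    # Match the exact class string, as the script above does.
    blocks = soup.find_all('div', attrs={'class': 'list_block1 align_c'})
    episodes = []
    if not blocks:
        return episodes
    for li in blocks[0].find_all('li'):
        ps = li.find_all('p')
        link = ps[0].find('a') if ps else None
        if link is None or not link.get('href'):
            continue  # skip entries without a usable link
        description = ps[1].text.strip() if len(ps) > 1 else ''
        episodes.append((link.text.strip(), description, link['href']))
    return episodes


episodes = parse_episodes(SAMPLE_HTML)
for num, description, url in episodes:
    print(num, description, url)
```

Keeping the parsing in a function that takes an HTML string makes it possible to test the selectors without a network request; the downloaded page can later be passed to the same function.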



© YiPeiWu.com 【宜配屋】粤ICP备17031333号

Powered By Z-BlogPHP. Theme by TOYEAN.
