python抓取网页中的图片示例

yipeiwu_com6年前 (2020-03-06)Python爬虫

复制代码代码如下:

#coding:utf8
import re
import urllib
def getHTML(url):
    page = urllib.urlopen(url)
    html = page.read()
    return html

def getImg(html,imgType):
    reg = r'src="(.*?\.+'+imgType+'!slider)" '
    imgre = re.compile(reg)
    imgList = re.findall(imgre, html)
    x=0
    for imgurl in imgList:
        print imgurl
        urllib.urlretrieve(imgurl, '%s.%s' % (x, imgType))
        x =x+1

html= getHTML("//www.jb51.net")

getImg(html,'jpg')

返回列表

上一篇：Python字符转换

下一篇：PHP生成静态页面详解

相关文章

python爬取51job中hr的邮箱

本文实例为大家分享了python爬取51job中hr的邮箱具体代码，供大家参考，具体内容如下 #encoding=utf8 import urllib2 import cookie...

Python打印scrapy蜘蛛抓取树结构的方法

本文实例讲述了Python打印scrapy蜘蛛抓取树结构的方法。分享给大家供大家参考。具体如下：通过下面这段代码可以一目了然的知道scrapy的抓取页面结构，调用也非常简单 #!/...

python爬虫猫眼电影和电影天堂数据csv和mysql存储过程解析

字符串常用方法 # 去掉左右空格 'hello world'.strip() # 'hello world' # 按指定字符切割 'hello world'.split(' ')...

Python3网络爬虫中的requests高级用法详解

Python3网络爬虫中的requests高级用法详解

本节我们再来了解下 Requests 的一些高级用法，如文件上传，代理设置，Cookies 设置等等。 1. 文件上传我们知道 Reqeuests 可以模拟提交一些数据，假如有的网站需...

Python爬虫使用脚本登录Github并查看信息

Python爬虫使用脚本登录Github并查看信息

前言分析目标网站的登录方式目标地址： https://github.com/login 登录方式做出分析：第一，用form表单方式提交信息，第二...