實(shí)驗(yàn)環(huán)境:windows 7,anaconda 3(python 3.5),tensorflow(gpu/cpu)
函數(shù)介紹:所用函數(shù)為six.moves下的urllib中的函數(shù),調(diào)用如下urllib.request.urlretrieve(url,[filepath,[recall_func,[data]]])。簡(jiǎn)單介紹一下,url是必填的指的是下載地址,filepath指的是保存的本地地址,recall_func指的是回調(diào)函數(shù),下載過(guò)程中會(huì)調(diào)用可以用來(lái)顯示下載進(jìn)度。
實(shí)驗(yàn)代碼:以下載cifar10的dataset和抓取斗魚(yú)首頁(yè)為例
下載cifar10的dataset,并解壓
from six.moves import urllibimport osimport sysimport tensorflow as tfimport tarfileFLAGS = tf.app.flags.FLAGS#提取系統(tǒng)參數(shù)作用的變量tf.app.flags.DEFINE_string('dir','D:/download_html','directory of html')#將下載目錄保存到變量dir中,通過(guò)FLAGS.dir提取directory = FLAGS.dir#從FLAGS中提取dir變量url = 'http://www.cs.toronto.edu/~kriz/cifar-10-binary.tar.gz'filename = url.split('/')[-1]#-1表示分割后的最后一個(gè)元素filepath = os.path.join(directory,filename)if not os.path.exists(directory): os.makedirs(directory)if not os.path.exists(filepath): def _recall_func(num,block_size,total_size): sys.stdout.write('/r>> downloading %s %.1f%%' % (filename,float(num*block_size)/float(total_size)*100.0)) sys.stdout.flush() urllib.request.urlretrieve(url,filepath,_recall_func) print() file_info = os.stat(filepath) print('Successfully download',filename,file_info.st_size,'bytes')tar = tarfile.open(filepath,'r:gz')#指定解壓路徑和解壓方式為解壓gziptar.extractall(directory)#全部解壓
抓取斗魚(yú)首頁(yè)
from six.moves import urllibimport osimport sysimport tensorflow as tfFLAGS = tf.app.flags.FLAGS#提取系統(tǒng)參數(shù)作用的變量tf.app.flags.DEFINE_string('dir','D:/download_html','directory of html')#將下載目錄保存到變量dir中,通過(guò)FLAGS.dir提取directory = FLAGS.dir#從FLAGS中提取dir變量url = 'http://www.douyu.com/'filename = 'douyu.html'#保存成你想要的名字,這里保存成douyu.htmlfilepath = os.path.join(directory,filename)if not os.path.exists(directory): os.makedirs(directory)if not os.path.exists(filepath): def _recall_func(num,block_size,total_size): sys.stdout.write('/r>> downloading %s %.1f%%' % (filename,float(num*block_size)/float(total_size)*100.0)) sys.stdout.flush() urllib.request.urlretrieve(url,filepath,_recall_func) print() file_info = os.stat(filepath)#獲取文件信息 print('Successfully download',filename,file_info.st_size,'bytes')#.st_size文件的大小,以字節(jié)為單位
新聞熱點(diǎn)
疑難解答
圖片精選