国产探花免费观看_亚洲丰满少妇自慰呻吟_97日韩有码在线_资源在线日韩欧美_一区二区精品毛片,辰东完美世界有声小说,欢乐颂第一季,yy玄幻小说排行榜完本

首頁 > 編程 > Python > 正文

python使用BeautifulSoup分析網頁信息的方法

2019-11-25 17:51:10

字體：大中小

來源：轉載

供稿：網友

本文實例講述了python使用BeautifulSoup分析網頁信息的方法。分享給大家供大家參考。具體如下：

這段python代碼查找網頁上的所有鏈接，分析所有的span標簽，并查找class包含titletext的span的內容

復制代碼代碼如下:

#import the library used to query a website
import urllib2

#specify the url you want to query
url = "http://www.python.org"

#Query the website and return the html to the variable 'page'
page = urllib2.urlopen(url)

#import the Beautiful soup functions to parse the data returned from the website
from BeautifulSoup import BeautifulSoup

#Parse the html in the 'page' variable, and store it in Beautiful Soup format
soup = BeautifulSoup(page)

#to print the soup.head is the head tag and soup.head.title is the title tag
print soup.head
print soup.head.title

#to print the length of the page, use the len function
print len(page)

#create a new variable to store the data you want to find.
tags = soup.findAll('a')

#to print all the links
print tags

#to get all titles and print the contents of each title
titles = soup.findAll('span', attrs = { 'class' : 'titletext' })
for title in allTitles:
print title.contents

希望本文所述對大家的Python程序設計有所幫助。

上一篇：python實現分析apache和nginx日志文件并輸出訪客ip列表的方法

下一篇：python使用webbrowser瀏覽指定url的方法

學習交流

索泰發布一款GTX 1070 Mini迷你版本:小機

索泰發布一款GTX 1070 Mini迷你版本:小機箱大愛...

熱門圖片

猜你喜歡的新聞

猜你喜歡的關注

新聞熱點

榮耀總裁趙明烏鎮演講：榮耀首款5G手機V30下月發布

2019-10-23 09:17:05

搜狐張朝陽：回歸媒體是搜狐重新崛起的關鍵

2019-10-21 09:20:02

華為輪值董事長郭平：虛擬技術創造現實價值

2019-10-21 09:00:12

滴滴英文服務上線兩周年用戶已超200萬

2019-09-26 08:57:12

華為推出全球至快AI訓練集群Atlas900

2019-09-25 08:46:36

馬斯克：特斯拉正組建中國技術團隊

2019-09-25 08:15:43

疑難解答

圖片精選

網友關注

主站蜘蛛池模板：双柏县| 嘉定区| 隆化县| 光山县| 龙海市| 兴化市| 新巴尔虎左旗| 五原县| 博野县| 鸡东县| 吕梁市| 罗城| 社旗县| 徐水县| 郸城县| 乐平市| 浮山县| 隆化县| 巴中市| 莱芜市| 抚远县| 福清市| 荥经县| 阿克苏市| 德清县| 武强县| 吉安县| 夹江县| 延川县| 巨鹿县| 江陵县| 长子县| 连云港市| 伊吾县| 靖远县| 伊金霍洛旗| 大城县| 姜堰市| 元谋县| 吉林市| 集安市|