Python自帶的tarfile模塊可以方便讀取tar歸檔文件,牛b的是可以處理使用gzip和bz2壓縮歸檔文件tar.gz和tar.bz2。
與tarfile對應的是zipfile模塊,zipfile是處理zip壓縮的。請注意:os.system(cmd)可以使Python腳本執行命令,當然包括:tar -czf *.tar.gz *,tar -xzf *.tar.gz,unzip等,當我覺得這樣盡管可以解決問題,但我覺得很業余。
使用tarfile壓縮
import tarfile
#創建壓縮包名
tar = tarfile.open("/tmp/tartest.tar.gz","w:gz")
#創建壓縮包
for root,dir,files in os.walk("/tmp/tartest"):
for file in files:
fullpath = os.path.join(root,file)
tar.add(fullpath)
tar.close()
使用tarfile解壓
def extract(tar_path, target_path):
try:
tar = tarfile.open(tar_path, "r:gz")
file_names = tar.getnames()
for file_name in file_names:
tar.extract(file_name, target_path)
tar.close()
except Exception, e:
raise Exception, e
其中open的原型是:
tarfile.open(name=None, mode='r', fileobj=None, bufsize=10240, **kwargs)
mode的值有:
'r' or 'r:*' Open for reading with transparent compression (recommended).
'r:' Open for reading exclusively without compression.
'r:gz' Open for reading with gzip compression.
'r:bz2' Open for reading with bzip2 compression.
'a' or 'a:' Open for appending with no compression. The file is created if it does not exist.
'w' or 'w:' Open for uncompressed writing.
'w:gz' Open for gzip compressed writing.
'w:bz2' Open for bzip2 compressed writing.
更多請參考:tarfile ― Read and write tar archive files