每日存档十二月 13th, 2010

批量下载某网站pdf文件down_pdf.py

2010年12月13日写评论作者 夜行人

最近在学Python，练手作品

[python]
#!/bin/env python
import urllib
import re
import os
from sgmllib import SGMLParser

found=[]

class URLLister(SGMLParser):
def reset(self):
SGMLParser.reset(self)
self.urls = []

def start_a(self, attrs):
href = [v for k, v in attrs if k==’href’]
if href:
self.urls.extend(href)

if not os.path.isdir(os.getcwd()+’/ppts’):
os.mkdir(os.getcwd()+’/ppts’)

usock = urllib.urlopen(“http://velocity.oreilly.com.cn/index.php?func=slidesvideos”)
parser = URLLister()
parser.feed(usock.read())
usock.close()
parser.close()
for url in parser.urls:
found.append(re.findall(‘ppts/\w*.pdf’,url))

for pdfurl in found:
for i in range(0,len(pdfurl),1):
urllib.urlretrieve(“http://velocity.oreilly.com.cn/”+pdfurl[i],os.getcwd()+”/”+pdfurl[i])

[/python]

Python

python2.7.1+pysvn on CentOS

2010年12月13日写评论作者 夜行人

CentOS上的Python真是老掉牙了，2.4.3
我需要用python去update svn

主要参考了pysvn-1.7.2里的INSTALL.html

Subversion常规安装，安装目录
/opt/svn_1.6.12

Python安装目录
/usr/local/python271

mkdir /data/download
cd !$
wget http://www.python.org/ftp/python/2.7.1/Python-2.7.1.tar.bz2
wget http://pysvn.barrys-emacs.org/source_kits/pysvn-1.7.2.tar.gz

安装Python 2.7.1
tar xjf Python-2.7.1.tar.bz2
cd Python-2.7.1
./configure –prefix=/usr/local/python271
make && make install

安装pysvn
tar xzf pysvn-1.7.2.tar.gz
cd pysvn-1.7.2/Source
/usr/local/python271/bin/python setup.py configure –svn-root-dir=/opt/svn_1.6.12
make
mkdir -p /usr/local/python271/lib/python2.7/pysvn
/bin/cp -a pysvn/* /usr/local/python271/lib/python2.7/pysvn/

测试
vi svn_up.py

[python]

#!/usr/local/python271/bin/python
import pysvn
client=pysvn.Client()
client.update(‘/data/website/’)
[/python]

Linux, Python

Python

一	二	三	四	五	六	日
« 11月				1月 »
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

每日存档十二月 13th, 2010

批量下载某网站pdf文件down_pdf.py

python2.7.1+pysvn on CentOS

Search

文章归档

分类目录

链接表

功能

每日存档 十二月 13th, 2010

批量下载某网站pdf文件down_pdf.py

python2.7.1+pysvn on CentOS

Search

标签

文章归档

分类目录

链接表

功能

每日存档十二月 13th, 2010