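Scanning Website Directories with Python
This is a very simple version. Later revisions will add multithreading, reading the directory list from a file, checking the HTTP response headers, and so on.
You need to supply a target website and the error message that site shows for a page that does not exist.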
CODE:
#!/usr/bin/env python
# -*- coding:utf-8 -*-
# Python 2 script (uses raw_input, the print statement and urllib.urlopen)
import urllib
import urlparse
import HTMLParser


class CheckHTML(HTMLParser.HTMLParser):
    # Stays True unless the error message is found in the page body
    available = True

    def check_data(self, data, ErrorSym):
        error = ErrorSym
        if error in data:
            self.available = False


target = raw_input("Please enter your target:")
errorsyb = raw_input("Enter the errorsyb:(eg:404 Error)")
# Directory names to probe on the target site
check_list = ['admin', 'inc', 'data', 'include', 'count']
for li in check_list:
    new_urls = urlparse.urljoin("http://" + target + "/", li)
    d = urllib.urlopen(new_urls)
    data = d.read()
    d.close()
    # A fresh checker per URL, so one miss does not affect the next result
    check = CheckHTML()
    check.check_data(data, errorsyb)
    if check.available:
        print "%s------------->Found!" % new_urls
    else:
        print "%s------------->Not Found!" % new_urls
--------------------------------------------------------------------------------------------
The result may look like this:
Please enter your target:www.fuck.com
Enter the errorsyb:(eg:404 Error)HTTP 错误 404
http://www.fuck.com/admin------------->Found!
http://www.fuck.com/inc------------->Not Found!
http://www.fuck.com/data------------->Found!
http://www.fuck.com/include------------->Found!
http://www.fuck.com/count------------->Found!
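The improvements mentioned at the top (multithreading, reading the directory list from a file, and judging by the response headers instead of the page text) could be sketched roughly as follows. This is only a minimal sketch, written for Python 3 rather than the Python 2 of the script above. The names load_words, fetch_status, scan and the workers parameter are hypothetical, and treating every status other than 404 as "found" is an assumption. A site that answers 200 with a custom error page would still need the error-message check from the original script.

```python
"""Python 3 sketch: status-code based directory check using a thread pool."""
from concurrent.futures import ThreadPoolExecutor
from urllib.error import HTTPError
from urllib.parse import urljoin
from urllib.request import urlopen


def load_words(path):
    """Read one directory name per line, skipping blank lines."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def fetch_status(url, timeout=5):
    """Return the HTTP status code for url, or None if the request fails."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        return err.code  # 4xx/5xx responses are raised as HTTPError
    except OSError:
        return None      # DNS failure, refused connection, timeout, ...


def scan(target, words, fetch=fetch_status, workers=8):
    """Probe http://<target>/<word> for each word.

    Returns a list of (url, found) pairs in the same order as words.
    """
    base = "http://" + target + "/"
    urls = [urljoin(base, word) for word in words]
    # The requests run concurrently; pool.map keeps the input order
    with ThreadPoolExecutor(max_workers=workers) as pool:
        codes = list(pool.map(fetch, urls))
    # Assumption: anything other than 404 (or no answer at all) counts as found
    return [(url, code is not None and code != 404)
            for url, code in zip(urls, codes)]
```

With a word list in a file (dirs.txt here is a made-up name), the output format of the original script can be reproduced with `for url, found in scan("www.example.com", load_words("dirs.txt")): print("%s------------->%s" % (url, "Found!" if found else "Not Found!"))`. The fetch parameter exists so that the scan logic can be tried with a fake function, without touching the network.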