heritrix 3.X 环境搭建
依赖
<dependency>
<groupId>org.arcHive.heritrix</groupId>
<artifactId>heritrix-commons</artifactId>
<version>3.1.0</version>
<scope>compile</scope>
</dependency>
<dependency>
<groupId>org.archive.heritrix</groupId>
<artifactId>heritrix-modules</artifactId>
<version>3.1.0</version>
<scope>compile</scope>
</dependency>
<dependency>
<groupId>org.archive.heritrix</groupId>
<artifactId>heritrix-engine</artifactId>
<version>3.1.0</version>
<scope>compile</scope>
</dependency>
2. 入口函数
public static void main(String[] args) throws Exception{
new Heritrix().instanceMain(args);
}
3. 启动
run/debug as application.在Arguments中填写:-a admin:admin
4. 访问
浏览器:https:localhost:8443 用户名/密码 admin/admin (注意是https)
Sun Microsystems Inc. Java(TM) SE Runtime Environment 1.6.0_43-b01
Using ad-hoc HttpS certificate with fingerprint...
SHA1:DD:19:6B:EB:54:61:98:7C:C2:FC:4C:A4:CB:7C:19:EC:A0:33:0D:7C
Verify in browser before accepting exception.
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/C:/Users/Administrator/.m2/repository/org/slf4j/slf4j-log4j12/1.6.1/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/C:/Users/Administrator/.m2/repository/org/slf4j/slf4j-jdk14/1.6.6/slf4j-jdk14-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
engine listening at port 8443
operator login set per command-line
NOTE: We recommend a longer, stronger passWord, especially if your WEB
interface will be internet-accessible.
Heritrix version: 3.3.0-SNAPSHOT-${Maven.build.timestamp}
5.遇到的一些问题:依赖无法下载,是因为无法访问资源库,需要使用代理或者×××来解决。
相关文章