mirror of
https://github.com/btdig/dhtcrawler2.git
synced 2025-01-19 12:41:36 +00:00
38 lines
947 B
Markdown
38 lines
947 B
Markdown
|
## dhtcrawler
|
||
|
|
||
|
dhtcrawler is a DHT crawler written in erlang. It can join a DHT network and crawl many P2P torrents. The program save all torrent info into database and provide an http interface to search a torrent by a keyword.
|
||
|
|
||
|
![screenshot](https://raw.github.com/kevinlynx/dhtcrawler/master/screenshot.png)
|
||
|
|
||
|
## Usage
|
||
|
|
||
|
* Download mongodb and start it with text search, i.e:
|
||
|
|
||
|
mongod --dbpath db --setParameter textSearchEnabled=true
|
||
|
|
||
|
* Download dhtcrawler source code
|
||
|
* Use `rebar` to download and install all dependent libraries
|
||
|
|
||
|
rebar get-deps
|
||
|
|
||
|
* compile
|
||
|
|
||
|
rebar compile
|
||
|
|
||
|
* start dhtcrawler
|
||
|
|
||
|
crawler_app:start()
|
||
|
|
||
|
* start the http front-end
|
||
|
|
||
|
crawler_http:start().
|
||
|
|
||
|
* Open a web browser and point to `localhost:8000/index.html`
|
||
|
|
||
|
## Config
|
||
|
|
||
|
see priv/dhtcrawler.config.
|
||
|
|
||
|
**NOTE**, when you change `node_count` value in dhtcrawler.config, you'd better delete all files saved in dhtstate directory.
|
||
|
|