mirror of
https://github.com/tiennm99/atnvc-crawler.git
synced 2026-06-21 13:40:38 +00:00
18 lines
554 B
Markdown
18 lines
554 B
Markdown
# atnvc-crawler
|
|
|
|
Crawl data of "Anh trai nhân vật chính", a novel from https://ln.hako.vn/sang-tac/8476-kiep-nay-la-anh-trai-cua-nhan-vat-chinh
|
|
|
|
# How to run
|
|
1. Install requirements
|
|
```
|
|
pip install -r requirements.txt
|
|
```
|
|
2. Run main.py to crawl data
|
|
```
|
|
python main.py
|
|
```
|
|
__Note__: You may get `HTTP Error 429: Too Many Requests`. Then you can try again later, and skip downloaded chapters, example skip 50 first chapters like:
|
|
```
|
|
for chapter in chapters[50:]:
|
|
```
|
|
3. The data will be saved in `data` folder |