Files
atnvc-crawler/README.md
2024-10-11 21:03:49 +07:00

554 B

atnvc-crawler

Crawl data of "Anh trai nhân vật chính", a novel from https://ln.hako.vn/sang-tac/8476-kiep-nay-la-anh-trai-cua-nhan-vat-chinh

How to run

  1. Install requirements
    pip install -r requirements.txt
    
  2. Run main.py to crawl data
    python main.py
    
    Note: You may get HTTP Error 429: Too Many Requests. Then you can try again later, and skip downloaded chapters, example skip 50 first chapters like:
    for chapter in chapters[50:]:
    
  3. The data will be saved in data folder