Files
phow2sim/docker-compose.yml
T
tiennm99 6b1b401283 feat: fetch model via Nextcloud WebDAV with Basic auth
The upstream public.vinai.io mirror is dead and PhoW2V's research
license forbids public redistribution, so anonymous auto-download is
no longer viable. Expect a private Nextcloud (WebDAV or password-
protected public share) per deployment.

- Stream downloads in 1MiB chunks (flat RAM for ~1GB zips)
- Basic auth via MODEL_DOWNLOAD_USER / MODEL_DOWNLOAD_PASSWORD
- Drop the broken public.vinai.io default; compose requires MODEL_URL
- Add .env.example with WebDAV and public-share recipes
- Remove scripts/download-phow2v.sh (pointed at the dead mirror)
- README rewritten around the NC workflow; update license caveat
2026-04-23 10:44:33 +07:00

27 lines
1.1 KiB
YAML

services:
phow2sim:
build: .
ports:
- "8001:8000"
environment:
# MODEL_URL is a Nextcloud WebDAV (or public-share+password) URL to a
# PhoW2V .zip. Example WebDAV path:
# https://cloud.example.com/remote.php/dav/files/<user>/phow2v/word2vec_vi_words_300dims.zip
# Example public share (password-protected):
# https://cloud.example.com/s/<token>/download (leave MODEL_DOWNLOAD_USER empty)
MODEL_URL: ${MODEL_URL:?set MODEL_URL in .env to your Nextcloud zip}
MODEL_PATH: ${MODEL_PATH:-/data/phow2v/word2vec_vi_words_300dims.txt}
MODEL_VARIANT: ${MODEL_VARIANT:-word}
# Basic-auth for the MODEL_URL. For Nextcloud WebDAV use the Nextcloud
# username + an app-password (not your login password). For a
# password-protected public share, leave MODEL_DOWNLOAD_USER empty
# and put the share password in MODEL_DOWNLOAD_PASSWORD.
MODEL_DOWNLOAD_USER: ${MODEL_DOWNLOAD_USER:-}
MODEL_DOWNLOAD_PASSWORD: ${MODEL_DOWNLOAD_PASSWORD:?set MODEL_DOWNLOAD_PASSWORD in .env}
volumes:
- phow2v-cache:/data/phow2v
restart: unless-stopped
volumes:
phow2v-cache: