Skip to content

Spider management for Scrapyd/Open Source ScrapingHub

License

Notifications You must be signed in to change notification settings

Zephyrrus/UBGuardian

 
 

Repository files navigation

UBGuardian

Latest Version Python Versions The MIT License

A scalable admin ui for scrapy spider service

UBGuardian is a fork of ScrapyKeeper which is a fork of SpiderKeeper

Forked to provide:

  • Enhanced statistics (Errors, Exceptions, Retries...)
  • Cache support
  • Dashboarding
  • Most of pending SpideKeeper ahead commits

Screenshot

job dashboard periodic job project stats spider stats

Installing

git clone https://github.com/zephyrrus/UBGuardian.git
cd ScrapyKeeper
pip install .

Deployment

scrapykeeper -h

Usage: scrapykeeper [options]

Admin ui for scrapy spider service

Options:
  -h, --help            show this help message and exit
  --host=HOST           host, default:0.0.0.0
  --port=PORT           port, default:5000
  --username=USERNAME   basic auth username ,default: admin
  --password=PASSWORD   basic auth password ,default: admin
  --type=SERVER_TYPE    access spider server type, default: scrapyd
  --server=SERVERS      servers, default: ['http://localhost:6800']
  --database-url=DATABASE_URL
                        ScrapyKeeper metadata database default: sqlite://./ScrapyKeeper.db
  --no-auth             disable basic auth
  --no-sentry           disable sentry.io error reporting
  -v, --verbose         log level

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

Authors

See also the list of contributors who participated in this project.

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Contributing

Contributions are welcomed!

About

Spider management for Scrapyd/Open Source ScrapingHub

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 49.9%
  • HTML 38.7%
  • JavaScript 10.9%
  • CSS 0.5%