Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add retries to crawler #6

Open
eleweek opened this issue Dec 20, 2014 · 2 comments
Open

Add retries to crawler #6

eleweek opened this issue Dec 20, 2014 · 2 comments

Comments

@eleweek
Copy link
Owner

eleweek commented Dec 20, 2014

In the episode 6 (http://www.youtube.com/watch?v=dAp0GkWSxaA) I noticed that crawler crashes when it tries to get current_page (current_page = downloadRedditUrl(current_page_url)). It would be nice to have retries here.

@CheoR
Copy link
Contributor

CheoR commented Jan 13, 2015

this gives a simple workaround for the 504 HTTP Error 504 Gateway timeout , from the link, "This problem is entirely due to slow IP communication between back-end computers, possibly including the Web server. Only the people who set up the network at the site which hosts the Web server can fix this problem."

@eleweek
Copy link
Owner Author

eleweek commented Mar 16, 2015

Hm, this issue may still be relevant in certain parts. The project uses reddit api/praw now though

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants