Twitter_Scraper

NOTE: currently under development

Scrapes tweets from twitter.com and inserts into a SQL server database.
Uses Celery the asynchronous task queue as a framework.
Tested on Ubuntu 14.04 with pyhton 3.4

Install requirements

Python
Celery
- pip install Celery
pymssql
- sudo apt-get install freetds-dev freetds-bin
- pip install pymssql
requests
lxml
- sudo apt-get install python3-lxml
cssselect
- pip install cssselect
RabbitMQ
- sudo apt-get install rabbitmq-server

create a file keys.json file which contains the SQL server connection parameters

{
    "server":  "SERVER.database.windows.net",
    "user": "USER@SERVER",
    "password": "password",
    "database": "databasename"
}

note: Use the --recursive option when cloning to also clone the submodule

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
sql		sql
twitterWebsiteSearch @ 98decbf		twitterWebsiteSearch @ 98decbf
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
celeryconfig.py		celeryconfig.py
createtable.sql		createtable.sql
temp.py		temp.py
temp1.py		temp1.py
twit_tasks.py		twit_tasks.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Twitter_Scraper

Install requirements

About

Uh oh!

Releases

Packages

Languages

License

dtuit/twitter_scraper

Folders and files

Latest commit

History

Repository files navigation

Twitter_Scraper

Install requirements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages