This software tool extracts large collections of Twitter news-sharing users, their news tweets, and the full data structure of the shared articles.
The application is fully automated and self-sustaining, so it can be run for indefinitely long sessions.
- Python (>3.4.0) and pip
- MongoDB
- Twitter API keys
- (optional) Face++ keys
Clone the repository:
git clone https://github.com/DataSciencePolimi/NewsAnalyzer.git
Inside the project folder, initialize a Python virtual environment:
virtualenv newsanalyzer-env
Activate it:
source newsanalyzer-env/bin/activate
Then install the requirements:
pip install -r requirements.txt
If you don't have them yet, obtain Twitter API credentials. Open credential.json and fill in the values with your keys:
{
  "consumer_key": "<twitter API consumer key>",
  "consumer_secret": "<twitter API consumer secret>",
  "access_token": "<twitter API access token>",
  "access_token_secret": "<twitter API access token secret>",
  "faceplus_key": "<face++ key (optional)>",
  "faceplus_secret": "<face++ secret (optional)>"
}
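Before starting a long session, it can help to sanity-check the keys. The snippet below is a minimal sketch assuming the tweepy client, a common choice for the Twitter API; check requirements.txt for the client this project actually uses:

```python
import json

import tweepy  # assumption: tweepy is among the installed requirements

# Load the same credential file the application reads.
with open("credential.json") as f:
    creds = json.load(f)

auth = tweepy.OAuthHandler(creds["consumer_key"], creds["consumer_secret"])
auth.set_access_token(creds["access_token"], creds["access_token_secret"])
api = tweepy.API(auth)

# verify_credentials() raises an error if any of the four keys is invalid.
print(api.verify_credentials().screen_name)
```

If this prints your account's screen name, the four Twitter keys are valid.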
- Download and install MongoDB
- Run mongod to start a MongoDB server on localhost (this may require elevated privileges)
- Run the setup.py script inside the application folder
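To confirm the database is reachable before running the setup script or the pipeline, a quick check with pymongo (the standard Python MongoDB driver) looks like this:

```python
from pymongo import MongoClient

# Connect to the local MongoDB server started by `mongod`.
client = MongoClient("localhost", 27017, serverSelectionTimeoutMS=2000)

# server_info() forces a round trip and raises if the server is unreachable.
print(client.server_info()["version"])
```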
In order to start collecting users, tweets and articles, your database needs to contain at least one article entity to feed the recursive pipeline.
You can run utils/get_seeds.py to get a set of initial seeds, or you can download our pre-collected dataset.
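If you prefer to insert a seed by hand, the idea is simply to put one article document into the database. The database, collection, and field names below are illustrative guesses, not the project's actual schema; check what utils/get_seeds.py produces and adapt accordingly:

```python
from pymongo import MongoClient

client = MongoClient("localhost", 27017)
db = client["newsanalyzer"]  # hypothetical database name

# Hypothetical article shape: the real schema is whatever
# utils/get_seeds.py writes; adapt the fields to match it.
db["articles"].insert_one({
    "url": "https://example.com/some-news-article",
    "title": "Example seed article",
})
```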
Then run main_pipeline.py to start the collection.
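Because the pipeline is meant to run unattended for indefinitely long sessions, you may want a small supervisor that restarts it if the process ever exits. This is an optional convenience sketch, not part of the repository:

```python
import subprocess
import time

# Restart main_pipeline.py whenever it exits, with a short back-off,
# so an unattended session keeps collecting after transient failures.
while True:
    subprocess.run(["python", "main_pipeline.py"])
    time.sleep(60)  # pause before restarting
```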