Skip to content

Analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification, and sentiment analysis to make sense of crowdsourced civic input.

Notifications You must be signed in to change notification settings

jammily/civic-crowdanalytics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Civic CrowdAnalytics

Novel data analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification, and sentiment analysis to make sense of crowdsourced civic input. This tool automatically organizes contributions into executive summaries and compelling visualizations, which are easy to comprehend, searchable, and interrelated. Civic CrowdAnalytics is based on the scientific publication Civic CrowdAnalytics: Making sense of crowdsourced civic input with big data tools.

Civic CrowdAnalytics features a simple user-interface for submitting an unstructured dataset for analysis. The user can choose, for example, to organize ideas by pre-defined categories, visualize the frequency of recurring concepts, and sort the sentiments of related comments. The tool displays the results in both tabular summaries and interactive visualizations, which users can search and manipulate. Users can also choose to export the results in various formats, such as CSV, PNG, JPEG, SVG, or PDF.

Screenshots

dashboard

categorization

concept_extraction

Motivation

Civic technologies are currently bottlenecked by a common need for more effective processing of citizen contributions. Civic CrowdAnalytics provides a solution. By using innovative NLP and ML techniques, the tool automates the analysis and synthesis of key aspects of crowdsourced civic input. This automation will dramatically accelerate and improve the standard data management features that Civic Backoffice will also provide.

Features

In its first version the tool supports the following analytics features:

  1. Categorization: This feature organizes the data into main- and subcategories by using a well-known concept extraction algorithm that we adapt to our purposes. To train the algorithm, the user first codes a part of the dataset by labeling main categories and subcategories, and then lets the algorithm categorize the rest of the data;
  2. Concept Occurrence: Expressions and words are extracted from the data and displayed by frequency. Concept extraction provides lists of key terms and phrases, distributed by occurrence, which are then further analyzed using a variety of statistical and qualitative methods;
  3. Sentiment Analysis: The data is analyzed in terms of positive, negative, or neutral sentiment which is assessed in terms of established values of words and expressions. For example, words such as reduce, remove, and problem would show a negative sentiment, whereas increase, resolve, and good would show a positive sentiment. Sentiment analysis is already a common feature of algorithmic filtering on social media platforms;
  4. Entity Similarity: This feature display associations between ideas and comments based on their content similarity, which is also a common feature of algorithmic filtering on social media platforms.

Installation

  1. Install Node.js and update npm (see here for a guide)
  2. Clone the repository git clone https://github.com/ParticipaPY/civic-crowdanalytics
  3. Get inside civic-crowdanalytics/frontoffice
  4. Install project's dependencies by running npm install
  5. Start local server by running npm run dev
  6. Go to the following url http://localhost:8080

Dependencies

  1. Node.js version higher than 0.10.32
  2. Npm version higher than 2.1.8

Technologies

  1. Node.js and npm
  2. Vue.js
  3. CoreUI
  4. Chart.js

About

Analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification, and sentiment analysis to make sense of crowdsourced civic input.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • CSS 82.2%
  • Vue 11.1%
  • JavaScript 6.4%
  • HTML 0.3%