
Chinese-Multimodal-Sentiment-Analysis

Project Diagram

Introduction

Chinese-Multimodal-Sentiment-Analysis is a comprehensive repository dedicated to advancing the field of sentiment analysis using multimodal data inputs, focusing on Chinese language data. This repository houses code, datasets, and models that integrate and analyze textual, audio, and visual information to understand and predict sentiments in Chinese multimedia content.

Objective

The primary objective of this repository is to provide resources and tools for researchers and practitioners to perform sentiment analysis in Chinese, leveraging the power of multimodal learning techniques. This includes handling complex linguistic phenomena and capturing nuanced emotional expressions.

Installation

To get started with Chinese-Multimodal-Sentiment-Analysis, clone the repository and install the required dependencies:

git clone https://github.com/Shengwei-Peng/Chinese-Multimodal-Sentiment-Analysis.git
cd Chinese-Multimodal-Sentiment-Analysis
pip install -r requirements.txt

Dataset

CH-SIMS Dataset

The Chinese Multimodal Sentiment Analysis project utilizes the CH-SIMS dataset, a comprehensive collection for Chinese sentiment analysis using multimodal data.

The CH-SIMS feature files are organized as follows:

{
    "train": {
        "raw_text": [],              # raw text
        "audio": [],                 # audio feature
        "vision": [],                # video feature
        "id": [],                    # [video_id$_$clip_id, ..., ...]
        "text": [],                  # bert feature
        "text_bert": [],             # word ids for bert
        "audio_lengths": [],         # audio feature lenth(over time) for every sample
        "vision_lengths": [],        # same as audio_lengths
        "annotations": [],           # strings
        "classification_labels": [], # Negative(0), Neutral(1), Positive(2). Deprecated in v_2.0
        "regression_labels": []      # Negative(<0), Neutral(0), Positive(>0)
    },
    "valid": {***},                  # same as "train"
    "test": {***},                   # same as "train"
}
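As a quick sanity check, the feature file can be inspected with a few lines of Python. This is a minimal sketch only; it assumes the features are distributed as a single pickle file, and the path below is a placeholder:

import pickle

# Placeholder path: point this at the downloaded CH-SIMS feature file.
feature_file = "path/to/chsims_features.pkl"

with open(feature_file, "rb") as f:
    data = pickle.load(f)

train = data["train"]
print(train["raw_text"][0])             # transcript of the first clip
print(len(train["audio"]))              # number of training samples
print(train["audio_lengths"][:5])       # per-sample audio sequence lengths
print(train["regression_labels"][:5])   # continuous sentiment scores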

Overview

CH-SIMS is a richly annotated dataset that includes a variety of video clips with corresponding textual transcriptions and audio data. It is specifically designed for the analysis of sentiment in the Chinese language using a multimodal approach, integrating text, audio, and visual information.

Features

  • Multimodal Annotations: Each segment in the dataset is annotated with sentiments based on multimodal information, providing insights from text, audio, and visual modalities.
  • Diverse Sources: The dataset includes video segments from movies, TV series, and other multimedia sources, ensuring a diverse range of expressions and contexts.
  • Fine-Grained Analysis: CH-SIMS supports fine-grained sentiment analysis, allowing for a detailed understanding of emotional states and nuances in Chinese language content.

Download

The dataset can be obtained from the official CH-SIMS repository, maintained by the THUIAR group as part of their MMSA project.

The use of the CH-SIMS dataset in this project allows for an in-depth exploration and analysis of sentiment in the Chinese context, leveraging the strengths of multimodal data to achieve more accurate and nuanced sentiment analysis results.

Model

Adapter Models for Multimodal Feature Fusion

The Chinese-Multimodal-Sentiment-Analysis project leverages advanced adapter models to effectively combine and analyze features from different modalities, including text, audio, and video. These models play a pivotal role in the comprehensive sentiment analysis process, ensuring that all aspects of the data are utilized optimally.
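The exact architecture is defined in the repository's code; as an illustration of the general idea, here is a minimal PyTorch sketch of per-modality adapters followed by late fusion. The feature dimensions and class names are assumptions (768/33/709 are common CH-SIMS feature sizes), and the sketch expects features already pooled over time:

import torch
import torch.nn as nn

class ModalityAdapter(nn.Module):
    """Projects one modality's features into a shared space (illustrative)."""
    def __init__(self, in_dim: int, hidden_dim: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.ReLU(),
            nn.Dropout(0.1),
        )

    def forward(self, x):
        return self.net(x)

class FusionModel(nn.Module):
    """Concatenates adapted text/audio/vision features and regresses sentiment."""
    def __init__(self, text_dim=768, audio_dim=33, vision_dim=709, hidden_dim=128):
        super().__init__()
        self.text_adapter = ModalityAdapter(text_dim, hidden_dim)
        self.audio_adapter = ModalityAdapter(audio_dim, hidden_dim)
        self.vision_adapter = ModalityAdapter(vision_dim, hidden_dim)
        self.head = nn.Linear(3 * hidden_dim, 1)  # continuous sentiment score

    def forward(self, text, audio, vision):
        fused = torch.cat([
            self.text_adapter(text),
            self.audio_adapter(audio),
            self.vision_adapter(vision),
        ], dim=-1)
        return self.head(fused)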

Training the Adapter Models

To train the adapter models on your dataset, use the following command. It launches the training process, applying feature fusion across the text, audio, and vision modalities:

python main.py \
  --batch_size 64 \
  --lr 5e-5 \
  --epochs 100 \
  --early_stop 20 \
  --model_save_to ./fusion_model.pth 

Command Line Arguments:

  • --batch_size: Size of each training batch.
  • --lr: Learning rate for the training process.
  • --epochs: Number of epochs for model training.
  • --early_stop: Early stopping criteria to prevent overfitting.
  • --model_save_to: Path where the trained model will be saved.
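For reference, a plausible argparse setup matching these flags (a sketch only; the repository's main.py may define them differently):

import argparse

def parse_args():
    parser = argparse.ArgumentParser(description="Train the multimodal fusion (adapter) model")
    parser.add_argument("--batch_size", type=int, default=64, help="size of each training batch")
    parser.add_argument("--lr", type=float, default=5e-5, help="learning rate")
    parser.add_argument("--epochs", type=int, default=100, help="number of training epochs")
    parser.add_argument("--early_stop", type=int, default=20,
                        help="stop if validation loss has not improved for this many epochs")
    parser.add_argument("--model_save_to", type=str, default="./fusion_model.pth",
                        help="path where the trained model will be saved")
    return parser.parse_args()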

Usage

To illustrate, suppose you have a new set of data for sentiment analysis and the trained fusion model saved as ./fusion_model.pth. You can analyze the sentiment of this data by executing:

python demo.py --fusion_model ./fusion_model.pth 
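Internally, demo.py can be expected to load the saved weights and map the model's regression output back to a sentiment class. A hedged sketch, reusing the hypothetical FusionModel from the Model section and random placeholder features:

import torch

# Reuses the illustrative FusionModel sketch from the Model section.
model = FusionModel()
model.load_state_dict(torch.load("./fusion_model.pth", map_location="cpu"))
model.eval()

# Placeholder features; in practice these come from BERT, audio, and vision extractors.
text_feat = torch.randn(1, 768)
audio_feat = torch.randn(1, 33)
vision_feat = torch.randn(1, 709)

with torch.no_grad():
    score = model(text_feat, audio_feat, vision_feat).item()

# Map the regression output to the CH-SIMS three-class scheme.
label = "Negative" if score < 0 else ("Positive" if score > 0 else "Neutral")
print(f"score={score:.3f} -> {label}")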

Results

  • CH-SIMS
Model     | From                        | Acc_3 | F1-score_3
----------|-----------------------------|-------|-----------
EF LSTM   | MultimodalDNN               | 54.27 | 38.18
LF DNN    | MultimodalDNN               | 70.20 | 65.29
TFN       | Tensor-Fusion-Network       | 65.95 | 62.04
LMF       | Low-rank-Multimodal-Fusion  | 66.87 | 62.46
MFN       | Memory-Fusion-Network       | 54.14 | 67.57
Graph MFN | Graph-Memory-Fusion-Network | 68.44 | 63.44
MulT      | Multimodal-Transformer      | 68.27 | 64.23
MISA      | MISA                        | 67.05 | 60.98
MLF DNN   | MMSA                        | 70.37 | 65.94
MTFN      | MMSA                        | 70.28 | 66.44
MLMF      | MMSA                        | 71.60 | 70.45
Ours      | this repository             | 72.87 | 71.03

Contributing

We welcome contributions to the Chinese-Multimodal-Sentiment-Analysis repository. If you have suggestions, bug reports, or want to contribute code or documentation, please submit a pull request or open an issue.

License

This project is released under the Apache License Version 2.0.

The Apache License 2.0 is a permissive open-source license, widely used across open-source projects, that permits both commercial and non-commercial use, modification, and distribution of the software.

A copy of the license is available at https://www.apache.org/licenses/LICENSE-2.0.

Please review the license terms to understand what you can and cannot do with the source code and the documentation.

Acknowledgments

This repository builds upon the work and findings of various research papers and datasets, including the CH-SIMS dataset and associated research. We thank all the contributors and researchers in the field for their valuable insights and contributions.
