Nour-rabih/gulf-Diacritizer


This project was developed during SibaqLahja, where it was awarded best diacritizer. We present a new public diacritized dataset for Gulf Arabic that follows the pronunciation of the city of Dubai in the United Arab Emirates (UAE). The dataset is a 19,850-word subset of the Gumar corpus (Khalifa et al., 2018), which is composed of roughly 200 thousand words from Emirati internet novels.

Gulf Dialect Diacritizer

A machine learning model that adds diacritics to Emirati text.

Installation

Use the package manager pip to install the dependencies listed in requirements.txt.

pip install -r requirements.txt

Usage

Pass any non-diacritized Emirati text, in quotation marks, as the argument to main.py:

python main.py 'السلام عليكم'
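Because main.py expects non-diacritized input, it can help to strip any existing diacritics from the text first. The following is a minimal sketch of one way to do that, not part of this repository: the names `ARABIC_DIACRITICS`, `has_diacritics`, and `strip_diacritics` are hypothetical, and the sketch assumes that only the common Arabic marks in the Unicode range U+064B to U+0652 (tanween, fatha, damma, kasra, shadda, sukun) need to be removed.

```python
# Hypothetical helper, not part of this repository.
# Assumption: the diacritics to remove are the Arabic marks
# U+064B (fathatan) through U+0652 (sukun).
ARABIC_DIACRITICS = {chr(cp) for cp in range(0x064B, 0x0653)}


def has_diacritics(text: str) -> bool:
    """Return True if the text contains any of the marks above."""
    return any(ch in ARABIC_DIACRITICS for ch in text)


def strip_diacritics(text: str) -> str:
    """Return the text with the marks above removed; other characters are kept."""
    return "".join(ch for ch in text if ch not in ARABIC_DIACRITICS)


if __name__ == "__main__":
    import sys

    # Print the stripped form of each command-line argument.
    for arg in sys.argv[1:]:
        print(strip_diacritics(arg))
```

The stripped string can then be passed to main.py as shown above.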
