Conversational-Bot

Conversational Bots

• Literature Survey:

Neural Responding Machine for Short-Text Conversation , Link: http://arxiv.org/pdf/1503.02364v2.pdf
Predicting the next sequence given the previous sequence or sequences using recurrent networks, Link: https://arxiv.org/pdf/1410.8206v4.pdf
Neural Conversational Model, Link: http://arxiv.org/pdf/1506.05869v3.pdf
Named Entity Recognition in Tweets, Link: http://turing.cs.washington.edu/papers/ritter-emnlp2011-twitter_ner.pdf
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses, Link: http://arxiv.org/pdf/1506.06714v1.pdf
Attention with Intention for a Neural Network Conversation Model, Link: http://arxiv.org/pdf/1510.08565v3.pdf
How to Generate a Good Word Embedding, Link: http://arxiv.org/pdf/1507.05523v1.pdf

• High Level Architecture:

Data Collection:

This could be done using the data available online using fashion domain websites or through some open source data, if available.
We can create our own data which will be like a conversation between two people related to fashion domain.

Word Embeddings:

Used Gensim to get the word embeddings.

Clustering:

Used K-means clustering to cluster semantically similar words.

• Sample Conversation: U: Hi C: Hello U: I’m looking for green shirts with floral pattern. C: This is what I’ve found for you … U: I’m looking for a small sized shirt. C: This is what I have found for you…

• Intent Identification in the query made by the user:

Currently we are focussing on “Search” as our intent, which we’ll be extending later.
To identify our intent, we’re making use of POS in the query and lemmatization while pre-processing the query. POS is used for performing the lemmatization.
After this, we get a Part of Speech tag for each word in the query. This is used in the later stages.

• Approach for identifying entity (in the query) cluster:

Type of clothing (TOC): Our system trains on the type of data which should cluster all the clothing types together. Once the user query comes in, we can search for all the nouns from the query (since TOC should be a noun). The noun which is present in the TOC cluster is our clothing type. Our API can only identify “dresses” as TOC at the moment. So, we can categorise them into “dress” and “not-a-dress”.
Color: After embeddings we should get a color cluster. Identify all the nouns in the query. Use those words to search in the cluster.
Length: Three available, maxi, midi, mini. Need to identify these using a predefined dictionary.
Size: Four available, S, M, L and XL. Need to identify these using a predefined dictionary.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Conversation-Bot		Conversation-Bot
README.md		README.md
chat.py		chat.py
clustering.py		clustering.py
intent_identification.py		intent_identification.py
remove_duplicates_from_details.py		remove_duplicates_from_details.py
run_bot.py		run_bot.py
word_embeddings.py		word_embeddings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Conversational-Bot

About

Uh oh!

Releases

Packages

Languages

vunb/Conversation-Bot

Folders and files

Latest commit

History

Repository files navigation

Conversational-Bot

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages