Named Entity Recognition for Hindi-English Code-Mixed Social Media Text
Named Entity Recognition for Hindi-English Code-Mixed Social Media Text
Named Entity Recognition (NER) is a major task in the field of Natural Language Processing (NLP), and also is a subtask of Information Extraction. The challenge of NER for tweets lies in the insufficient information available in a tweet. There has been a significant amount of work done related to entity extraction, but only for resource-rich languages and domains such as the newswire. Entity extraction is, in general, a challenging task for such an informal text, and code-mixed text further complicates the process with it’s unstructured and incomplete information. We propose experiments with different machine learning classification algorithms with word, character and lexical features. The algorithms we experimented with are Decision tree, Long Short-Term Memory (LSTM), and Conditional Random Field (CRF). In this paper, we present a corpus for NER in Hindi-English Code- Mixed along with extensive experiments on our machine learning models which achieved the best f1-score of 0.95 with both CRF and LSTM.
Python Demo
Hand Gesture Recognition using Opencv Python
The emerging of need of domestic robots in real world applications has raised enormous need for instinctive and interaction among human and computer interaction (HCI). In a few conditions where humans can’t contact hardware, the hand motion recognition framework more suitable. Hand gesture recognition is used on controlling robots, portable controllers, or application in smart home. Gesture recognition is mainly applicable for video conferencing, sign language recognition, distance learning and in some forensic identification. Based on fingers’ angles relative to the wrist, a finger angle prediction algorithm and a template matching metric are proposed. All possible gesture types of the captured image are first predicted, and then evaluated and compared to the template image to achieve the classification. Unlike other template matching methods relying highly on large training set, this scheme possesses high flexibility since it requires only one image as the template, and can classify gestures formed by different combinations of fingers.
Python Demo
Location Prediction in Twitter using Machine learning Techniques
Location, e.g., Countries, states, cities, and point-of-interests, are central to news, emergency events, and people’s daily lives. Automatic identification of locations associated with or mentioned in documents has been explored for decades. As one of the most popular online social network platforms, Twitter has attracted a large number of users who send millions of tweets on daily basis. Due to the world-wide coverage of its users and real-time freshness of tweets, location prediction on Twitter has gained significant attention in recent years. Research efforts are spent on dealing with new challenges and opportunities brought by the noisy, short, and context-rich nature of tweets. In proposed system, an overall picture of location prediction on Twitter is offered. Specifically, the prediction of user home locations, tweet locations, and mentioned locations is considered. By summarizing Twitter network, tweet content, and tweet context as potential inputs, it is structurally highlighted how the problems depend on these inputs
Location Prediction in Twitter using Machine learning Techniques
—————————————————————————
Implementation Details:
——————————————–
1. Data Collection:
——————————–
1. Collect Live tweet based on location (we used three locations such as Chennai, Mumbai, Kerala)
2. Live tweets are collected as json file
2. Data Pre-processing:
—————————————
Tweet text is pre-processed with following steps
1. Remove extra characters and special characters
2. Convert all words to Capitalize first letter (needed to apply geography package)
3. If tweet location & user location is null, then remove the tweet
4. If tweet location is null, then apply home location as tweet location
5. If no location is mentioned in tweet then remove the tweet
6. Apply Geodata on tweet_text to identify location
7. Apply Lableencoder to convert text (location from tweet_text) to integer value”Lvalue” (e.g Chennai=1, Mumbai=2, Kerala=3)
6. Write the pre-processed dataset as csv file with following features
a. Tweet ID
b. Name
c. Screen name
d. Tweet Text
e. Home location
f. Tweet location
g. Mentioned location
h. Lvalue (location value)
3. Split Training Set & test Set
4. Apply Naive Bayes, SVM and Decision Tree algorithm
5. Calculate and compare accuracy and error values for all above algorithm
Diagnosis of liver diseases using machine learning
Diagnosis of liver diseases using machine learning
Liver Diseases account for over 2.4% of Indian deaths per annum. Liver disease is also difficult to diagnose in the early stages owing to subtle symptoms. Often the symptoms become apparent when it is too late. This paper aims to improve diagnosis of liver diseases by exploring 2 methods of identification patient parameters and genome expression. The paper also discusses the computational algorithms that can be used in the aforementioned methodology and lists demerits. It proposes methods to improve the efficiency of these algorithms.
Python Demo
Handwritten Character Recognition using python opencv
Handwritten character recognition is much evolving technique. In this work, dataset is trained with SVM model. Capital Letters and Small letters of Alphabets are recognized suing opencv.
Python Demo