Qualification:
At least a BS in Computer Science or Software Engineering
Experience:
Minimum 6 months to 1 year of experience working with textual data (news corpora, blogs, customer feedback/reviews) and social network streams.
Experience working with multilingual data is a plus.
Required Skills:
Must know both the conventional techniques (RegEx, PoS tagging, chunking, dependency parsing, wordnets, etc.) and the recent techniques/algorithms used in predictive analytics for natural languages.
Must know information extraction and information retrieval techniques.
Familiar with sequence processing tasks such as machine translation, speech transliteration, and speech transcription.
Hands-on skills in data clustering and multi-class classification with Scikit-learn, NLTK, Scrapy/BeautifulSoup, TensorFlow/PyTorch, Pandas, NumPy, and SciPy. Knowledge of SVN is mandatory.
Knowledge of test-driven development and A/B testing is a plus.
Technical writing skills and fluency in English are a plus.