Biography

I am a Technical (Research) Associate at the Robert Bosch Centre for Cyber-Physical Systems (RBCCPS), Indian Institute of Science (IISc). My research interests include Scientific Machine Learning, Natural Language Processing, Reinforcement Learning, Robotics, Human-Robot Interaction and Multi-modal Machine Learning.

I am currently working on a project to enable richer, more natural interactions with social robots. My contributions include two-way remote speech communication with lip synchronization based on phoneme segmentation, and speech-to-text conversion based on the Wav2Vec2 architecture, followed by NLP-based semantic action identification and execution on the social tele-robot “Asha”. Part of the work is open-sourced as the sonorus PyPI package and the imperio GitHub repository. As a next step, beyond tele-operation, I am working towards language-based (NLP) instruction execution through reinforcement learning (RL).

Our team “Aham” (Asha) was one of the finalists in the competition track of the International Conference on Social Robotics (ICSR) 2021 and was among the thirty-eight semi-finalists in the ANA Avatar XPRIZE competition.

I am also a co-first author of the paper “MIMICause: Representation and automatic extraction of causal relation types from clinical notes”, which was accepted to the Findings of the Association for Computational Linguistics (ACL 2022). The research defines a causal annotation schema that focuses on the types of causality and the direction of interaction, as communicated in the text, between cause and effect biomedical concepts/entities. We created the annotated “MIMICause” corpus from the MIMIC-III clinical notes and the National NLP Clinical Challenges (n2c2) shared task dataset, and trained baselines using BERT and Clinical-BERT based language models.

Previously, at Hewlett-Packard (HP) Inc., under the guidance of Dr. Niranjan Damera-Venkata, I worked on customer review analysis using a self-attention-based LSTM network and on predicting printer part failures with an LSTM-based architecture trained with a hybrid loss based on multi-instance learning. I received a Master’s degree from the Department of Computational and Data Sciences (CDS), Indian Institute of Science (IISc), Bangalore, where I worked on information-theoretic graph characterization and generation of non-isomorphic molecular graph structures of potential drug molecules, as well as Shannon-entropy-based prediction of the chemical properties of hydrocarbons, under the supervision of Prof. Debnath Pal and Dr. Chandan Raychaudhury. My B.Tech. thesis, supervised by Prof. Subrata Kumar Ghosh at IIT (ISM), Dhanbad, developed an improved mathematical model of thermal conductivity for nanofluids based on a linear conductivity gradient across the interfacial nano-layer between the nanoparticles and the base fluid, and solved the resulting steady-state heat conduction differential equation.

Interests
  • Scientific Machine Learning
  • Natural Language Processing
  • Reinforcement Learning
  • Robotics
  • Human-Robot Interaction
  • Multi-modal Machine Learning
Education
  • M.Tech, Computational & Data Sciences (CDS), 2015-2017

    Indian Institute of Science (IISc)

  • B.Tech, Mechanical Engineering, 2008-2012

    Indian Institute of Technology (Indian School of Mines), IIT-ISM

Recent News

[2022-07-19] Our paper titled “Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach” was published in the Journal of Medical Internet Research (JMIR) Medical Informatics 2022.

[2022-02-24] Our paper titled “MIMICause: Representation and automatic extraction of causal relation types from clinical notes” was accepted to the Findings of the Association for Computational Linguistics, ACL 2022.

[2021-11-17] Our entry – “ASHA: A Tele-operated robot nurse” secured a runner-up position in the “Innovation in Responses to COVID-19” category of the International Conference on Social Robotics (ICSR) 2021 Robot Design Competition.

Projects

Language identification using machine learning

A machine learning model that identifies 20 diverse languages and reports ‘Other’ for languages on which the model was not trained. The approach uses a character-level TF-IDF featurizer (n-grams taken within word boundaries) followed by a Multinomial Naive Bayes classifier.
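
The pipeline described above can be sketched roughly as follows. This is a minimal illustration, assuming scikit-learn; the library choice, the toy training sentences, the n-gram range, and the probability threshold used to report ‘Other’ (`min_confidence`) are all assumptions made for the example and are not taken from the project itself.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy training data (hypothetical); the project itself covers 20 languages.
TRAIN_TEXTS = [
    "the quick brown fox jumps over the lazy dog",
    "this is a short sentence written in english",
    "el rápido zorro marrón salta sobre el perro perezoso",
    "esta es una frase corta escrita en español",
    "der schnelle braune fuchs springt über den faulen hund",
    "dies ist ein kurzer satz auf deutsch geschrieben",
]
TRAIN_LABELS = ["en", "en", "es", "es", "de", "de"]


def build_model():
    """Fit a character TF-IDF + Multinomial Naive Bayes pipeline."""
    model = make_pipeline(
        # analyzer="char_wb" builds character n-grams inside word boundaries only.
        TfidfVectorizer(analyzer="char_wb", ngram_range=(1, 3)),
        MultinomialNB(),
    )
    model.fit(TRAIN_TEXTS, TRAIN_LABELS)
    return model


def identify(model, text, min_confidence=0.6):
    """Return a language label, or 'Other' when the classifier is unsure.

    The thresholding rule is an assumption for this sketch; the source does
    not say how 'Other' is decided.
    """
    probs = model.predict_proba([text])[0]
    best = probs.argmax()
    if probs[best] < min_confidence:
        return "Other"
    return str(model.classes_[best])
```

A call such as `identify(build_model(), "where is the dog")` returns one of the trained labels or ‘Other’; with so little training data the threshold would need tuning before the output means much.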

Retrofitting Word Vectors to Semantic Lexicons

A vectorized, iterative implementation of the paper “Retrofitting Word Vectors to Semantic Lexicons”, in which vector space word representations are further refined using relational information from semantic lexicons.
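
A rough sketch of what a vectorized retrofitting update can look like, assuming NumPy. It uses the paper's usual weighting (alpha = 1 for the original vector, beta = 1/degree for each lexicon neighbour), but updates all words at once in each iteration rather than one word at a time, which is a simplification chosen for this example. The function name, the dense 0/1 adjacency matrix, and the defaults are hypothetical.

```python
import numpy as np


def retrofit(q_hat, adjacency, n_iters=10, alpha=1.0):
    """Pull each word vector towards the mean of its lexicon neighbours.

    q_hat:     (n_words, dim) original word vectors.
    adjacency: (n_words, n_words) symmetric 0/1 matrix; 1 marks a lexicon
               relation between two words (hypothetical input format).
    """
    q_hat = np.asarray(q_hat, dtype=float)
    adjacency = np.asarray(adjacency, dtype=float)
    degree = adjacency.sum(axis=1, keepdims=True)

    # beta_ij = 1 / degree(i) for neighbours; rows of isolated words stay zero.
    beta = np.divide(adjacency, degree,
                     out=np.zeros_like(adjacency), where=degree > 0)
    beta_sum = beta.sum(axis=1, keepdims=True)

    q = q_hat.copy()
    for _ in range(n_iters):
        # Weighted average of the neighbours' current vectors and the original vector.
        q = (beta @ q + alpha * q_hat) / (beta_sum + alpha)
    return q
```

Words with no lexicon neighbours keep their original vectors, while related words move closer together without drifting far from where they started.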

Parallel Hybrid (CUDA and MPI) implementation of Support Vector Machines (SVM)

Parallel implementation of SVM with cascading using MPI and Sequential Minimal Optimization (SMO) using CUDA.

Real-time probabilistic count (Durand-Flajolet) implementation using Apache Storm

Real-time analytics use case implementation for probabilistic distinct count using distributed stream processing (Apache Storm).
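
For illustration, the counting step can be sketched as a LogLog-style estimator in plain Python. This is a standalone sketch under stated assumptions, not the Storm topology itself: the class name, the SHA-1 based 64-bit hash, and the default of 1024 buckets are choices made for the example, and the asymptotic constant 0.39701 is used in place of the exact bucket-dependent correction.

```python
import hashlib

# Asymptotic bias-correction constant for LogLog counting (used here as an
# approximation for any reasonably large number of buckets).
ALPHA_INF = 0.39701


class LogLogCounter:
    """Approximate distinct counter that stores one small rank per bucket."""

    def __init__(self, bucket_bits=10):
        self.bucket_bits = bucket_bits
        self.m = 1 << bucket_bits          # number of buckets
        self.registers = [0] * self.m      # max rank seen per bucket

    def add(self, item):
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")      # 64-bit hash value
        rest_bits = 64 - self.bucket_bits
        bucket = h >> rest_bits                    # leading bits pick the bucket
        rest = h & ((1 << rest_bits) - 1)          # remaining bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = rest_bits - rest.bit_length() + 1
        if rank > self.registers[bucket]:
            self.registers[bucket] = rank

    def estimate(self):
        mean_rank = sum(self.registers) / self.m
        return ALPHA_INF * self.m * 2 ** mean_rank
```

Adding the same item again never changes the registers, so duplicates in the stream do not affect the estimate. The estimate is only meaningful once the number of distinct items is well above the number of buckets. In a stream-processing setting, a counter like this would typically be held as state by the processing component that receives the events.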

Command line scientific calculator

C++ prototype implementation of a command line scientific calculator for real and complex numbers.

Contact

  • imbugene@gmail.com
  • Indian Institute of Science (IISc), CV Raman Road, Bangalore, KA 560012
  • Robert Bosch Centre for Cyber-Physical Systems (RBCCPS), 3rd Floor, SID Building