Utkarsh Jain

Utkarsh Jain

Graduate Student (MS CS)

University of California San Deigo

Biography

Last updated on: 16th June, 2023

Hi! My name is Utkarsh Jain, and I’m currently pursuing a graduate degree in Computer Science at the University of California San Diego. My research focuses on the development of biologically inspired Convolutional Neural Networks (CNNs) under the guidance of Prof. Gary Cottrell and Prof. Virginia De Sa. This exciting field lies at the intersection of Cognitive Science and Computer Science, and it has yielded numerous fascinating discoveries. One particularly intriguing finding is that as CNNs improve their performance in image categorization tasks, their internal representations become increasingly similar to those found in the human brain. Currently, I am investigating whether further aligning CNN representations with brain-like patterns can enhance their performance.

Prior to joining UC San Diego, I worked as a researcher at the Open Source Intelligence Lab in the Indian Institute of Technology Guwahati under the guidance of Prof. Sanasam Ranbir Singh. During my time there, I delved into the application of Natural Language Processing (NLP) and Graph Neural Networks to predict the home locations of Twitter users. This research had significant implications for various online services, such as targeted advertising, opinion mining, and event detection. It was my first foray into NLP, and I thoroughly enjoyed the experience of learning new concepts and collaborating with talented individuals.

I completed my Bachelor of Technology in Computer Science and Engineering at the Indian Institute of Technology Mandi. As part of a semester exchange program, I had the privilege of studying at Aalto University in Finland, where I had the opportunity to explore several captivating courses. It was during this period that I discovered my passion for mathematics and made the decision to pursue a career in Machine Learning. Upon returning from Aalto University, I worked with Prof. Aditya Nigam on investigating the applications of Reinforcement Learning (RL) in the task of Image Classification. Our research revealed that while RL performed equally well as CNNs in classifying images, it is significantly less time-efficient. Also, it is tricky to convert a supervised problem into a reward-based problem. How do you design an efficient reward function for image classification? And what does the environment look like?. Furthermore, I had the privilege of collaborating with Prof. Mousa Marzband at Northumbria University, United Kingdom, where we applied time-series modeling to weather now-casting.

Over the past few years, I have experienced profound moments of enlightenment that have shaped my life’s trajectory. I discovered my passion (or did I develop it?) which has ignited a deep sense of purpose within me, and I am now determined to pursue it wholeheartedly. I have come to recognize the inherent unpredictability that permeates our existence and the significant influence of circumstance and luck in shaping our paths. My ongoing journey has been a tremendous source of learning, with one of the most valuable lessons being the art of embracing my humanity.

Interests

  • Biologically-inspired AI
  • Computer Vision
  • Natural Language Processing

Education

  • Master of Science in Computer Science, 2022 - 2024

    University of California San Diego

  • B.Tech in Computer Science and Engineering, 2017 - 2021

    Indian Institute of Technology Mandi

  • Exchange Studies, Master's Programme in Computer, Communication and Information Sciences, 2019 - 2020

    Aalto University, Finland

Experience

 
 
 
 
 

Junior Research Fellow

Indian Institute of Technology Guwahati

Dec 2021 – May 2022 Assam, India

I worked on using Natural Langauge Processing and Graph Neural Networks in predicting the home locations of Twitter users using their tweets, profile metadata, and social network graph. This research has significant implications for various online services, such as targeted advertising, opinion mining, and event detection. Advised by Prof. Sanasam Ranbir Singh.

  1. Built Bi-LSTM and BERT baseline models in PyTorch and Python to predict the home location of Twitter users and achieved an accuracy of 36% and a Mean Absolute Error (MAE) of 703 by using users’ tweets.

  2. Increased accuracy to 58% and reduced MAE to 516 by incorporating tweet metadata and user-mention network and using field-level Attention layers and Transformer-encoder for feature fusion.

  3. Developed a loss function to capture hierarchical relationships among geolocations which improved accuracy by 7%.

 
 
 
 
 

Research Assistant

Indian Institute of Technology Mandi

Apr 2021 – Jul 2021 Himachal Pradesh, India

This was my first formal experience in Machine Learning. The goal of this project was to explore the usage of Reinforcement Learning in the image classification task and compare its efficacy with existing Supervised approaches. Advised by Prof. Aditya Nigam.

  1. Designed a Reinforcement Learning (RL) based Image Classification pipeline in TensorFlow and attained an accuracy of 92% on the MNIST dataset by using Dueling Deep Q-Network. Increased accuracy by 3% with Proximal Policy Optimization.

  2. Reduced training time by 40% by implementing Classification with Costly Features training procedure.

  3. Conducted evaluations on various benchmark datasets and observed competitive performance wrt. supervised methods.

In a nutshell, we concluded that RL can be in fact used for image classification but demands significantly longer training time and is more complex to implement as compared to a simple, yet very effective, CNN model.

 
 
 
 
 

Research Assistant

Northumbria University

Aug 2020 – Oct 2020 Newcastle upon Tyne, England

This was my first research experience and I worked on using statistical and machine learning approaches for solar nowcasting. Advised by Prof. Mousa Marzband.

  1. Explored various auto-regressive and exponential smoothing models to forecast solar power generation.

  2. Worked on Statistical Time-Series Models, and an ensemble of LSTM Encoder-Decoder to predict univariate and multivariate time series.

  3. Used Approximate Bayesian Computation coupled with MCMC to build non-linear univariate and bivariate time series models. Achieved a Mean Average Percentage Error of 17%.

 
 
 
 
 

Software Engineer Intern

The Solar Labs

Jun 2020 – Aug 2020 Delhi, India

I worked as a Software Development Engineer Intern in the Generation Engine team of The Solar Labs.

  1. Improved Shadow Analysis software runtime by 15% by optimizing the data processing pipeline and migrating CPU-intensive jobs over to GPU with CUDA and Python.

  2. Implemented a stochastic disaggregation procedure for generating synthetic sets of hourly solar irradiation values, suitable for use in solar simulation design work. Achieved a mean percentage error of 40%.

  3. Proposed and developed a computationally inexpensive model for synthetic data generation which performed better than previously published works and brought down the average error to 5%. Resulted in a 35% increase in customer satisfaction and annual savings of $100,000 in outsourcing costs.

  4. Integrated and tested the model with the main production software for client use.

 
 
 
 
 

Software Engineer Intern

Indian Space Research Organization

Dec 2018 – Feb 2019 Punjab, India

Worked as a Software Engineer in the the Information Technology and Networking Division (IT&ND).The main aim was to automate the data collection process from several metrology tools in the production lab. This data holds importance in the later stages of fabrication where it is used for process control and tool health monitoring.

  1. Used SECS/GEM protocol for equipment-to-host data communications to automate the data submission procedure into the database to accelerate wafer production.

  2. Created databases, data entry systems, web forms, and other applications for diverse uses by engineers.

Positions of Responsibility

 
 
 
 
 

Teaching Assistant

CS-100: Advanced Data Structures

Feb 2021 – Jun 2021 University of California San Diego
 
 
 
 
 

Teaching Assistant

CS-307: System Practicum

Feb 2021 – Jun 2021 Indian Institute of Technology Mandi
 
 
 
 
 

Web Coordinator

Sports Society, Student Gymkhana

Aug 2020 – Jul 2021 Indian Institute of Technology Mandi
 
 
 
 
 

Teaching Assistant

CS-3150: Software Engineering

Jan 2020 – Jul 2020 Aalto University
 
 
 
 
 

Teaching Assistant

CS-207: Applied Database Practicum

Jan 2019 – Jun 2019 Indian Institute of Technology Mandi
 
 
 
 
 

Web Coordinator

IEEE Student Chapter

Aug 2018 – Jun 2019 Indian Institute of Technology Mandi
 
 
 
 
 

Volunteer

National Service Scheme

Aug 2017 – Jul 2019 Indian Institute of Technology Mandi

Contact