Saiteja Talluri
email:
cs.jtti.citaa.b@esenaii unscramble
CV |
Resume |
Github
Linkedin |
Twitter
|
I am a final year undergraduate pursuing a major (with honors) in Computer Science and Engineering and a minor in Electrical Engineering at Indian Institute of Technology, Bombay.
My broad interests lie in the areas of Natural Language Processing and Computer Vision. Though I am a little inclined towards NLP, I am passionate to get hands on in other fields, so I completed
courses related to Speech Recognition, Reinforcement Learning, Blockchain and Web Mining.
I am currently working on automatic question generation from paragraphs using hierarchial models and numerical question answering as a part of my Batchelor Thesis
under the guidance of Prof. Ganesh Ramakrishnan. When I am at leisure, I spend my time hanging out with friends, playing table tennis or cricket, going on long tours and trying out different cuisines. ( Shh!! Don't say to anyone I am a foodie ).
|
News
- I will be joining Optiver (HQ, Amsterdam) as a Software Engineer from Fall this year.
- Paper on question generation using hierarchial models submitted at ACL 2020.
- Completed the GSoC 2019 for OpenCV by developing python bindings to their facial landmark detection API.
|
![hier-QG](files/qg-hier/hier_QG.png) |
Question Generation from Paragraphs : A Tale of Two Hierarchical Models
Vishwajeet Kumar, Raktim Chaki, Sai Teja Talluri , Ganesh Ramakrishnan, Yuan-Fang Li, Gholamreza Haffari
ACL 2020 (under review)
webpage |
pdf |
abstract |
bibtex |
arXiv
Automatic question generation from paragraphs is an important and challenging problem, particularly due to the long context from paragraphs. In this paper, we propose and study two hierarchical models for the task of question generation from paragraphs. Specifically, we propose (a) a novel hierarchical BiLSTM model with selective attention and (b) a novel hierarchical Transformer architecture, both of which learn hierarchical representations of paragraphs.
We model a paragraph in terms of its constituent sentences, and a sentence in terms of its constituent words. While the introduction of the attention mechanism benefits the hierarchical BiLSTM model, the hierarchical Transformer, with its inherent attention and positional encoding mechanisms also performs better than flat transformer model.
We conducted empirical evaluation on the widely used SQuAD and MS MARCO datasets using standard metrics.
The results demonstrate the overall effectiveness of the hierarchical models over their flat counterparts.
Qualitatively, our hierarchical models are able to generate fluent and relevant questions.
@misc{
kumar2019question,
title={Question Generation from Paragraphs:
A Tale of Two Hierarchical Models},
author={Vishwajeet Kumar
and Raktim Chaki and Sai Teja Talluri
and Ganesh Ramakrishnan and Yuan-Fang Li
and Gholamreza Haffari},
year={2019},
eprint={1911.03407},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
|
![speech2face](files/speech2face/model.png) |
Extraction of Facial Features from Speech
Saiteja Talluri, Ankit, Neelesh Verma
Automatic Speech Recognition, Autumn '19, IIT Bombay.
pdf |
presentation |
abstract |
code
In this project, the main motivation was to infer about a person's look from the way they speak. We design and train a deep neural network to
perform this task using thousands of natural YouTube videos of people speaking. During training, our model learns voice-face correlations and
then we used it for voice recognition to evaluate the efficiency of our model. The training is done in a self-supervised manner, by utilizing
the natural co-occurrence of faces and speech in Internet videos, without the need to model attributes explicitly.
|
![captioning](files/context-aware-captioning/captioning.png) |
Context-aware Captions from Context-agnostic Supervision
Saiteja Talluri, Suraj Soni, Jagadeep Sai
Computer Vision, Spring '19, IIT Bombay.
pdf |
abstract |
code
Our objective is to produce pragmatic, context aware descriptions of images (i.e., captions that describe differences between images or visual concepts)
using context agnositic data (i.e., captions that describe a concept or an image in isolation). We attempt the following two problems.
1. Justification : Given an image, a target (ground-truth) class, and a distractor class, describe the target image to explain why it belongs to the target class, and not the distractor class.
2. Discriminative image captioning : Given two similar images, produce a sentence to identify a target image from the distractor image.
|
![inpainting](files/vector-valued-regularization/inpainting.png) |
Vector-Valued Image Regularization with Partial Differential Equations
Saiteja Talluri, Shreyas Pimpalgaonkar, Meet Kathiriya
Digital Image Processing, Spring '18, IIT Bombay.
pdf |
abstract |
code
We focussed on techniques for vector-valued image regularization, based on variational methods and partial differential equations.
We performed denoising, image reconstruction, inpainting, magnification and flow visualization on color images using these techinques and presented our results and conclusions.
|
![dram](files/architectural-attacks/dram.png) |
Software-based Microarchitectural Attacks
Saiteja Talluri, Suraj Soni, Jagadeep Sai, Chaithanya Naik, Aditya Mahto
Computer Architecture, Fall '18, IIT Bombay.
pdf |
abstract |
code
Modern processors are highly optimized systems where every single cycle of computation
time matters. Many optimizations depend on the data that is being processed. Softwarebased microarchitectural attacks exploit effects of these optimizations. Microarchitectural
fault attacks exploit the physical imperfections of modern computer systems. We show
that imperfections of the hardware, introduced by optimizations on a microarchitectural
level, undermine system security and software security. The hardware leaks part of its
internal state including potentially secret information through differences in behavior and
timing. We discuss FLUSH+RELOAD attack, Cache Template attack and Cross-CPU attacks exploiting DRAM addressing.
|
![landmarks](files/panasonic/face_landmarks.jpg) |
End to End pipeline for Digital Signage Analytics
Machine Learning Intern, Panasonic Innovation Center
Bangalore, India, Summer 2019
abstract |
code (private)
The main goal of the research is to analyse the digital signage video to get insights regarding the gender, age and emotion of the audience and integrate facial recognition to identify
face id and performed facial clustering to add new face ids and use this information to estimate the reach of the advertisements. We were also able to implement gaze tracking to estimate the
average attention time of audience on the digital signage board. Most of this research has been inspired from papers on faial recognition, detection and gaze tracking.
|
![segementtation](files/rupeek/segmentation.png) |
Stone Deduction from Jewellery using Image Segmentation
Machine Learning Intern, Rupeek FinTech
Bangalore, India, Winter 2017
abstract |
report |
code (private)
In this project we developed a prototype to automate the calculation of stone deduction of jewellery using vision and image processing. We used background subtraction algorithms like Rolling Ball Algorithm
for efficient detection of background and employed clustering algirithms like Kmean, DBScan and HDBScan for noise removal in the result. Later we used AWS S3 for raw image storage and deployed the prototype as
a web API using Node.JS framework on AWS EC2 instance.
|
![landmarks](files/gsoc/face_landmarks.gif) |
Facial Landmark Detection
Guide : Satya Mallick, Interim CEO, OpenCV
Google Summer of Code, OpenCV, 2019
project |
report |
abstract |
code |
video |
certificate
Facial feature detection and tracking is a high value area of computer vision since humans are interested in what humans are paying attention to, feeling, and enhancing face pictures in selfies etc.
So the main aim of this project is to make it a standard "built in" ability in OpenCV that computer vision users can just call/rely on. The major outcomes of this project is to create python friendly code
and make available a smaller model with lesser points ( e.g. 5 instead of 68 ) for mobile applications with support for face stabilization.
|
Selected Awards
- Teaching Assistant of the Semester Award (Fall '19) at IIT Bombay
- Teaching Assistant of the Month Award (Oct'19) at IIT Bombay
- Gold medal at OCSC for International Physics Olympiad (IPhO) held in Mumbai (2016)
- Infosys award for exceptional performance in International Olympiads (2014)
- Gold medal at 11th International Junior Science Olympiad (IJSO) held in Argentina (2014)
- Kishore Vaigyanik Protsahan Yojana (KVPY) Fellowship from Govt. of India (2014)
- National Talent Search Examination (NTSE) Scholarship from Govt. of India (2012)
|
Key Positions
- Head of Computer Science Department Academic Mentorship Program (2019-20)
- Batch Representative for the CSE Class of 2020 for three consecutive years (2017,18,19)
- Member of the Institute Data Analytics and Visualization Team, IIT Bombay (2018-19)
- Summer of Science Mentor of Machine Learning and Computer Vision under MnP club, IIT Bombay (2019-20)
|
|