Madhav Mahesh Kashyap

Graduate Student | Software Developer | Researcher

Aspiring Machine Learning Engineer
Passionate about integrating cutting-edge research into real-world engineering applications
NLP + CS Graduate student at the University of Washington, Seattle
Ex Software Engineer 2 at Akamai Technologies
Bachelor's in Computer Science from PES University, Bangalore
Published research on self-supervised speech denoising and evolutionary algorithms
In my free time you can find me hiking or practicing Carnatic music.

Industry Experience

Akamai Technologies

Bengaluru, India
Software Engineer 2
Dec 2022 – Aug 2023

.

Software Engineer 1
Jul 2021 – Nov 2022
Working in the core backend team of the zero-trust cybersecurity product "Enterprise Application Access". My responsibilities include end-to-end delivery of features enhancing network security and increasing scalability.
Intern
Jan 2021 – Jun 2021
Jun 2020 – Jul 2020

Implemented Django middleware to reduce master database load by rerouting SQL queries to alternate eventually consistent read-only replica databases. Resulted in a 40% reduction in the primary database’s CPU load.

Designed agent inline security upgrade pipeline to keep network-isolated systems up to date with the latest Debian package patches. My pipeline assisted with the Ubuntu 18.04 to 22.04 OS upgrade on 5,000+ running production agents.

Led development of configurable DNS feature enabling customers to set which public and internal DNS upstream servers their agents utilize. The dnsmasq based design worked on heterogeneous agent platforms including Virtualbox, Azure etc.

Conducted 8 technical interviews for Intern and SDE-1 positions.

Tantragyaan

Bengaluru, India
Intern
Aug 2018 – Jul 2019

Predicted systolic and diastolic blood pressures non-invasively for use in the ”Swasthya” cuffless health monitoring watch. Achieved 4.6% standard error using an 8th-degree polynomial regression model.

Developed an Angular2+SpringBoot full-stack web app to plot real-time patient biometric charts. Data was periodically polled from a FireStore database to update the frontend chart.

Publications

Speech Denoising without Clean Training Data: a Noise2Noise Approach

Interspeech 2021
Sep 2021
Brno, Czech Republic

This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio-denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples. Conventional wisdom dictates that in order to achieve good speech denoising performance, there is a requirement for a large quantity of both noisy speech samples and perfectly clean speech samples, resulting in a need for expensive audio recording equipment and extremely controlled soundproof recording studios. These requirements pose significant challenges in data collection, especially in economically disadvantaged regions and for low resource languages.

This work shows that speech denoising deep neural networks can be successfully trained utilizing only noisy training audio. Furthermore it is revealed that such training regimes achieve superior denoising performance over conventional training regimes utilizing clean training audio targets, in cases involving complex noise distributions and low Signal-to-Noise ratios (high noise environments). This is demonstrated through experiments studying the efficacy of our proposed approach over both real-world noises and synthetic noises using the 20 layered Deep Complex U-Net architecture.

PyTorch implementation open-sourced at GitHub with 100+ stars and 13 citations.

Exam Seating Allocation to Prevent Malpractice Using Genetic Multi-optimization Algorithm

SoMMA 2020
Springer CCIS
Oct 2020
Chennai, India

Despite unceasing debate about it’s pros and cons, exams and standardized testing have emerged as the main mode of evaluation and comparison in our increasingly competitive world. Inevitably, some examinees attempt to illegally gain an unfair advantage over other candidates by indulging in cheating and malpractice. Even a single case of examination malpractice can destroy an Examination body’s credibility and even lead to costly and time-consuming legal proceedings.

Our paper attempts to strategically allot examinees in specific seats and rooms, such as to mitigate the overall probability of malpractice. It involves examining multiple crucial factors such as subject similarity, distancing between examinees, and human field of vision to find the most optimal seating arrangement. We have exploited the property of Evolutionary Genetic Algorithms to find globally optimal or close to optimal solutions in an efficient time for this otherwise NP-complete permutation problem.

Education

M.Sc in Computational Linguistics
University of Washington - Seattle, USA
Sep 2023 – Present
B.Tech in Computer Science
PES University - Bengaluru, India
Aug 2017 – Mar 2021
9.06 CGPA
Cambridge AS and A Levels
Sri Vani International School, Bengaluru, India
91.7 % (Physics-Chemistry-Mathematics)
Cambridge IGCSE
Sri Vani International School, Bengaluru, India
92.0 %

Technical Skills

Primary Languages | Python, SQL Secondary Languages | Golang, Java PyTorch, Django, Docker, RabbitMQ, Celery, ProtoBuf messaging, Dnsmasq, Bash scripting AWS EC2 & RDS, Google Cloud IOT Core & Firestore DB

Awards | Projects | Memberships

International Speech Communication Association - Student Advisory Committee Volunteer
Jan 2022 - Present
Organized 50+ one-to-one and round-table mentoring sessions at the Interspeech 2022 & 2023 conferences by inviting mentors from academia and industry and matching them with suitable student mentees. Personally responsible for creating backend Django infrastructure to parse and store mentor-mentee info in AWS RDS database.
Link to ISCA-SAC Member page
Link to GitHub source code
2nd Place at Intel Student Contest
₹ 45,000 Cash Award
Jul 2019
Engineered an open-domain question answering VoiceBot capable of comprehending spoken English questions, searching in offline and online corpora, and speaking out the most appropriate answer phrase. End-to-end integration of DeepSpeech speech2text, Spacy named entity recognition, BERT-based QA model, and Tacatron text2speech models built on top of a Flask web server architecture.
Link to GitHub source code
Interspeech 2021 Travel Grant
€ 420 Cash Amount and 1-year ISCA Membership
Jun 2021
Awarded to select student authors to support their travel and participation at the Interspeech 2021 conference.
CNR Rao & MRD Merit Scholarship
30% Tuition Fee Waiver for Top 20% of Students
3rd Semester 2018, 6th Semester 2020, 7th Semester 2020
Carnatic Classical Music Junior Certification in Vocal Category
2016

Contact Information

You can get in touch with me using:

Download Resume : Download My Resume