Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Building a Music Genre Classifier That Actually Understands Sound
After 18 years of pursuing music, I’ve developed an ear for what makes genres distinct. It’s not just the obvious markers—tempo, instrumentation, vocal style. It’s the subtle stuff: the way tension builds in a Carnatic raga versus a jazz standard, the textural differences between layered synthesizers in electronic music versus acoustic instruments in folk, the non-linear patterns that define a sound but resist simple categorization.
Treating Goals Like Sprints: An Agile Approach to 2026
January is a fascinating time of year. Coffee shops fill with people journaling their dreams, gym parking lots overflow with fresh determination, and social media buzzes with declarations of transformation. There’s something genuinely hopeful about it—this collective belief that a new calendar year means a clean slate, a chance to finally become the person we’ve been meaning to be.
Inter-Subject Correlation Analysis in Music-Induced EEG Responses
An exploration of neural synchronization patterns in participants with varying levels of agreement on musical preferences, conducted as an independent study with the IIIT-H Music Cognition Group under the guidance and support of Prof. Vinoo Alluri.
Projects
LyrAssist - AI-Powered Lyric Transcription & Video Generation
Full-stack web application that automatically transcribes audio/video files and generates synchronized lyric videos using OpenAI Whisper, WhisperX, and Demucs AI models
NYC Sidewalk Time Machine
Interactive visualization tool analyzing 20 years of Manhattan pedestrian infrastructure data using React, D3.js, and geospatial processing
Multi-Agent Reinforcement Learning System (Taxi-MARL)
Advanced multi-agent reinforcement learning implementation with parameter sharing and Independent Q-Learning, achieving robust scalability across 2-5 agents
Publications
Decoding Human Emotions: Analyzing Multi-channel EEG Data Using LSTM Networks
Published in International Conference on Data Science and Applications (ICDSA 2024), 2024
This study aims to understand and improve the predictive accuracy of emotional state classification through metrics such as valence, arousal, dominance, and likeness by applying a long short-term memory (LSTM) network to analyze EEG signals.
Citation: Sateesh, S. K., Sparsh, B. K., & Uma, D. (2024). "Decoding Human Emotions: Analyzing Multi-channel EEG Data Using LSTM Networks." International Conference on Data Science and Applications. Springer Nature Singapore, 503-515.
Download Paper | View on Springer
Meta-learning in Audio and Speech Processing: An End to End Comprehensive Review
Published in International Conference on Multi-disciplinary Trends in Artificial Intelligence (MIWAI 2024), 2024
This survey overviews various meta-learning approaches used in audio and speech processing scenarios. Meta-learning is used where model performance needs to be maximized with minimum annotated samples, making it suitable for low-sample audio processing.
Citation: Raimon, A., Masti, S., Sateesh, S. K., Vengatagiri, S., & Das, B. (2024). "Meta-learning in Audio and Speech Processing: An End to End Comprehensive Review." International Conference on Multi-disciplinary Trends in Artificial Intelligence. Springer Nature Singapore, 140-154.
Download Paper | View on Springer
