Word Embeddings using neural networks

This post covers various neural-network-based word embedding models: starting from the Neural Probabilistic Language Model of Bengio et al. (2003), then the reduction of its complexity using hierarchical softmax and Noise Contrastive Estimation, and later works such as CBOW, Skip-gram with Negative Sampling, and GloVe, which enabled training on much larger datasets.

In machine learning, converting the input data (text, images, or time series) into a vector representation (also known as an embedding) forms a key building block for enabling downstream tasks. This article explores in detail the architectures of some of the neural-network-based word embedding models in the literature.

Papers referred to:

  1. A Neural Probabilistic Language Model, Bengio et al. (2003)
    • proposed a neural network architecture to jointly learn word feature vectors and the probability of word sequences.
  2. Hierarchical Probabilistic Neural Network Language Model, Morin & Bengio (2005)
    • given that the softmax layer for computing the probability scales with the vocabulary size V, proposed a hierarchical version of softmax to reduce the complexity from O(V) to O(log V).
  3. Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics, Gutmann & Hyvärinen (2012)
    • recast the estimation of an unnormalized model as a binary classification task of distinguishing data samples from noise samples, avoiding explicit computation of the normalization constant.
  4. Efficient Estimation of Word Representations in Vector Space, Mikolov et al. (2013)
    • proposed simpler neural architectures with the intuition that simpler models enable training on much larger corpora.
    • introduced Continuous Bag of Words (CBOW), which predicts the center word given its context, and Skip-gram, which predicts the surrounding words given the center word.
  5. Distributed Representations of Words and Phrases and their Compositionality, Mikolov et al. (2013)
    • showed that subsampling of frequent words both speeds up training and improves the accuracy of the representations of less-frequent words.
    • introduced Negative Sampling, a simplified variant of Noise Contrastive Estimation.
  6. GloVe: Global Vectors for Word Representation, Pennington et al. (2014)
    • proposes that ratios of co-occurrence probabilities capture semantic information better than raw co-occurrence probabilities.
Continue reading “Word Embeddings using neural networks”
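To make the Skip-gram with Negative Sampling idea concrete, here is a minimal NumPy sketch of one training step on a toy vocabulary. The vocabulary size, embedding dimension, initialisation, and word indices are all illustrative assumptions, not the original word2vec implementation; the step pushes the output vector of the observed context word towards the center word's vector and pushes sampled negative words away.

```python
import numpy as np

rng = np.random.default_rng(0)
V, d = 10, 4                                 # toy vocabulary size and embedding dimension (assumptions)
W_in = rng.normal(scale=0.1, size=(V, d))    # input (center-word) vectors
W_out = rng.normal(scale=0.1, size=(V, d))   # output (context-word) vectors

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sgns_step(center, context, negatives, lr=0.1):
    """One SGD step of skip-gram with negative sampling:
    logistic loss with label 1 for the true context word,
    label 0 for each sampled negative word."""
    v = W_in[center].copy()
    grad_v = np.zeros(d)
    for w, label in [(context, 1.0)] + [(n, 0.0) for n in negatives]:
        g = sigmoid(W_out[w] @ v) - label    # gradient of logistic loss w.r.t. the score
        grad_v += g * W_out[w]
        W_out[w] -= lr * g * v
    W_in[center] -= lr * grad_v

# a few steps on one (center, context) pair with two negatives
before = sigmoid(W_out[1] @ W_in[0])
for _ in range(20):
    sgns_step(center=0, context=1, negatives=[2, 3])
after = sigmoid(W_out[1] @ W_in[0])
```

After a few steps, the model's probability that word 1 is a true context of word 0 should increase, which is the behaviour the objective is designed to produce.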

Gradients for multi class classification with Softmax

In a multi-class classification problem, the output (also called the label or class) takes one of a finite set of discrete values. In this post, the system model for multi-class classification with a linear layer followed by a softmax layer is defined. The softmax function transforms the output of the linear layer into values lying between 0 and 1, which can be interpreted as probability scores.

Next, the loss function using categorical cross entropy is explained, and the gradients for the model parameters are derived using the chain rule. The analytically computed gradients are then compared with those obtained from the deep learning framework PyTorch. Finally, we implement a training loop using gradient descent for a toy multi-class classification task with 2D Gaussian-distributed data.
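The key result of that derivation is that for softmax followed by categorical cross entropy, the gradient with respect to the logits is simply p − y (predicted probabilities minus the one-hot label). A minimal NumPy check against a finite-difference estimate, with an illustrative logit vector and label rather than the post's actual PyTorch comparison:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())          # subtract max for numerical stability
    return e / e.sum()

def loss(z, y):
    return -np.log(softmax(z)[y])    # categorical cross entropy, integer label y

z = np.array([0.5, -1.2, 2.0])       # illustrative logits
y = 1                                # illustrative true class

analytic = softmax(z) - np.eye(3)[y] # dL/dz = p - one_hot(y)

# central finite differences as an independent check
eps = 1e-6
numeric = np.array([
    (loss(z + eps * np.eye(3)[k], y) - loss(z - eps * np.eye(3)[k], y)) / (2 * eps)
    for k in range(3)
])
```

The two gradient estimates agree to numerical precision, which is the same sanity check the post performs with PyTorch's autograd.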

Continue reading “Gradients for multi class classification with Softmax”

Gradients for Binary Classification with Sigmoid

In a classification problem, the output (also called the label or class) takes a small number of discrete values rather than continuous values. For a simple binary classification problem, where the output takes only two discrete values, 0 or 1, the sigmoid function can be used to transform the output of a linear regression model into a value between 0 and 1, squashing the continuous prediction into a probability-like score. This score can then be interpreted as the likelihood of the output being class 1, with a threshold (commonly 0.5) used to decide between class 0 and class 1.

In this post, the intuition for the binary classification loss function, based on the Maximum Likelihood Estimate (MLE), is explained. We then derive the gradients for the model parameters using the chain rule. Gradients computed analytically are compared against gradients computed using the deep learning framework PyTorch. Further, a training loop using gradient descent is implemented for a binary classification problem with two-dimensional Gaussian-distributed data.
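For a linear model followed by a sigmoid, ŷ = σ(w·x + b), the chain rule gives the well-known binary cross-entropy gradients dL/dw = (ŷ − y)·x and dL/db = ŷ − y. A small NumPy sketch with illustrative random data, verifying the bias gradient against a finite-difference estimate (the post itself uses PyTorch for this comparison):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def bce(y_hat, y):
    # binary cross entropy for a single example
    return -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

rng = np.random.default_rng(0)
x = rng.normal(size=3)               # illustrative single input
w = rng.normal(size=3)               # illustrative weights
b, y = 0.1, 1.0                      # illustrative bias and label

y_hat = sigmoid(w @ x + b)
grad_w = (y_hat - y) * x             # chain rule: dL/dy_hat * dy_hat/dz * dz/dw
grad_b = y_hat - y

# central finite-difference check on the bias gradient
eps = 1e-6
numeric_b = (bce(sigmoid(w @ x + b + eps), y)
             - bce(sigmoid(w @ x + b - eps), y)) / (2 * eps)
```

Note the pleasant parallel with the softmax case: in both, the gradient with respect to the pre-activation is simply prediction minus label.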

Continue reading “Gradients for Binary Classification with Sigmoid”

Gradients for linear regression

Understanding gradients is essential in machine learning, as they indicate the direction and rate of change of the loss function with respect to the model parameters. This post covers the gradients for the vanilla linear regression case, taking two loss functions, Mean Square Error (MSE) and Mean Absolute Error (MAE), as examples.

The gradients computed analytically are compared against gradients computed using the deep learning framework PyTorch. Further, using these gradients, a training loop with gradient descent is implemented for the simplest example of fitting a straight line.
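A minimal version of that straight-line fit in NumPy, using the analytic MSE gradients dL/dw = mean(2·(ŷ − y)·x) and dL/db = mean(2·(ŷ − y)); the data, learning rate, and iteration count are illustrative choices, and the post's actual code may differ:

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 50)
y = 3.0 * x + 0.5 + 0.01 * rng.normal(size=x.size)  # noisy line: slope 3, intercept 0.5

w, b, lr = 0.0, 0.0, 0.1                            # initial parameters and learning rate
for _ in range(500):
    y_hat = w * x + b
    err = y_hat - y
    w -= lr * np.mean(2 * err * x)                  # analytic dL/dw for MSE
    b -= lr * np.mean(2 * err)                      # analytic dL/db for MSE
```

Since the MSE loss is convex in (w, b), gradient descent with a suitable learning rate recovers parameters close to the true slope and intercept.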

As always, the CS229 Lecture Notes and the notation used in the Deep Learning Specialization (C1W1L01) from Dr Andrew Ng form key references.

Continue reading “Gradients for linear regression”