CasaLLM: An LLM built from scratch.


Custom CLIP
This was a 34 million parameter model coded from scratch using PyTorch and trained on 3 million image-caption pairs. An RNN was used as the text encoder to explore its performance in the CLIP architecture.
See the full blogpost.

Master’s Thesis: Contactless Current and Voltage Detector
The goal was to replace expensive Hall Effect sensors with inexpensive magnetic field point measurements, using algorithmic methods to remove interference from external magnetic field sources.
Methods applied included Autoencoder Neural Networks, Linear Regression, Polynomial Regression, non-linear solvers, and Generalized Least Squares.
See the full thesis.
US Patent No. 12085591

DropAlan: A Novel Dropout Variant
Instead of pairing traditional activation functions with standard dropout, DropAlan directly assigns each neuron a dropout probability based on its input values. In expectation, this mimics the behavior of ReLU activations while avoiding issues like dead ReLUs and making the method easier to analyze theoretically.
Tested on MNIST and CIFAR-10, it achieved similar accuracy to traditional dropout while converging as much as 20% faster.
See the full report.

OCR System
In this project, we trained a VGG-16 variant CNN to classify letters in an image. Candidate image sections were isolated for classification using edge-detection techniques with the help of the OpenCV library.

Pole-Climbing Robot
The robot was built out of aluminum plates and I used several machine shop tools, including a mill and lathe, to construct the robot. I was also required to perform physical analysis of my robot design and justify my design decisions.

MASLAB Robot
The robot was built using MDF and laser cutting, and was designed in Solidworks. We used the OpenCV library to process a webcam feed to enable the robot to autonomously locate and pick up the blocks on the field.