Why Good Engineers Should Not Be Managers

The inspiration for this blog post came from a conversation with a colleague who lamented the sheer number of meetings he now has to attend. As a recent promotee to team lead, his days are filled with managerial tasks, leaving less room for the technical work that once was his primary focus. This scenario is commonplace in large organizations where career advancement often equates to a transition into management. Such a shift means trading hands-on technical work for overseeing others, a change that isn’t always welcomed.

Read More

Multilabel Binary Classification with TensorFlow and PyTorch

figure
The Toxic Comments Kaggle dataset is widely recognized as a valuable resource and serves as a prominent benchmark for multilabel text classification. I invite you to take a look at my insightful blog post where I delve into the exploration of this dataset and apply simple machine learning algorithms with resampling methods for treating imbalanced datasets. Within this work, I utilize cutting-edge deep learning models implemented with the Tensorflow and Pytorch frameworks. Moreover, I conduct a comprehensive comparison of various methodologies to effectively tackle the task of multi-label text classification.

Read More

Table Extraction with Machine Learning Models

figure1
Concepts - Computer Vision, Detection Transformer, Table Detection, Table Extraction, Optical Character Recognition (OCR)
Table extraction from documents using machine learning involves training algorithms to automatically identify and extract tables from a given document. This process can be challenging, as tables can come in various formats and layouts, and may be embedded within larger documents such as research papers, reports, or financial statements.

Read More

Toxic Comments Classification NLP Project

figure
Concepts - NLP, Multilabel Classification, Imbalanced Dataset, Text Embedding
The internet has become an integral part of modern society and has greatly impacted the way we communicate and access information. While it provides many benefits, it also has its downsides, one of which is the prevalence of toxic comments. In this project, I attempt to solve the problem we have with toxicity on the internet by building a machine learning model that detects these toxic comments.

Read More

Modeling Churn and Revenue for Ride-Hailing Service

figure1
Concepts - Monte-Carlo simulation, price modeling, statistical modeling
I built a Monte-Carlo simulation based on the Poisson process to model the price dynamics of a ride-hailing business. Simulations provided projections of earnings and profit of the company for a 1-12 month period. Strategies for improving profitability are written at the end of this report.

Read More

Bitcoin Risk Asset Analysis

figure
Concepts - Bitcoin, Risk Analysis, Portfolio Optimization
In this work I analyze Bitcoin as an asset compared to other more traditional assets such as stocks and gold. Recently there has been large interest in Bitcoin as an investable asset due to it’s high returns in the last decade. I will attempt to give a full picture of the asset using the tools and techniques from traditional finance, then draw some insights and recommendations about whether or not to include it in the portfolio of any investment fund.

Read More