Tabular Data Annotation for Financial Forecasting Models
This was an academic capstone project focused on forecasting stock price movements and business performance using macroeconomic and financial indicators. I was responsible for labeling data points based on market movement categories (e.g., rise, fall, stable) and annotating events based on earnings releases, economic reports, and company news. Labels were used to train a logistic regression and ensemble learning model. The project involved verifying data accuracy, reducing noise in the labels, and ensuring consistent formatting for ML training. I also helped create logic for mapping features to specific model outputs, simulating function calling for LLM-based forecasting assistance.