Kacper Krakowiak

Software Developer | Data Engineer | Machine Learning Enthusiast

Machine Learning Projects

Throughout my final year, I've experimented with a range of machine learning projects across different domains. These projects showcase my skills in data preprocessing, model development, evaluation, and interpretation.

Semester 1 Semester 2: Classical ML Semester 2: Neural Networks

Semester 1: Data Science Fundamentals

Linear and Polynomial Regression - Gold Price Analysis

Python Pandas Matplotlib Time Series Analysis

This project focused on analyzing historical gold price data (from 2004 to 2024) to identify patterns and trends. Using various statistical techniques, I preprocessed the data, performed linear and polynomial regression data analysis, and created a graph for both.

Key Findings: taught the model on a fluctuating graph, and created a line of best fit (linear and polynomial).

The analysis utilized the XAU_1Month_data dataset and implemented Linear/polynomial Regression models to map the prices on a simple 2D x,y plain.

Decision Tree for Weather Prediction

Decision Trees Sklearn Weather Forecasting Feature Selection

Developed a decision tree model to predict weather patterns using the 'weather_forecast_data.csv' dataset. This project demonstrates my ability to implement classification algorithms for real-world prediction problems.

Key Achievements: Created an interpretable model that can predict weather conditions with significant accuracy based on meteorological features.

The implementation included pre-processing of weather data, feature selection to identify the most predictive variables, and visualization of the decision tree for interpretability. I also evaluated the model's performance using cross-validation and confusion matrices.

Semester 2: Classical Machine Learning Algorithms

Semester 2: Neural Network Projects

Recurrent Neural Networks for Sequence Analysis

RNN LSTM Sequence Prediction NLP

Implemented Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks for sequence analysis tasks. This project demonstrates my ability to work with sequential data and time-dependent patterns.

The models were trained on Shakespearean text data for language processing tasks such as sentiment analysis and text generation. I experimented with different network architectures, temperatures, embedding techniques, and sequence lengths to optimize performance.

Key Achievement: Successfully implemented an LSTM model that could generate coherent Shakespearean text passages after training on a large dataset of his writing.