ImageAware+ Lorcan Kelly Zazera | SETU Carlow FYP 2026

The Project

What is ImageAware+?

Phishing attacks increasingly embed malicious content inside image files to bypass traditional email security filters. A fake Geek Squad invoice, a PayPal billing alert, or a DocuSign impersonation email rendered as a graphic is completely invisible to text-based scanners but convincing to any human who reads it.

ImageAware+ was built to close this gap. It combines multi-pass OCR with OpenCV image preprocessing, QR code detection, HTML href URL extraction, email header analysis, and a 29-indicator rule-based scoring engine to produce an explainable forensic risk assessment for any submitted image or email file.

Every point in the final score is traced back to a named indicator with supporting evidence making the system suitable for forensic documentation, not just binary classification. The system is deployed as a live educational platform covering phishing awareness, attack types, and real-time sample analysis.

The Pipeline

How It Works

01

Upload

Submit a phishing image (PNG, JPG) or email file (.eml) through the web interface.

02

Extract

Multi-pass OCR extracts text, HTML hrefs recover hidden URLs, QR codes are decoded.

03

Enrich

Extracted URLs are checked against VirusTotal, URLScan.io, and PhishTank APIs.

04

Score

29 indicators across 8 attack categories produce an explainable risk score from 0 to 100.

Evaluation

Detection Performance

Formally evaluated on 300 labelled samples 150 phishing emails from the Nazario 2025 corpus and 150 legitimate emails from the TREC 2007 ham corpus. Image pipeline evaluated on 22 labelled samples.