Arshia Pessaran
Earned a biology degree from UCF and went on to medical school at FIU. Withdrew to pivot into tech. Finished a second degree in data science within a year and has been working as a software engineer ever since.
A Windows background application that automates hourly capture of the TradingView
S&P 500 heatmap and delivers it to a Discord channel. Uses Playwright to
launch a headless Chromium browser, navigate to TradingView, configure the heatmap
view, and capture a screenshot — all without user interaction. A time-aligned
scheduler fires on the hour, and the captured image is posted via Discord's
incoming webhook REST API.
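The time-aligned scheduling described above could be sketched roughly as follows. This is a minimal illustration rather than the application's actual code: the function names `next_hour_boundary` and `seconds_until_next_hour` are hypothetical, and the sketch only shows how a sleep duration can be computed so that each capture lands on the top of the hour instead of drifting with a fixed 60-minute interval.

```python
from datetime import datetime, timedelta


def next_hour_boundary(now: datetime) -> datetime:
    """Return the first top-of-hour instant strictly after `now`.

    Hypothetical helper: truncates to the current hour, then adds one hour,
    so day and month rollovers are handled by datetime arithmetic.
    """
    return now.replace(minute=0, second=0, microsecond=0) + timedelta(hours=1)


def seconds_until_next_hour(now: datetime) -> float:
    """Return how many seconds to wait from `now` until the next hour mark."""
    return (next_hour_boundary(now) - now).total_seconds()
```

A scheduler loop built on this would sleep for `seconds_until_next_hour(datetime.now())` before each capture, recomputing the delay every cycle so that time spent taking the screenshot does not accumulate as drift.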
Packaged as a standalone Windows executable with PyInstaller. Runs as a persistent
system tray process with pause, resume, and manual trigger controls.
Python · Playwright · headless browser automation · Discord API ·
webhook integration · task scheduling · PyInstaller · Windows system tray
A multi-stage pipeline that reconstructs scanned PDFs into clean digital documents.
Preprocessing corrects page skew via projection profile analysis and Otsu thresholding,
denoises with a bilateral filter to preserve edges, and binarizes for stroke analysis.
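To illustrate the thresholding step, here is a small pure-Python sketch of Otsu's method, which picks the gray level that maximizes the between-class variance of the two resulting pixel groups. It is an assumption-laden teaching example, not the pipeline's code: the helper names `histogram`, `otsu_threshold`, and `binarize` are hypothetical, and a real pipeline would more likely rely on OpenCV, which exposes the same method through `cv2.threshold` with the `cv2.THRESH_OTSU` flag.

```python
from typing import List, Sequence


def histogram(pixels: Sequence[Sequence[int]]) -> List[int]:
    """Count occurrences of each 8-bit gray level (0..255)."""
    hist = [0] * 256
    for row in pixels:
        for value in row:
            hist[value] += 1
    return hist


def otsu_threshold(hist: Sequence[int]) -> int:
    """Return the level t maximizing between-class variance.

    Pixels <= t form one class, pixels > t form the other.
    """
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    weight_low = 0      # number of pixels at or below the candidate level
    sum_low = 0         # sum of gray values at or below the candidate level
    best_level, best_variance = 0, -1.0
    for level, count in enumerate(hist):
        weight_low += count
        sum_low += level * count
        if weight_low == 0:
            continue    # no pixels in the low class yet
        weight_high = total - weight_low
        if weight_high == 0:
            break       # every pixel is already in the low class
        mean_low = sum_low / weight_low
        mean_high = (sum_all - sum_low) / weight_high
        variance = weight_low * weight_high * (mean_low - mean_high) ** 2
        if variance > best_variance:
            best_level, best_variance = level, variance
    return best_level


def binarize(pixels: Sequence[Sequence[int]], threshold: int) -> List[List[int]]:
    """Map pixels <= threshold to 0 (ink) and the rest to 255 (paper)."""
    return [[0 if value <= threshold else 255 for value in row] for row in pixels]
```

On a clean scan the histogram is strongly bimodal (dark ink, light paper), which is the case where an automatically chosen global threshold works well.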
Pages are classified in two passes — a fast pixel statistics pass filters obvious
outliers, while a full pass combines Surya's deep learning layout model with OCR
confidence metrics to label each page as normal, mixed, or outlier.
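The two-pass structure can be sketched as a cheap filter that short-circuits before the expensive analysis runs. Everything specific here is an assumption made for illustration: the function names, the input features (`ink_ratio`, `text_fraction`, `ocr_confidence`), and all threshold values are hypothetical, and the `analyze` callable merely stands in for the layout model and OCR step.

```python
from typing import Callable, Optional, Tuple

# Hypothetical cut-offs for illustration only; real values would be tuned.
MIN_INK, MAX_INK = 0.005, 0.60      # plausible range for fraction of dark pixels
MIN_TEXT_FRACTION = 0.50            # share of the page covered by text regions
HIGH_CONFIDENCE = 0.80              # mean OCR confidence for a "normal" page
LOW_CONFIDENCE = 0.50               # below this, the page is not usable text


def fast_pass(ink_ratio: float) -> Optional[str]:
    """Cheap pixel-statistics check: flag nearly blank or nearly solid pages."""
    if ink_ratio < MIN_INK or ink_ratio > MAX_INK:
        return "outlier"
    return None  # undecided, defer to the full pass


def full_pass(text_fraction: float, ocr_confidence: float) -> str:
    """Combine layout coverage and OCR confidence into a final label."""
    if text_fraction >= MIN_TEXT_FRACTION and ocr_confidence >= HIGH_CONFIDENCE:
        return "normal"
    if text_fraction > 0 and ocr_confidence >= LOW_CONFIDENCE:
        return "mixed"
    return "outlier"


def classify_page(ink_ratio: float,
                  analyze: Callable[[], Tuple[float, float]]) -> str:
    """Run the fast pass first; call the costly `analyze` only if needed."""
    quick_label = fast_pass(ink_ratio)
    if quick_label is not None:
        return quick_label
    text_fraction, ocr_confidence = analyze()
    return full_pass(text_fraction, ocr_confidence)
```

Passing the expensive step as a callable keeps it lazy, so pages rejected by the pixel statistics never reach the model.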
A two-phase calibration system processes the first batch of pages to establish a
font size and style baseline before full reconstruction. OCR is batched across pages
for GPU efficiency, with support for three engines: Surya, Tesseract, and PaddleOCR.
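A rough sketch of the calibration and batching ideas follows. It makes several assumptions that the description above does not confirm: the baseline is taken as the median of detected font sizes (style is not covered here), the calibration window is a fixed number of leading pages, and the names `calibrate_font_size` and `batched` are hypothetical.

```python
from statistics import median
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def calibrate_font_size(page_font_sizes: Sequence[Sequence[float]],
                        calibration_pages: int = 5) -> float:
    """Phase one: estimate a baseline font size from the first few pages.

    `page_font_sizes` holds, per page, the font sizes detected on that page.
    The median is used (an assumption) because it resists headings and
    footnotes skewing the estimate.
    """
    sizes = [size
             for page in page_font_sizes[:calibration_pages]
             for size in page]
    if not sizes:
        raise ValueError("no font sizes found in the calibration pages")
    return median(sizes)


def batched(items: Sequence[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive groups of up to `batch_size` items.

    Grouping pages this way lets an OCR engine process several pages per
    call instead of one call per page.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])
```

Phase two would then reconstruct every page against the calibrated baseline, feeding page images to the chosen OCR engine one batch at a time.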
Python · PyMuPDF · ReportLab · OpenCV · Surya · PaddleOCR · Tesseract ·
deep learning · OCR · computer vision · image processing · pipeline architecture · customtkinter