Benson Duong

Data Science major at UC San Diego

Interests in Data Science, Python Programming, Machine Learning.

View My GitHub Profile

Typography Image Data Science Project

This image data science project is a personal practice of unstructured image data. Much like the traffic project, I found the data first and that decided the project’s direction. The dataset is images of all english alphabet letters and symbols in varying fonts.

All of this is part of a Kaggle article tutorial that I authored.

(Click to view project)

Github Repo

(Click to view Github)

Unsupervised Methods and Clustering

The font types include designations like “serif”, or “sans-serif”, and are actually labelled groupings of various fonts with similar visual traits.

Convolutional Neural Network

The final endproduct of the project is an image classification model that can take in photo-scanned text documents, and automatically categorize the type of font most prevalent in the document. It will involve a convolutional neural network of variable input size, meaning it can take in images of varying dimensions.