lung-cancer-image-classification

Lung cancer image classification in Python using LIDC dataset.

View the Project on GitHub yeexunwei/lung-cancer-image-classification

lung-cancer-image-classification

Lung cancer image classification in Python using LIDC dataset. Images are processed using local feature descriptors and transformation methods before input into classifiers.

Project Objective

Methods Used

Technologies

Project Description

Data source from cancerimagingarchive.net consists of 1018 labelled CT scans cases.

CT scan slices
Dataset CT scan slices.

Data from dicom format is read into array.

Process flow diagram
Flow of data to classifiers.

K-means algorithm is used to group features extracted from images. Images transformed are directly fed into classiifers. A comparison is made for the each local feature descriptors and image transformation methods in the diagram.

Image after wavelet transformation
One example of image transformations, wavelet tranform.
Accuracy score
Best accuracy obtained after 3rd wavelet transformation and LBP clustering
Flask app
Screenshot of flask app running.

Process Flow

Future Improvements

This is my first time experimenting on a large dataset. Make use of data pipeline for clean and reusable codes. Try on hadoop to handle insufficient memory.