Activity 14: Logistic Regression
For this activity [1], I used the banana dataset from [2] which has 273 images of bananas separated by underripe, midripe, yellowish-green, and overripe labels.
Feature extraction: RGB
For the training set I decided to take only images of underripe (green) and ripe (yellow) bananas and assign them class numbers 0 and 1, respectively. The feature vector extracted from the images consist only of the mean of each color channel (whose values are normalized to 1), without any further preprocessing or segmentation. I simply reused my code from the previous activity, with minor modifications. The feature space in RGB is shown in Fig. 1.

Figure 1: Feature space of ripe (1) and unripe (0) bananas in RGB.
After training all the ripe/unripe images, I then fed in the midripe images and plotted their activations. We can see from Fig. 2 that the predictions appear biased toward the upper half, which subjectively does not agree with the corresponding images. Preferably, the predictions are nicely distributed about the center. To this end, I try another approach.

Figure 2: Final activations of the midripe banana images in RGB.
Feature extraction:
In order to be able to plot the decision contours as well as reduce the computational complexity, I decided to convert
the images to




Figure 2:
References
- M. N. Soriano, A14 - Logistic regression (2019).
- F. Mazen, and A. Nashat, Ripeness classification of bananas using an artificial neural network. Arabian Journal for Science and Engineering (2019).