Transfer Learning with TensorFlow 2

Bjorn Eriksson on November 11, 2019 at 6:47 pm

Thank you for this very well written and informative article! I’m looking to do exactly this during the coming week, and this is a great head start. I just have three questions:

1) What does super(Wrapper, self).__init__() do?

2) In general, what do you think is the best resource for tf documentation? The tensorflow documentation itself is really sparse, just listing available methods with no explanation.

3) How can the validation error be lower than the training error for Googlenet and Resnet?

Thanks again!

Reply

rubikscode on November 12, 2019 at 9:18 am

Hi Bjorn,

Thank you for reading the article, I am glad that you liked it and find it useful.

Here are the answers:
1. Since Wrapper class inherits Model from tf.keras we need to call its constructor as well. This is done using this line.

2. TF documentation is still best start + the additional googling.

3. Usually training error will be lower than the validation, you are right about that. However, it is not rare occurance that validation error is lower. This might be to the fact that validation set had “easier” data in it.

Reply

lxy on November 18, 2019 at 10:13 am

I’m right now testing your ” TransferLearning with Tensorflow” using Colab.
I’m stopped with following error, just before displaying images:

TypeError Traceback (most recent call last)
in ()
—-> 1 data_loader = DataLoader(IMG_SIZE, BATCH_SIZE, SHUFFLE_SIZE)
2
3 plt.figure(figsize=(10, 8))
4 i = 0
5 for img, label in data_loader.get_random_raw_images(20):

TypeError: __init__() takes 3 positional arguments but 4 were given

Your advice ?

Reply

rubikscode on November 18, 2019 at 10:17 am

Hi Ixy,

Thank you for reading our blog.
I’ve must have coppied the wrong code.
Here is how that line should look like:

data_loader = DataLoader(IMG_SIZE, BATCH_SIZE)

I’ve already fixed it in the gist.
Sorry for the inconvinience.

Cheers,
RC Team

Reply

lxy on November 18, 2019 at 10:31 am

Referring to my previous msg.
Colab is still using Tensorflow 1, I suspect that error comes from differences with Tensoorflow 2

Reply

lxy on November 18, 2019 at 10:48 am

Hi,
Sorry, but in image random plotting instructions I have following error :
NameError: name ‘random_train_raw_data’ is not defined

In the Training part I’m blocked with
NameError: name ‘data_loader’ is not defined

Anyway thank you for your quick answer.
I appreciate your work.
X. Leroy

Reply

rubikscode on November 18, 2019 at 10:58 am

Hi Leroy,

Can you copy over the code from here https://gist.github.com/NMZivkovic/e5952df07680fbe7f4bfd4a7793e9e81 for Data Loader again?
That should fix both problems.

Thanks!

Reply

lxy on November 18, 2019 at 12:10 pm

Hi!
We have made progress. Now I am blocked with the following message :

I am now blocked in the instruction starting with “historty by :
“ValueError: When using data tensors as input to a model, you should specify the `steps_per_epoch` argument.”

Curiously instructions starts but is blocked afer a while.
lxy

Reply

lxy on November 18, 2019 at 12:43 pm

same error in instruction starting with history
“ValueError: When using data tensors as input to a model, you should specify the `steps_per_epoch` argument.”

Next instruction starting with loss :
“NameError: name ‘validation_steps’ is not defined”

Reply

rubikscode on November 18, 2019 at 12:51 pm

Hi Leroy,

Evidently I forgot to copy over one part of the code at initial_evaluation.py
I have updated it at https://gist.github.com/NMZivkovic/ad8de59a5b27607549ed0fefb4ea6f59.
Could you copy over it and try again?

Reply

lxy on November 18, 2019 at 6:20 pm

Instruction starting with “history runs during a moment then crashes with :
“ValueError: When using data tensors as input to a model, you should specify the `steps_per_epoch` argument.”

Surprisingly the after instruction starting with “loss1” works and gives following results:
“——–VGG16———
Loss: 2.45
Accuracy: 0.48
—————————
——–GoogLeNet———
Loss: 6.25
Accuracy: 0.48
—————————
——–ResNet———
Loss: 0.74
Accuracy: 0.95
—————————”

Instruction starting with data_loader runs but after a while gives following error :

RuntimeError: __iter__() is only supported inside of tf.function or when eager execution is enabled.

Debugging Deep Learning application is a mess, to my opinion !

Reply

	import numpy as np
	import matplotlib.pyplot as plt

	import tensorflow as tf
	import tensorflow_datasets as tfds

	IMG_SIZE = 160
	BATCH_SIZE = 32
	SHUFFLE_SIZE = 1000
	IMG_SHAPE = (IMG_SIZE, IMG_SIZE, 3)

	class DataLoader(object):
	def __init__(self, image_size, batch_size):

	self.image_size = image_size
	self.batch_size = batch_size

	# 80% train data, 10% validation data, 10% test data
	split_weights = (8, 1, 1)
	splits = tfds.Split.TRAIN.subsplit(weighted=split_weights)

	(self.train_data_raw, self.validation_data_raw, self.test_data_raw), self.metadata = tfds.load(
	'cats_vs_dogs', split=list(splits),
	with_info=True, as_supervised=True)

	# Get the number of train examples
	self.num_train_examples = self.metadata.splits['train'].num_examples*80/100
	self.get_label_name = self.metadata.features['label'].int2str

	# Pre-process data
	self._prepare_data()
	self._prepare_batches()

	# Resize all images to image_size x image_size
	def _prepare_data(self):
	self.train_data = self.train_data_raw.map(self._resize_sample)
	self.validation_data = self.validation_data_raw.map(self._resize_sample)
	self.test_data = self.test_data_raw.map(self._resize_sample)

	# Resize one image to image_size x image_size
	def _resize_sample(self, image, label):
	image = tf.cast(image, tf.float32)
	image = (image/127.5) – 1
	image = tf.image.resize(image, (self.image_size, self.image_size))
	return image, label

	def _prepare_batches(self):
	self.train_batches = self.train_data.shuffle(1000).batch(self.batch_size)
	self.validation_batches = self.validation_data.batch(self.batch_size)
	self.test_batches = self.test_data.batch(self.batch_size)

	# Get defined number of not processed images
	def get_random_raw_images(self, num_of_images):
	random_train_raw_data = self.train_data_raw.shuffle(1000)
	return random_train_raw_data.take(num_of_images)

	def __init__(self, image_size, batch_size):

	self.image_size = image_size
	self.batch_size = batch_size

	# 80% train data, 10% validation data, 10% test data
	split_weights = (8, 1, 1)
	splits = tfds.Split.TRAIN.subsplit(weighted=split_weights)

	(self.train_data_raw, self.validation_data_raw, self.test_data_raw), self.metadata = tfds.load(
	'cats_vs_dogs', split=list(splits),
	with_info=True, as_supervised=True)

	# Get the number of train examples
	self.num_train_examples = self.metadata.splits['train'].num_examples*80/100
	self.get_label_name = self.metadata.features['label'].int2str

	# Pre-process data
	self._prepare_data()
	self._prepare_batches()

	data_loader = DataLoader(IMG_SIZE, BATCH_SIZE)

	plt.figure(figsize=(10, 8))
	i = 0
	for img, label in data_loader.get_random_raw_images(20):
	plt.subplot(4, 5, i+1)
	plt.imshow(img)
	plt.title("{} – {}".format(data_loader.get_label_name(label), img.shape))
	plt.xticks([])
	plt.yticks([])
	i += 1
	plt.tight_layout()
	plt.show()

	class Wrapper(tf.keras.Model):
	def __init__(self, base_model):
	super(Wrapper, self).__init__()

	self.base_model = base_model
	self.average_pooling_layer = tf.keras.layers.GlobalAveragePooling2D()
	self.output_layer = tf.keras.layers.Dense(1)

	def call(self, inputs):
	x = self.base_model(inputs)
	x = self.average_pooling_layer(x)
	output = self.output_layer(x)
	return output

Transfer Learning with TensorFlow 2

Architectures

Dataset

Implementation

Data Loader

Base Models & Wrapper

Training

Evaluation

Conclusion

11 Comments

Trackbacks/Pingbacks

Leave a Reply to lxyCancel reply

Feel Free To Message Us

Contact Info

Visit Us

Email Us

Call Us

Ultimate Guide to Machine Learning with Python

	vgg16_base = tf.keras.applications.VGG16(input_shape=IMG_SHAPE, include_top=False, weights='imagenet')
	googlenet_base = tf.keras.applications.InceptionV3(input_shape=IMG_SHAPE, include_top=False, weights='imagenet')
	resnet_base = tf.keras.applications.ResNet101V2(input_shape=IMG_SHAPE, include_top=False, weights='imagenet')

	base_learning_rate = 0.0001

	vgg16_base.trainable = False
	vgg16 = Wrapper(vgg16_base)
	vgg16.compile(optimizer=tf.keras.optimizers.RMSprop(lr=base_learning_rate),
	loss='binary_crossentropy',
	metrics=['accuracy'])

	googlenet_base.trainable = False
	googlenet = Wrapper(googlenet_base)
	googlenet.compile(optimizer=tf.keras.optimizers.RMSprop(lr=base_learning_rate),
	loss='binary_crossentropy',
	metrics=['accuracy'])

	resnet_base.trainable = False
	resnet = Wrapper(resnet_base)
	resnet.compile(optimizer=tf.keras.optimizers.RMSprop(lr=base_learning_rate),
	loss='binary_crossentropy',
	metrics=['accuracy'])

	steps_per_epoch = round(data_loader.num_train_examples)//BATCH_SIZE
	validation_steps = 20

	loss1, accuracy1 = vgg16.evaluate(data_loader.validation_batches, steps = 20)
	loss2, accuracy2 = googlenet.evaluate(data_loader.validation_batches, steps = 20)
	loss3, accuracy3 = resnet.evaluate(data_loader.validation_batches, steps = 20)

	print("——–VGG16———")
	print("Initial loss: {:.2f}".format(loss1))
	print("Initial accuracy: {:.2f}".format(accuracy1))
	print("—————————")

	print("——–GoogLeNet———")
	print("Initial loss: {:.2f}".format(loss2))
	print("Initial accuracy: {:.2f}".format(accuracy2))
	print("—————————")

	print("——–ResNet———")
	print("Initial loss: {:.2f}".format(loss3))
	print("Initial accuracy: {:.2f}".format(accuracy3))
	print("—————————")

	history = vgg16.fit(data_loader.train_batches,
	epochs=10,
	validation_data=data_loader.validation_batches)

Transfer Learning with TensorFlow 2

Architectures

Dataset

Implementation

Data Loader

Base Models & Wrapper

Training

Evaluation

Conclusion

11 Comments

Trackbacks/Pingbacks

Leave a Reply to lxyCancel reply

Discover more from Rubix Code