Creating a Hand Gesture Recognition System with Convolutional Neural Networks (CNN) and OpenCV

January 29, 2024May 26, 2024

Hand gesture recognition is a fascinating application that involves the intersection of computer vision and machine learning. In this blog post, we’ll explore how to build a hand gesture recognition system using a Convolutional Neural Network (CNN) and OpenCV for real-time video processing.

Building the Neural Network

Let’s start by assembling the neural network using the Keras library and compiling it for training. The neural network is a simple CNN with three convolutional and max-pooling layers, followed by dense layers. The model is then compiled with categorical crossentropy loss and the Adam optimizer.

# Import necessary libraries
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Activation, Flatten, Dense, Dropout
from tensorflow.keras.preprocessing.image import ImageDataGenerator, img_to_array, load_img

# Define the model
model = Sequential()
# Add convolutional layers
model.add(Conv2D(32, (3, 3), input_shape=(54, 54, 1)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
# Add more convolutional layers
model.add(Conv2D(32, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
# Add additional convolutional layers
model.add(Conv2D(64, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
# Flatten the tensor and add dense layers
model.add(Flatten())
model.add(Dense(64))
model.add(Activation('relu'))
model.add(Dropout(0.5))
model.add(Dense(4))
model.add(Activation('softmax'))

# Compile the model
model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])

Preparing Data for Training

To train the model, we need to prepare our data. We use an ImageDataGenerator for data augmentation and flow the data from directories.

# Set batch size
batch_size = 16

# Create data generators
training_datagen = ImageDataGenerator(
    rotation_range=50,
    width_shift_range=0.1,
    height_shift_range=0.1,
    shear_range=0.2,
    zoom_range=0.2,
    horizontal_flip=True,
    fill_mode='nearest'
)

validation_datagen = ImageDataGenerator(zoom_range=0.2, rotation_range=10)

# Flow data from directories
training_generator = training_datagen.flow_from_directory(
    'training_data',
    target_size=(54, 54),
    batch_size=batch_size,
    color_mode='grayscale'
)

validation_generator = validation_datagen.flow_from_directory(
    'validation_data',
    target_size=(54, 54),
    batch_size=batch_size,
    color_mode='grayscale'
)

Training the Network

Now, we can train the model on the augmented data.

# Train the model
model.fit_generator(
    generator=training_generator,
    steps_per_epoch=2000 // batch_size,
    epochs=50,
    verbose=1,
    validation_data=validation_generator,
    validation_steps=200 // batch_size,
    workers=8,
)

Plotting Model History

We can visualize the training and validation performance using the model’s history.

# Plot the training history
import re
import matplotlib.pyplot as plt

# Read the model fitting output
with open(MODEL_HISTORY) as history_file:
    history = history_file.read()

# Extract relevant data
data = {}
data['acc'] = re.findall(' acc: ([0-9]+\.[0-9]+)', history)
# Extract other metrics like loss, val_acc, val_loss

# Plot the data
plt.figure()
plt.title("Training Loss")
plt.xlabel("Epoch #")
plt.ylabel("Loss")
plt.plot(data['loss'])

# Repeat for other metrics (accuracy, validation loss, validation accuracy)


plt.show()

This concludes the training phase of our hand gesture recognition model. In the next section, we’ll integrate this model with OpenCV for real-time video processing and hand tracking.

Data Analytics | Machine Learning | NLP

Exploring Named Entity Recognition with Conditional Random Fields

ByKishore January 9, 2024January 10, 2024

Named Entity Recognition (NER) is a fundamental task in natural language processing that involves identifying and classifying entities, such as names of people, organizations, and locations, within a text. NER plays a crucial role in various applications, including information retrieval, question answering, and text summarization. In this blog post, we’ll dive into the world of…

Data Analytics

Enhancing Sentiment Analysis with ELMo Embeddings: A TensorFlow Experiment

ByKishore January 11, 2024May 27, 2024

Introduction Natural Language Processing (NLP) has witnessed a significant boost with the advent of transfer learning. In this blog post, we explore ELMo Embeddings, a cutting-edge approach to word embeddings, leveraging a large unlabelled text corpus for enhanced sentiment analysis. We’ll delve into the implementation using TensorFlow and TensorFlow Hub. Preparation Let’s start by setting…

Data Analytics

Mastering Advanced Techniques for Python Dictionary Sorting

ByKishore January 10, 2024May 25, 2024

Dictionaries in Python are powerful data structures that allow you to store key-value pairs. Often, there arises a need to sort a dictionary based on its values. In this exploration, we’ll uncover the techniques to efficiently sort a dictionary in both ascending and descending order. Example Dictionary Object Let’s consider a sample dictionary to demonstrate…

NLP

Unveiling the Power of Word Embeddings with Gensim

ByKishore January 11, 2024May 28, 2024

In the realm of Natural Language Processing (NLP), word embeddings have emerged as a game-changer. Unlike traditional approaches that use words as features, word embeddings leverage dense, low-dimensional vectors to capture the meaning and usage of a word. One pioneering model in this domain is Word2Vec, developed by Thomas Mikolov and team at Google. In…

Generative AI

Parameter-Efficient Fine-Tuning of Large Language Models with Hugging Face’s PEFT Library

ByKishore April 25, 2024May 24, 2024

Introduction: Large Language Models (LLMs) like GPT, T5, and BERT have shown remarkable performance in NLP tasks. However, fine-tuning these models on downstream tasks can be computationally expensive. Parameter-Efficient Fine-Tuning (PEFT) approaches aim to address this challenge by fine-tuning only a small number of parameters while freezing most of the pretrained model. In this blog…

Machine Learning

Understanding Support Vector Machines (SVMs) in Depth

ByKishore December 20, 2023May 28, 2024

Support Vector Machines (SVMs) are a powerful class of supervised algorithms used for both classification and regression tasks. In this blog post, we will delve into the intuition behind SVMs and their application in solving classification problems. Motivation To begin, let’s consider a simple classification task with well-separated classes. We’ll generate some synthetic data with…

Building the Neural Network

Preparing Data for Training

Training the Network

Plotting Model History

Similar Posts

Leave a Reply Cancel reply