Object Detection for Aerial View Images: Dataset and Learning Rate (open access)

Object Detection for Aerial View Images: Dataset and Learning Rate

In recent years, deep learning based computer vision technology has developed rapidly. This is not only due to the improvement of computing power, but also due to the emergence of high-quality datasets. The combination of object detectors and drones has great potential in the field of rescue and disaster relief. We created an image dataset specifically for vision applications on drone platforms. The dataset contains 5000 images, and each image is carefully labeled according to the PASCAL VOC standard. This specific dataset will be very important for developing deep learning algorithms for drone applications. In object detection models, loss function plays a vital role. Considering the uneven distribution of large and small objects in the dataset, we propose adjustment coefficients based on the frequencies of objects of different sizes to adjust the loss function, and finally improve the accuracy of the model.
Date: May 2021
Creator: Qi, Yunlong
System: The UNT Digital Library
Mixed Reality Tailored to the Visually-Impaired (open access)

Mixed Reality Tailored to the Visually-Impaired

The goal of the proposed device and software architecture is to apply the functionality of mixed reality (MR) in order to make a virtual environment that is more accessible to the visually-impaired. We propose a glove-based system for MR that will use finger and hand movement tracking along with tactile feedback so that the visually-impaired can interact with and obtain a more detailed sense of virtual objects and potentially even virtual environments. The software architecture makes current MR frameworks more accessible by augmenting the existing software and extensive 3D model libraries with both the interfacing of the glove-based system and the audibly navigable user interface (UI) of a virtual environment we have developed. We implemented a circuit with finger flexion/extension tracking for all 5 fingers of a single hand and variable vibration intensities for the vibromotors on all 5 fingertips of a single hand. The virtual environment can be hosted on a Windows 10 application. The virtual hand and its fingers can be moved with the system's input and the virtual fingertips touching the virtual objects trigger vibration motors (vibromotors) to vibrate while the virtual objects are being touched. A rudimentary implementation of picking up and moving virtual objects inside …
Date: August 2022
Creator: Omary, Danah M
System: The UNT Digital Library
Notch Filter Design for Power Line Interference Artifact Reduction of ECG Signal and Feature Extraction in LabVIEW (open access)

Notch Filter Design for Power Line Interference Artifact Reduction of ECG Signal and Feature Extraction in LabVIEW

Electrocardiogram (ECG) is a biological signal that represents the heart's electrical activity. Interference from power lines introduces a frequency component of 50 to 60 Hz into the signal, which is the principal cause of ECG corruption. By using the Cadence Virtuoso Spectre circuit simulator and typical TSMC RF 180 nm CMOS technology, a notch filter was created to reduce powerline interference. The advantage of utilizing a notch filter for PLI is that noise at 60 Hz is completely eliminated without sacrificing any important information. Additionally, this study contains a MATLAB-based model for, which is used to compute the power spectral density for the obtained time-domain signal. By incorporating power spectral density into data gathering procedures, it is feasible to enhance data collection methodologies, construct models that appropriately account for observed power and aid in the removal of undesired components. NI LabVIEW is used to extract features. The advantage of ECG feature extraction is that it provides information that assists in the identification of cardiac rhythm issues, and gives information about the occurrence of heart attack. In this study, several patient data sets are utilized to extract characteristics and provide information regarding heart condition abnormalities.
Date: May 2022
Creator: Kasidi, Divyasri
System: The UNT Digital Library
Localization of UAVs Using Computer Vision in a GPS-Denied Environment (open access)

Localization of UAVs Using Computer Vision in a GPS-Denied Environment

The main objective of this thesis is to propose a localization method for a UAV using various computer vision and machine learning techniques. It plays a major role in planning the strategy for the flight, and acts as a navigational contingency method, in event of a GPS failure. The implementation of the algorithms employs high processing capabilities of the graphics processing unit, making it more efficient. The method involves the working of various neural networks, working in synergy to perform the localization. This thesis is a part of a collaborative project between The University of North Texas, Denton, USA, and the University of Windsor, Ontario, Canada. The localization has been divided into three phases namely object detection, recognition, and location estimation. Object detection and position estimation were discussed in this thesis while giving a brief understanding of the recognition. Further, future strategies to aid the UAV to complete the mission, in case of an eventuality, like the introduction of an EDGE server and wireless charging methods, was also given a brief introduction.
Date: May 2022
Creator: Aluri, Ram Charan
System: The UNT Digital Library

Wireless Surface Acoustic Wave Sensor for PM2.5 Detection

Currently, there is no equipment to measure the real-time fit of EHMR or N-95masks which are used in harsh environments. Improper fit of these EHMRs or N-95 masks exposes the personnel to hazardous environments. Surface acoustic wave (SAW) sensors have been around for few decades and are being used in various applications. In this work, real-time PM2.5 detection using passive wireless SAW sensors is presented. The design of meander antenna at 433MHz for wireless interrogation of SAW sensor using HFSS and ADS is also presented in this thesis. This works also includes the design of YZ-lithium niobate SAW sensor including COMSOL simulation.
Date: May 2022
Creator: Mamidipally, Sai Karthik
System: The UNT Digital Library
Small-Scale Dual Path Network for Image Classification and Machine Learning Applications to Color Quantization (open access)

Small-Scale Dual Path Network for Image Classification and Machine Learning Applications to Color Quantization

This thesis consists of two projects in the field of machine learning. Previous research in the OSCAR UNT lab based on KMeans color quantization is further developed and applied to individual color channels and segmented input images to explore compression rates while still maintaining high output image quality. The second project implements a small-scale dual path network for image classifiaction utilizing the CIFAR-10 dataset containing 60,000 32x32 pixel images ranging across ten categories.
Date: May 2022
Creator: Murrell, Ethan Davis
System: The UNT Digital Library
Development and Application of Novel Computer Vision and Machine Learning Techniques (open access)

Development and Application of Novel Computer Vision and Machine Learning Techniques

The following thesis proposes solutions to problems in two main areas of focus, computer vision and machine learning. Chapter 2 utilizes traditional computer vision methods implemented in a novel manner to successfully identify overlays contained in broadcast footage. The remaining chapters explore machine learning algorithms and apply them in various manners to big data, multi-channel image data, and ECG data. L1 and L2 principal component analysis (PCA) algorithms are implemented and tested against each other in Python, providing a metric for future implementations. Selected algorithms from this set are then applied in conjunction with other methods to solve three distinct problems. The first problem is that of big data error detection, where PCA is effectively paired with statistical signal processing methods to create a weighted controlled algorithm. Problem 2 is an implementation of image fusion built to detect and remove noise from multispectral satellite imagery, that performs at a high level. The final problem examines ECG medical data classification. PCA is integrated into a neural network solution that achieves a small performance degradation while requiring less then 20% of the full data size.
Date: August 2021
Creator: Depoian, Arthur Charles, II
System: The UNT Digital Library
Neural Network Classifiers for Object Detection in Optical and Infrared Images (open access)

Neural Network Classifiers for Object Detection in Optical and Infrared Images

This thesis presents a series of neural network classifiers for object detection in both optical and infrared images. The focus of this work is on efficient and accurate solutions. The thesis discusses the evolution of the highly efficient and tiny network Binary Classification Vision Transformer (BC-ViT) and how through thoughtful modifications and improvements, the BC-ViT can be utilized for tasks of increasing complexity. Chapter 2 discusses the creation of BC-ViT and its initial use case for underwater image classification of optical images. The BC-ViT is able to complete its task with an accuracy of 99.29\% while being comprised of a mere 15,981 total trainable parameters. Chapter 3, Waste Multi-Class Vision Transformer (WMC-ViT), introduces the usefulness of mindful algorithm design for the realm of multi-class classification on a mutually exclusive dataset. WMC-ViT shows that the task oriented design strategy allowed for a network to achieve an accuracy score of 94.27\% on a five class problem while still maintaining a tiny parameter count of 35,492. The final chapter demonstrates that by utilizing functional blocks of BC-ViT, a simple and effective target detection algorithm for infrared images can be created. The Edge Infrared Vision Transformer (EIR-ViT) showed admirable results with a high IoU …
Date: December 2023
Creator: Adams, Ethan Richard
System: The UNT Digital Library
Air Corridors: Concept, Design, Simulation, and Rules of Engagement (open access)

Air Corridors: Concept, Design, Simulation, and Rules of Engagement

Air corridors are an integral part of the advanced air mobility infrastructure. They are the virtual highways in the sky for transportation of people and cargo in the controlled airspace at an altitude of around 1000 ft. to 2000 ft. above the ground level. This paper presents fundamental insights into the design of air corridors with high operational efficiency as well as zero collisions. It begins with the definitions of air cube, skylane or track, intersection, vertiport, gate, and air corridor. Then, a multi-layered air corridor model is proposed. Traffic at intersections is analyzed in detail with examples of vehicles turning in different directions. The concept of capacity of an air corridor is introduced along with the nature of distribution of locations of vehicles in the air corridor and collision probability inside the corridor are discussed. Finally, the results of simulations of traffic flows are presented.
Date: December 2021
Creator: Muna, Sabrina Islam
System: The UNT Digital Library