EVOLUTION OF OBJECT DETECTION, TRACKING, AND MOTION ESTIMATION ALONG WITH DEEP NEURAL NETWORKS/ Khaled Adel Ezzat

By:

Khaled Adel Ezzat

Material type: Text

TextLanguage: English Summary language: English Publication details: 2022Description: 61 p. ill. 21 cmSubject(s):

Genre/Form:

Dissertation, Academic

DDC classification:

Contents:

Contents: Chapters: 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.3 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2. MinkowRadon Multi-Object Tracking Technique . . . . . . . . . . . . . . 12 2.1 Dataset - MOTChallenge (MOT20 & MOT17) . . . . . . . . . . . 13 2.2 Model Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.3 Numerical Results and Discussion . . . . . . . . . . . . . . . . . . . 20 2.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 3. The Adaptive MinkowYolo Multi-Object Detector-Tracker . . . . . . . . 24 3.1 Dataset - MOT + Visual Tracker Benchmark (VTB) . . . . . . . . 26 3.2 System Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . 27 3.3 Numerical Results and Discussion . . . . . . . . . . . . . . . . . . . 37 3.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 4. Conclusions and Future Work . . . . . . . . . . . . . . . . . . . . . . . . 43 Bibliography . . .

Dissertation note: Thesis (M.A.)—Nile University, Egypt, 2022 . Abstract: Abstract: Object Detection, Tracking and Motion Estimation have been a major concern since the 1970s, from Self Driving Cars, Surveillance Cameras, Industrial robotics, Traffic monitoring, Medical diagnosis systems, to Activity recognition, are expecting a huge increase in demand for automated detection-tracking systems. Modern hardware specifications and evolving deep learning applications with advancement of Computer Vision and Digital Video Processing are resulting in a massive progress towards fully automated systems, with all advance models and systems like R-CNN, YOLO, SSD, and RetinaNet, there will always be a trade-off between precision (mAP) and speed (FPS) which puts a new limits to computer vision advancement. Technological merging has the potential to drive the intuition to achieve such advancements, and overcome some of the existing limitations. Introducement of a combination between Deep Neural Networks and Digital signal processing to enable once again progress to be done in improving Object Detection, Tracking and Motion Estimation in a real-time videos. Utilizing both of the fields, this thesis purposes a complete detection/ tracking framework utilizing YOLO v4 as a state-of-art object detector to detect the objects in the video sequences. In addition to a novel MinkowRadon tracking algorithm which utilizes the Radon Transformation and Minkowski Distance to translates the rest of video frames sequence to the signal’s domain, in an attempt iv to tackle extreme object tracking problems found in video sequences like eg. trembling moving cameras, deformation, motion blur, fast motion, and in-plane rotation. Tracking through signals have proven with a higher accuracy compared to the stateof- art tracking techniques that a combination between classical techniques and deep learning models is sufficient to solve most modern problems.

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Call number	Status	Date due	Barcode
Thesis	Main library	610/K.E.E/ 2022 (Browse shelf(Opens below))	Not for loan

Browsing Main library shelves Close shelf browser (Hides shelf browser)

Previous	No cover image available No cover image available	No cover image available No cover image available	No cover image available No cover image available	No cover image available No cover image available	No cover image available No cover image available	No cover image available No cover image available	No cover image available No cover image available	Next
Previous	610/ I.E.E/ 2022 AN END-TO-END FRAMEWORK FOR UNRESTRICTED AUTOMATIC LICENSE PLATE RECOGNITION USING SUPER-RESOLUTION GANS /	610/ IS.A 2014 A 63-Way Character Recognition /	610 / JA. A 2006 عالج نفسك بالحلبة	610/K.E.E/ 2022 EVOLUTION OF OBJECT DETECTION, TRACKING, AND MOTION ESTIMATION ALONG WITH DEEP NEURAL NETWORKS/	610 / K.M.H / 2018 A Hybrid Deep CNN-Reinforcement Learning Model for Autonomous Driving /	610/K.M.I/2020 An Investigative Framework For Natural Language–Based Neural Program Synthesis /	610/ K.M.S/ 2021 Self-Supervised Learning Framework for Sequential Data Applications /	Next

Supervisor:
Khaled Foad
Mustafa Elattar

Publication:
1-On the Application of Hierarchical Adaptive Structured Mesh" HASM®" Codec for Ultra Large Video Format
https://dl.acm.org/doi/abs/10.1145/3436829.3436870

2-nnDPI: A Novel Deep Packet Inspection Technique Using Word Embedding, Convolutional and Recurrent Neural Networks
https://ieeexplore.ieee.org/abstract/document/9257912

3-On Optimizing the Visual Quality of HASM-Based Streaming—The Study the Sensitivity of Motion Estimation Techniques for Mesh-Based Codecs in Ultra High Definition Large Format Real-Time Video Coding
https://link.springer.com/chapter/10.1007/978-981-33-6129-4_15

4-MinkowRadon: Multi-Object Tracking Using Radon Transformation and Minkowski Distance
https://ieeexplore.ieee.org/abstract/document/9581542

Thesis (M.A.)—Nile University, Egypt, 2022 .

"Includes bibliographical references"

Contents:
Chapters:
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2. MinkowRadon Multi-Object Tracking Technique . . . . . . . . . . . . . . 12
2.1 Dataset - MOTChallenge (MOT20 & MOT17) . . . . . . . . . . . 13
2.2 Model Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.3 Numerical Results and Discussion . . . . . . . . . . . . . . . . . . . 20
2.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3. The Adaptive MinkowYolo Multi-Object Detector-Tracker . . . . . . . . 24
3.1 Dataset - MOT + Visual Tracker Benchmark (VTB) . . . . . . . . 26
3.2 System Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.3 Numerical Results and Discussion . . . . . . . . . . . . . . . . . . . 37
3.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
4. Conclusions and Future Work . . . . . . . . . . . . . . . . . . . . . . . . 43
Bibliography . . .

Abstract:
Object Detection, Tracking and Motion Estimation have been a major concern
since the 1970s, from Self Driving Cars, Surveillance Cameras, Industrial robotics,
Traffic monitoring, Medical diagnosis systems, to Activity recognition, are expecting
a huge increase in demand for automated detection-tracking systems. Modern hardware
specifications and evolving deep learning applications with advancement of Computer
Vision and Digital Video Processing are resulting in a massive progress towards
fully automated systems, with all advance models and systems like R-CNN, YOLO,
SSD, and RetinaNet, there will always be a trade-off between precision (mAP) and
speed (FPS) which puts a new limits to computer vision advancement. Technological
merging has the potential to drive the intuition to achieve such advancements, and
overcome some of the existing limitations. Introducement of a combination between
Deep Neural Networks and Digital signal processing to enable once again progress
to be done in improving Object Detection, Tracking and Motion Estimation in a
real-time videos. Utilizing both of the fields, this thesis purposes a complete detection/
tracking framework utilizing YOLO v4 as a state-of-art object detector to
detect the objects in the video sequences. In addition to a novel MinkowRadon tracking
algorithm which utilizes the Radon Transformation and Minkowski Distance to
translates the rest of video frames sequence to the signal’s domain, in an attempt
iv
to tackle extreme object tracking problems found in video sequences like eg. trembling
moving cameras, deformation, motion blur, fast motion, and in-plane rotation.
Tracking through signals have proven with a higher accuracy compared to the stateof-
art tracking techniques that a combination between classical techniques and deep
learning models is sufficient to solve most modern problems.

Text in English, abstracts in English .

There are no comments on this title.

to post a comment.