PosAlys 2D and 3D Pose Estimation Using Single RGB Images

Dr. Ali Hassan Dr. Farhan Riaz, NS Muzna Imran NS Syed Ali Raza

DSpace Home
→
E-Theses
→
CEME
→
Computer Engineering
→
BS
→
View Item

dc.contributor.author	Dr. Ali Hassan Dr. Farhan Riaz, NS Muzna Imran NS Syed Ali Raza
dc.date.accessioned	2025-04-30T09:22:44Z
dc.date.available	2025-04-30T09:22:44Z
dc.date.issued	2018
dc.identifier.other	DE-COMP-36
dc.identifier.uri	http://10.250.8.41:8080/xmlui/handle/123456789/52771
dc.description	Supervisors Dr. Ali Hassan Dr. Farhan Riaz	en_US
dc.description.abstract	3D Pose Estimation (Human joints detection in 3D) from a single 2D RGB image, without depth information, is quite an ill-posed problem, in the sense that it does not have a unique solution with respect to its 2D solution. Many techniques involving human-tailored feature extractors have been proposed up till now but with no impressive results have been reported, especially for in-the-wild human interaction scenarios. Neural Networks, though introduced decades before, have gained much popularity quite recently in every field because of the availability of huge data corpora and computing resources. 3D pose estimation from a 2D RGB image seems to be an interesting challenge to be tackled with Neural Networks. A 50-layered Convolutional Neural Network based on Microsoft's Residual Network architecture is proposed for the extraction of 2D Heat-maps (images showing probabilities at each pixel for each joint) and 3D Location-maps (images showing probabilistic coordinates, relative to detected pelvis, of each pixel for each joint). Afterwards, the location of joints are selected from Locationmaps where the 2D Heat-maps have given maximum probabilities for the joints. This output is temporally filtered to take advantage of correlation between detected joints in current frame with the detected joints in the previous frames. Then Skeleton-fitting is done on the output to bring the coordinates back to camera coordinates from pelvis-relative coordinates by optimization of the proposed objective function, which also stabilizes the output even further	en_US
dc.language.iso	en	en_US
dc.publisher	College of Electrical & Mechanical Engineering (CEME), NUST	en_US
dc.title	PosAlys 2D and 3D Pose Estimation Using Single RGB Images	en_US
dc.type	Project Report	en_US