Full Text Available
Note: Clicking the button above will open the full text document at the original institutional repository in a new window.
In medical imaging, the appearance of a certain body part on a radiograph depends not only on the position but also on the orientation of the X-ray imaging system with respect to the patient. Given a 2D image of a 3D scene, the problem of viewpoint estimation aims to determine the position and the o...
| Main Author: | |
|---|---|
| Other Authors: | |
| Format: | Thesis |
| Language: | Eng |
| Published: |
Department of Electrical Engineering
2024
|
| Subjects: | |
| Tags: |
No Tags, Be the first to tag this record!
|
| _version_ | 1867613244698394625 |
|---|---|
| access_status_str | Open Access |
| author | Hounkanrin, Mahouclo Anicet |
| author2 | Nicolls, Frederick |
| author_browse | Hounkanrin, Mahouclo Anicet Nicolls, Frederick |
| author_facet | Nicolls, Frederick Hounkanrin, Mahouclo Anicet |
| author_sort | Hounkanrin, Mahouclo Anicet |
| collection | Thesis |
| description | In medical imaging, the appearance of a certain body part on a radiograph depends not only on the position but also on the orientation of the X-ray imaging system with respect to the patient. Given a 2D image of a 3D scene, the problem of viewpoint estimation aims to determine the position and the orientation of the imaging sensor that resulted in that view. We investigate methods to solve the viewpoint estimation problem for medical images, notably the determination of orientation parameters. Machine learning models, particularly convolutional neural networks (CNNs), are developed to predict a human subject's orientation in a radiograph. Since deep learning models require data for training, we first generate a dataset of digitally reconstructed radiographs (DRRs) from a set of computed tomography (CT) scans using Fourier volume rendering (FVR). The dataset of DRRs is then used to train CNN models for viewpoint regression and classification. A label-softening strategy is used to improve the performance of the classification models. Meanwhile, a geometric structure-aware cost function is used to account for the geometric continuity of the viewpoint space. Several 3D rotation methods such as Euler angle, axis-angle, and quaternions are investigated for viewpoint representation. The results demonstrate that viewpoint estimation in medical imaging can be effectively solved using CNN-based classification and regression models. The geometric structure-aware cost function proves to be essential to the success of classification models for viewpoint estimation. The regression-based models, on the order hand, appear to be sensitive to the type of parametrization used to represent the viewpoints. In particular, the unit quaternion representation of 3D rotations proves to be more effective than other representations for viewpoint regression with CNN models. Moreover, we extend the proposed method to perform viewpoint estimation for natural images. The performance on the PASCAL3D+ dataset indicates that the application of the methods presented is not restricted to medical imaging. |
| format | Thesis |
| id | oai:open.uct.ac.za:11427/40304 |
| institution | University of Cape Town (South Africa) |
| language | Eng |
| last_indexed | 2026-06-10T12:33:04.194Z |
| license_str | Not specified — see source repository |
| provenance_str_mv | Harvested via OAI-PMH from UCTD — University of Cape Town Open Access Repository |
| publishDate | 2024 |
| publishDateRange | 2024 |
| publishDateSort | 2024 |
| publisher | Department of Electrical Engineering |
| publisherStr | Department of Electrical Engineering |
| record_format | dspace |
| source_str | UCTD — University of Cape Town Open Access Repository |
| spelling | oai:open.uct.ac.za:11427/40304 Viewpoint estimation in medical imaging Hounkanrin, Mahouclo Anicet Nicolls, Frederick Amayo Paul Electrical Engineering In medical imaging, the appearance of a certain body part on a radiograph depends not only on the position but also on the orientation of the X-ray imaging system with respect to the patient. Given a 2D image of a 3D scene, the problem of viewpoint estimation aims to determine the position and the orientation of the imaging sensor that resulted in that view. We investigate methods to solve the viewpoint estimation problem for medical images, notably the determination of orientation parameters. Machine learning models, particularly convolutional neural networks (CNNs), are developed to predict a human subject's orientation in a radiograph. Since deep learning models require data for training, we first generate a dataset of digitally reconstructed radiographs (DRRs) from a set of computed tomography (CT) scans using Fourier volume rendering (FVR). The dataset of DRRs is then used to train CNN models for viewpoint regression and classification. A label-softening strategy is used to improve the performance of the classification models. Meanwhile, a geometric structure-aware cost function is used to account for the geometric continuity of the viewpoint space. Several 3D rotation methods such as Euler angle, axis-angle, and quaternions are investigated for viewpoint representation. The results demonstrate that viewpoint estimation in medical imaging can be effectively solved using CNN-based classification and regression models. The geometric structure-aware cost function proves to be essential to the success of classification models for viewpoint estimation. The regression-based models, on the order hand, appear to be sensitive to the type of parametrization used to represent the viewpoints. In particular, the unit quaternion representation of 3D rotations proves to be more effective than other representations for viewpoint regression with CNN models. Moreover, we extend the proposed method to perform viewpoint estimation for natural images. The performance on the PASCAL3D+ dataset indicates that the application of the methods presented is not restricted to medical imaging. 2024-07-04T13:53:30Z 2024-07-04T13:53:30Z 2024 2024-07-03T13:29:59Z Thesis / Dissertation Doctoral PhD http://hdl.handle.net/11427/40304 Eng application/pdf Department of Electrical Engineering Faculty of Engineering and the Built Environment |
| spellingShingle | Electrical Engineering Hounkanrin, Mahouclo Anicet Viewpoint estimation in medical imaging |
| thesis_degree_str | Doctoral |
| title | Viewpoint estimation in medical imaging |
| title_full | Viewpoint estimation in medical imaging |
| title_fullStr | Viewpoint estimation in medical imaging |
| title_full_unstemmed | Viewpoint estimation in medical imaging |
| title_short | Viewpoint estimation in medical imaging |
| title_sort | viewpoint estimation in medical imaging |
| topic | Electrical Engineering |
| url | http://hdl.handle.net/11427/40304 |
| work_keys_str_mv | AT hounkanrinmahoucloanicet viewpointestimationinmedicalimaging |