Open Access System for Information Sharing

Login Library

 

Thesis
Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads
Full metadata record
Files in This Item:
There are no files associated with this item.
DC FieldValueLanguage
dc.contributor.author박충현-
dc.date.accessioned2023-04-07T16:32:56Z-
dc.date.available2023-04-07T16:32:56Z-
dc.date.issued2022-
dc.identifier.otherOAK-2015-09765-
dc.identifier.urihttp://postech.dcollection.net/common/orgView/200000600797ko_KR
dc.identifier.urihttps://oasis.postech.ac.kr/handle/2014.oak/117219-
dc.descriptionMaster-
dc.description.abstractThe recent success of neural networks enables a better interpretation of 3D point clouds, but processing a large-scale 3D scene remains a challenging problem. Most current approaches divide a large-scale scene into small regions and combine the local predictions together. However, this scheme inevitably involves additional stages for pre- and post-processing and may also degrade the final output due to predictions in a local perspective. This thesis introduces Fast Point Transformer that consists of a novel lightweight self-attention layer. Our approach encodes continuous 3D coordinates, and the voxel hashing-based architecture boosts computational efficiency. The proposed method is demonstrated with 3D semantic segmentation and 3D detection. The accuracy of our approach is competitive to the best voxel-based method, and our network achieves 136 times faster inference time than the state-of-the-art, Point Transformer, with a reasonable accuracy trade-off.-
dc.languageeng-
dc.publisher포항공과대학교-
dc.titleFast Point Transformer for Large-scale 3D Scene Understanding-
dc.title.alternative대규모 3차원 장면 이해를 위한 빠른 포인트 트랜스포머-
dc.typeThesis-
dc.contributor.college인공지능대학원-
dc.date.degree2022- 2-

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Views & Downloads

Browse