J Shanghai Jiaotong Univ Sci ›› 2023, Vol. 28 ›› Issue (1): 100-113.doi: 10.1007/s12204-023-2573-3
Previous Articles Next Articles
QIN Chao1 (秦 超), WANG Yafei1 (王亚飞), ZHANG Yuchao2 (张宇超), YIN Chengliang1∗ (殷承良)
Received:
2022-03-08
Online:
2023-01-28
Published:
2023-02-10
CLC Number:
QIN Chao1 (秦 超), WANG Yafei1 (王亚飞), ZHANG Yuchao2 (张宇超), YIN Chengliang1∗ (殷承良). Birds-Eye-View Semantic Segmentation and Voxels Semantic Segmentation Based on Frustum Voxels Modeling and Monocular Camera[J]. J Shanghai Jiaotong Univ Sci, 2023, 28(1): 100-113.
[1] BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495. [2] READING C, HARAKEH A, CHAE J L, et al. Categorical depth distribution network for monocular 3D object detection [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2021: 8551-8560. [3] ABBAS S A, ZISSERMAN A. A geometric approach to obtain a bird’s eye view from an image [C]//2019 IEEE/CVF International Conference on Computer Vision Workshop. Seoul: IEEE, 2019: 4095-4104. [4] LIN C C, WANG M S. A vision based top-view transformation model for a vehicle parking assistant [J]. Sensors, 2012, 12(4): 4431-4446. [5] DENG L Y, YANG M, LI H, et al. Restricted deformable convolution-based road scene semantic segmentation using surround view cameras [J]. IEEE Transactions on Intelligent Transportation Systems, 2020, 21(10): 4350-4362. [6] S?MANN T, AMENDE K, MILZ S, et al. Efficient semantic segmentation for visual bird’s-eye view interpretation [M]//Intelligent autonomous systems 15. Cham: Springer, 2018: 679-688. [7] PAN B W, SUN J K, LEUNG H Y T, et al. Crossview semantic segmentation for sensing surroundings [J]. IEEE Robotics and Automation Letters, 2020, 5(3): 4867-4873. [8] LU C Y, VAN DE MOLENGRAFT M J G, DUBBELMAN G. Monocular semantic occupancy grid mapping with convolutional variational encoder–decoder networks [J]. IEEE Robotics and Automation Letters, 2019, 4(2): 445-452. [9] SCHULTER S, ZHAI M H, JACOBS N, et al. Learning to look around objects for top-view representations of outdoor scenes [M]//Computer vision – ECCV 2018. Cham: Springer, 2018: 815-831. [10] MANI K, DAGA S, GARG S, et al. MonoLayout: Amodal scene layout from a single image [C]//2020 IEEE Winter Conference on Applications of Computer Vision. Snowmass: IEEE, 2020: 1678-1686. [11] RODDICK T, CIPOLLA R. Predicting semantic map representations from images using pyramid occupancy networks [C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 11135-11144. [12] RONNEBERGER O, FISCHER P, BROX T. U-Net: Convolutional networks for biomedical image segmentation [M]//Medical image computing and computerassisted intervention – MICCAI 2015. Cham: Springer, 2015: 234-241. [13] DING X H, ZHANG X Y, MA N N, et al. RepVGG: making VGG-style ConvNets great again [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2021: 13728-13737. [14] LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss fordense object detection [C]//2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 2999-3007. [15] CAESAR H, BANKITI V, LANG A H, et al. nuScenes: A multimodal dataset for autonomous driving [C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: IEEE, 2020: 11618-11628. [16] KINGMA D P, BA J. Adam: A method for stochastic optimization[DB/OL]. (2017-01-30). https://arxiv.org/abs/1412.6980. [17] GARCIA-GARCIA A, ORTS-ESCOLANO S, OPREA S, et al. A review on deep learning techniques applied to semantic segmentation [DB/OL]. (2017-04-22). https://arxiv.org/abs/1704.06857. |
[1] | Fu Zeyu, Fu Zhuang, Guan Yisheng. Vascular Interventional Surgery Path Planning and 3D Visual Navigation [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 472-481. |
[2] | Wang Baomin, Ding Hewei, Teng Fei, Liu Hongqin. Damage Detection of X-ray Image of Conveyor Belts with Steel Rope Cores Based on Improved FCOS Algorithm [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(2): 309-318. |
[3] | Wang Gang, Guan Yaonan, Li Dewei. Two-Stream Auto-Encoder Network for Unsupervised Skeleton-Based Action Recognition [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(2): 330-336. |
[4] | Diao Zijian, Cao Shuai, Li Wenwei, Liang Jianan, Wen Guilin, Huang Weixi, Zhang Shouming. Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(2): 363-374. |
[5] | ZHOU Su (周苏), ZHONG Zebin∗ (钟泽滨). Real-Time Ranging of Vehicles and Pedestrians for Mobile Application on Smartphones [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(6): 1081-1090. |
[6] | YAN Congqiang1,2 (鄢丛强), GUO Zhengyun3,4 (郭正玉), CAI Yunze1,2∗∗ (蔡云泽). Data Augmentation of Ship Wakes in SAR Images Based on Improved CycleGAN [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(4): 702-711. |
[7] | LONARE Savita1,2* , BHRAMARAMBA Ravi2. Federated Approach for Privacy-Preserving Traffic Prediction Using Graph Convolutional Network [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(3): 509-517. |
[8] | LV Feng(吕峰), WANG Xinyan* (王新彦), LI Lei(李磊), JIANG Quan(江泉), YI Zhengyang(易政洋). Tree Detection Algorithm Based on Embedded YOLO Lightweight Network [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(3): 518-527. |
[9] | SONG Liboa (宋立博), FEI Yanqiongb (费燕琼). New Lite YOLOv4-Tiny Algorithm and Application on Crack Intelligent Detection [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(3): 528-536. |
[10] | SHEN Ao1,2‡ (沈傲), HU Jisu 2,3‡ (胡冀苏), JIN Pengfei4 (金鹏飞), ZHOU Zhiyong2 (周志勇), QIAN Xusheng 2,3 (钱旭升), ZHENG Yi2 (郑毅), BAO Jie 4 (包婕), WANG Ximing4∗ (王希明), DAI Yakang1,2∗ (戴亚康). Ensemble Attention Guided Multi-SEANet Trained with Curriculum Learning for Noninvasive Prediction of Gleason Grade Groups from MRI [J]. J Shanghai Jiaotong Univ Sci, 2024, 29(1): 109-119. |
[11] | XUE Yongboa (薛永波),LIU Zhaob (刘钊), LI Zeyanga (李泽阳),ZHU Pinga* (朱平). CT Image Segmentation Method of Composite Material Based on Improved Watershed Algorithm and U-Net Neural Network Model [J]. J Shanghai Jiaotong Univ Sci, 2023, 28(6): 783-792. |
[12] | FU Jiawei∗ (傅家威), ZHAO Xu (赵 旭). Action-aware Encoder-Decoder Network for Pedestrian Trajectory Prediction [J]. J Shanghai Jiaotong Univ Sci, 2023, 28(1): 20-27. |
[13] | SONG Hao-hao (宋好好), LU Zhen (陆 臻). Image Fusion Scheme Based on Nonsubsampled Contourlet and Block-Based Cosine Transform [J]. J Shanghai Jiaotong Univ Sci, 2012, 17(1): 8-012. |
Viewed | ||||||||||||||||||||||||||||||||||||||||||||||||||
Full text 75
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||
Abstract 408
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||