Efficient UAV localization using combined autoencoder and SIFT

303 views

Authors

  • Ngo Van Quan Institute of Information Technology, Academy of Military Science and Technology
  • Phan Huy Anh (Corresponding Author) Institute of Electronics, Academy of Military Science and Technology
  • Bui Thi Thanh Tam Institute of Electronics, Academy of Military Science and Technology
  • Nguyen Chi Thanh Institute of Information Technology, Academy of Military Science and Technology

DOI:

https://doi.org/10.54939/1859-1043.j.mst.FEE.2024.142-148

Keywords:

GNSS; Autoencoder; SIFT; Visual Localization; UAV.

Abstract

In GNSS-denied environments, accurate Unmanned Aerial (UAV) localization faces significant challenges. This paper introduces a vision-based localization method combining autoencoder and SIFT algorithms, referred to as AE+SIFT. The method compresses high-resolution map images into low-dimensional vectors, which are stored in a database for efficient retrieval. During the localization process, UAV images are encoded and matched with the database, followed by SIFT and homography projection for precise positioning. The AE+SIFT approach enhances localization accuracy, achieving an average coordinate error of 3.94 meters relative to the ground truth. Notably, when UAV images are misaligned with reference images, our method outperforms the existing AE method in terms of accuracy.

References

[1]. Yang, Z., et al., “A Survey on UAV-Based Applications for Smart Agriculture,” IEEE Internet of Things Journal, vol. 8, no. 6, pp. 4132-4149, (2021).

[2]. Li, Y., et al., “Precision Agriculture with UAV-Based Remote Sensing: A Review,” Remote Sensing, vol. 12, no. 9, pp. 1-22, (2020). DOI: https://doi.org/10.3390/rs12223789

[3]. Deng, X., et al., “A Robust Vision-Based Localization System for UAVs in GPS-Denied Environments,” IEEE Transactions on Instrumentation and Measurement, vol. 70, (2021).

[4]. Humphreys, T. E., et al., “Assessing the Spoofing Threat: Development of a Portable GPS Civilian Spoofer,” Proceedings of the ION GNSS Meeting, (2008).

[5]. Mur-Artal, R., et al., “ORB-SLAM: A Versatile and Accurate Monocular SLAM System,” IEEE Transactions on Robotics, vol. 31, no. 5, pp. 1147-1163, (2015). DOI: https://doi.org/10.1109/TRO.2015.2463671

[6]. Lin, C., et al., “A Survey of Large-Scale Image Localization Methods for UAVs,” Journal of Computer Vision and Image Understanding, vol. 160, pp. 48-65, (2017).

[7]. Rublee, E., et al., “ORB: An Efficient Alternative to SIFT or SURF,” 2011 International Conference on Computer Vision, pp. 2564-2571, (2011). DOI: https://doi.org/10.1109/ICCV.2011.6126544

[8]. Lowe, D. G., “Distinctive Image Features from Scale-Invariant Keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, (2004). DOI: https://doi.org/10.1023/B:VISI.0000029664.99615.94

[9]. Ma, Y., et al., “Image Registration Techniques: A Survey,” IEEE Transactions on Medical Imaging, vol. 35, no. 7, pp. 1714-1735, (2016).

[10]. Johnson, J., Douze, M., and Je´gou, H., “Billion-scale similarity search with GPUs,” IEEE Transactions on Big Data, vol. 7, no. 3, pp. 535-547, (2021). DOI: https://doi.org/10.1109/TBDATA.2019.2921572

[11]. Lowe, D. G., “Object Recognition from Local Scale-Invariant Features,” Proceedings of the International Conference on Computer Vision (ICCV), vol. 2, pp. 1150-1157, (1999). DOI: https://doi.org/10.1109/ICCV.1999.790410

[12]. Fischler, M. A., and Bolles, R. C., “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Communications of the ACM, vol. 24, no. 6, pp. 381-395, (1981). DOI: https://doi.org/10.1145/358669.358692

[13]. Kingma, D. P., and Welling, M., “Auto-Encoding Variational Bayes,” International Conference on Learning Representations (ICLR), (2014).

[14]. Douze, M., et al., “The FAISS Library for Efficient Similarity Search,” arXiv preprint arXiv:1702.08734, (2017).

[15]. Bay, H., Tuytelaars, T., and Van Gool, L., “SURF: Speeded Up Robust Features,” European Conference on Computer Vision (ECCV), (2006). DOI: https://doi.org/10.1007/11744023_32

[16]. Bing, L. I., et al., “UAV Image Matching Based on Improving the ORB Algorithm,” Bulletin of Surveying and Mapping, (2024).

[17]. Mughal, M. H., Khokhar, M. J., and Shahzad, M., “Assisting UAV Localization via Deep Contextual Image Matching,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 14, pp. 2445-2457, (2021). DOI: https://doi.org/10.1109/JSTARS.2021.3054832

[18]. Li, Q., et al., “GeoFormer: An Effective Transformer-based Siamese Network for UAV Geo-localization,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, (2024). DOI: https://doi.org/10.1109/JSTARS.2024.3392812

[19]. Vaswani, A., et al., “Attention is All You Need,” Advances in Neural Information Processing Systems, (2017).

[20]. Bianchi, M., and Barfoot, T. D., “UAV Localization Using Autoencoded Satellite Images,” IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 1761-1768, (2021). DOI: https://doi.org/10.1109/LRA.2021.3060397

[21]. Di Piazza, T., et al., “Leveraging Edge Detection and Neural Networks for Better UAV Localization,” arXiv preprint arXiv:2404.06207, (2024). DOI: https://doi.org/10.1109/IGARSS53475.2024.10641666

[22]. Hou, X., et al., “Deep Feature Consistent Variational Autoencoder,” 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1133-1141, (2016). DOI: https://doi.org/10.1109/WACV.2017.131

Downloads

Published

06-12-2024

How to Cite

[1]
Ngo Van Quan, Phan Huy Anh, Bui Thi Thanh Tam, and Nguyen Chi Thanh, “Efficient UAV localization using combined autoencoder and SIFT”, JMST, no. FEE, pp. 142–148, Dec. 2024.

Issue

Section

Electronics - Technical Physics

Most read articles by the same author(s)