Publications

Show all

2026

1.
Object detection with multimodal large vision-language models: An in-depth review

Ranjan Sapkota; Manoj Karkee

Object detection with multimodal large vision-language models: An in-depth review Journal Article

In: Information Fusion, vol. 126, pp. 103575, 2026, ISSN: 1566-2535.

Abstract | Links | BibTeX | Tags: Information fusion, Language and vision fusion, Large language models, Object detection, Vision-language models

2025

2.
Multi-Modal LLMs in Agriculture: A Comprehensive Review

Ranjan Sapkota; Rizwan Qureshi; Muhammad Usman Hadi; Syed Zohaib Hassan; Ferhat Sadak; Maged Shoman; Muhammad Sajjad; Fayaz Ali Dharejo; Achyut Paudel; Jiajia Li; Zhichao Meng; John Shutske; Manoj Karkee

Multi-Modal LLMs in Agriculture: A Comprehensive Review Journal Article

In: IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 22510–22540, 2025, ISSN: 1558-3783.

Abstract | Links | BibTeX | Tags: Agriculture, Analytical models, ChatGPT, Computational modeling, Computer vision, Data models, Deep learning, Farming, generative artificial intelligence, Hidden Markov models, Large language models (LLMs), Machine Learning, Precision agriculture, Reviews, Training, Transformers, Translation, Vision-language models