| Regression Tasks |
Text-based tokenization leads to precision loss in continuous value prediction. |
Tokenization of "100" into ["1", "00"] causes errors in AGB (Above-Ground Biomass) estimation tasks. |
High |
REO-VLM |
Improved environmental monitoring and carbon stock calculations. |
| Multispectral/SAR Adaptation |
RGB-centric architectures underperform on non-RGB data (e.g., SAR, hyperspectral). |
SpectralGPT struggles with SAR data due to inherited RGB-focused pretraining frameworks. |
High |
SpectralGPT |
Enhanced utility in flood mapping, mineral exploration, and military surveillance. |
| Multimodal Output Limitation |
Text-only outputs limit utility in dense predictions (e.g., segmentation, 3D). |
VLMs cannot generate segmentation masks or 3D models, restricting flood mapping applications. |
Moderate-High |
CPSeg |
Enable real-time hazard mapping and infrastructure planning. |
| Temporal Data Handling |
Static image analysis neglects temporal dynamics critical for trends. |
Changen2’s synthetic temporal data lacks real-world complexity for deforestation monitoring. |
Moderate |
Changen2 |
Improved climate change prediction and disaster response. |
| Benchmarking |
No unified standards for cross-task evaluation (e.g., VQA, RSICC). |
RSVQA-HR and LEVIR-CC use incompatible metrics, complicating cross-model comparisons. |
Moderate |
RSVQA, LEVIR-CC |
Accelerate model innovation via standardized benchmarks. |
| Ethical AI & Bias |
Auto-annotated datasets inherit biases from foundation models. |
RS5M’s captions generated by BLIP2/GPT-4 reflect urban/rural bias, skewing agricultural monitoring insights. |
Moderate |
RS5M |
Fairer AI applications in policy-making and environmental justice. |
| Sustainability |
High computational costs hinder accessibility and scalability. |
SkyEyeGPT’s training consumes excessive GPU resources, limiting adoption in resource-constrained regions. |
High |
SkyEyeGPT |
Democratize access to VLM tools for global environmental monitoring. |