Wednesday, March 11, 2026

Show HN: Satellite imagery object detection using text prompts https://ift.tt/QOCzo1S

Show HN: Satellite imagery object detection using text prompts I built a browser-based tool for detecting objects in satellite imagery using vision-language models (VLMs). You draw a polygon on the map and enter a text prompt such as "swimming pools", "oil tanks", or "buses". The system scans the selected area tile-by-tile and returns detections projected back onto the map as GeoJSON. Pipeline: select area and zoom level, split the region into mercantile tiles, run each tile with the prompt through a VLM, convert predicted bounding boxes to geographic coordinates (WGS84), and render the results back on the map. It works reasonably well for distinct structures in a zero-shot setting. occluded objects are still better handled by specialized detectors like YOLO models. There is a public demo and no login required. I am mainly interested in feedback on detection quality, performance tradeoffs between VLMs and specialized detectors, and potential real-world use cases. https://ift.tt/PgWLdfv March 9, 2026 at 01:22PM

No comments:

Post a Comment

Show HN: Don't share code. Share the prompt https://ift.tt/IyCw38h

Show HN: Don't share code. Share the prompt Hey HN, I'm Mario. I recently talked to a colleague about AI, agents and how software de...