Forked this to test out curiosity with a GUI and packaging it into an application for UNIX systems. The GUI works fine (see exmaple below), but it is not fully packaged for UNIX systems.
No screenshots. No multi-modal LLMs or special permissions needed.