1 entry
Title: moondream
Entries are written by users and may reflect personal opinions or experiences.
Entries
Moondream is a lightweight, open-source vision language model whose source is hosted on GitHub. It comes in two versions: a larger model with 2 billion parameters and a smaller model with 500 million parameters, the latter optimized for resource-constrained devices. The model is aimed at image understanding tasks such as captioning and question answering. Users can interact with Moondream through Python or Node.js client libraries, or through a Hugging Face Transformers integration that supports GPU inference. The project's GitHub repository includes code, documentation, and example applications.
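To make the captioning and question-answering workflow concrete, here is a minimal Python sketch. It does not import the Moondream client itself. It assumes a model object that exposes `caption(image)` and `query(image, question)` methods returning dictionaries with `"caption"` and `"answer"` keys. Those method names and keys are assumptions to verify against the client documentation, and `VisionModel`, `describe_image`, and `StubModel` are hypothetical names introduced only for illustration.

```python
from typing import Any, Protocol


class VisionModel(Protocol):
    """Hypothetical interface: the two calls this sketch relies on."""

    def caption(self, image: Any) -> dict: ...

    def query(self, image: Any, question: str) -> dict: ...


def describe_image(model: VisionModel, image: Any, question: str) -> dict:
    """Run captioning and question answering on a single image."""
    # Assumed return shapes: {"caption": str} and {"answer": str}.
    caption = model.caption(image)["caption"]
    answer = model.query(image, question)["answer"]
    return {"caption": caption, "question": question, "answer": answer}


class StubModel:
    """Stand-in model so the sketch runs without downloading any weights."""

    def caption(self, image: Any) -> dict:
        return {"caption": f"stub caption for {image}"}

    def query(self, image: Any, question: str) -> dict:
        return {"answer": f"stub answer to: {question}"}


if __name__ == "__main__":
    result = describe_image(StubModel(), "photo.jpg", "What is in the image?")
    print(result)
```

With a real model loaded through the Python client or the Transformers integration, the loaded model would presumably be passed in place of `StubModel`, provided its methods match the assumed shapes.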