Kimi Unveils Multimodal Image Understanding Model API
TMTPOST -- Kimi, a platform developed by Moonshot AI, has officially launched its new multimodal image understanding model, moonshot-v1-vision-preview. This model enhances the capabilities of the moonshot-v1 series by adding advanced multimodal features, including image recognition, text recognition, and comprehension.
The Vision model is designed with a pay-as-you-go pricing model, with the token consumption based on the number of images processed. Each image is calculated with 1024 tokens as part of the input request's token usage. Depending on the specific model, the price ranges from 12 to 60 RMB per 1 million tokens.
More News