Kimi Unveils Multimodal Image Understanding Model API| TMTPOST

中文

HOME

BRIEF NEWS

OPINION

FEATURES

LIVE

EVENTS

Jan. 15, 2025

Kimi Unveils Multimodal Image Understanding Model API

TMTPOST -- Kimi, a platform developed by Moonshot AI, has officially launched its new multimodal image understanding model, moonshot-v1-vision-preview. This model enhances the capabilities of the moonshot-v1 series by adding advanced multimodal features, including image recognition, text recognition, and comprehension. The Vision model is designed with a pay-as-you-go pricing model, with the token consumption based on the number of images processed. Each image is calculated with 1024 tokens as part of the input request's token usage. Depending on the specific model, the price ranges from 12 to 60 RMB per 1 million tokens.

Subscribe To Our News