What is a multimodal model?
A model that can only process images but not text.
A model that can understand and work with more than one type of data, such as text and images.
A model that generates video content only.
How can developers programmatically generate images using Foundry image generation models?
By sending text prompts through the OpenAI Responses API using a deployed image model.
By uploading images through the Foundry Playground UI.
By calling the GPT-4.1 model endpoint.
When you generate images programmatically using the OpenAI Python SDK with Microsoft Foundry, which value should you pass as the model parameter in the request?
The original base model name (for example, gpt-image-1.5).
The deployment name you gave the image generation model in your Foundry resource.
The name you gave your Foundry resource.
Why is video generation with Sora models in Microsoft Foundry handled as an asynchronous job?
Because video generation requires user interaction during rendering.
Because the REST API doesn't support synchronous requests.
Because video generation is resource‑intensive and takes time to complete.