I couldn't find information about this in the documentation. Could someone clarify if this API is available and how to access it?
If you are working with computer vision, you might consider using other APIs. For example, for image generation, Image Playground should help.
The Apple Foundation Models introduced at WWDC25 accept text as input and generate text as output. If your input is not text, then depending on your concrete use case, you might consider converting it to a text description, which the models should be able to handle.
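As a rough illustration of the text-in, text-out flow, here is a minimal sketch using the FoundationModels framework. It assumes a device and OS release that support the on-device model; the `caption(for:)` helper, the instructions string, and the sample description are hypothetical and only stand in for whatever text your app derives from its non-text input.

```swift
import FoundationModels

// Hypothetical helper: takes a text description of some non-text input
// (for example, a description your app produced from an image) and asks
// the on-device model for a one-sentence caption.
func caption(for description: String) async throws -> String {
    // The on-device model may be unavailable (unsupported device,
    // Apple Intelligence turned off, model not yet downloaded), so check first.
    guard case .available = SystemLanguageModel.default.availability else {
        return "The on-device model is not available."
    }

    // The instructions string is an illustrative assumption.
    let session = LanguageModelSession(
        instructions: "You write short, friendly captions."
    )

    // Text goes in, text comes out.
    let response = try await session.respond(
        to: "Write a one-sentence caption for: \(description)"
    )
    return response.content
}
```

You could call this from an async context, for example `let text = try await caption(for: "A golden retriever catching a frisbee on a beach.")`. The generated text will vary from run to run.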
Best,
——
Ziqiao Chen
Worldwide Developer Relations.