时间:2025-08-02 09:25:28 来源:网络整理编辑:知識
Apple is dabbling in AI image-editing with an open-source multimodal AI model.Earlier this week, res
Apple is dabbling in AI image-editing with an open-source multimodal AI model.
Earlier this week, researchers from Apple and the University of California, Santa Barbara released MLLM-Guided Image Editing, or "MGIE;" a multimodal AI model that can edit images like Photoshop, based on simple text commands.
On the AI development front, Apple has been characteristically cautious about its plans. It was also one of the few companies that didn't announce any big AI plans in the wake of last year's ChatGPT hype. However, Apple reportedly has an in-house version of a ChatGPT-esque chatbot dubbed "Apple GPT" and Tim Cook said Apple will be making some major AI announcements later this year.
SEE ALSO:Tim Cook says big Apple AI announcement is coming later this yearWhether this announcement includes an AI image editing tool remains to be seen, but based on this model, Apple is definitely doing some research and development.
While there are already AI image editing tools out there, "human instructions are sometimes too brief for current methods to capture and follow," said the research paper. This often leads to lackluster or failed results. MGIE is a different approach that uses MLLMs, or multimodal large language models, to understand the text prompts or "expressive instruction," as well as image training data. Effectively, learning from MLLMs helps MGIE understand natural language commands without the need for heavy description.
In examples from the research, MGIE can take an input image of a pepperoni pizza and using the prompt, "make this more healthy" infer that "this" is referring to the pepperoni pizza and "more healthy" can be interpreted as adding vegetables. Thus, the output image is a pepperoni pizza with some green vegetables scattered on top.
In another example comparing MGIE to other models, the input image is a forested shoreline and a tranquil body of water. With the prompt "add lightning and make the water reflect the lightning," other models omit the lightning reflection, but MGIE successfully captures it.
MGIE is available as an open-source model on GitHub and as a demo version hosted on Hugging Face.
TopicsAppleArtificial Intelligence
You can now play 'Solitaire' and 'Tic2025-08-02 09:20
一波N折 !紅牌+點球+逆轉 尤文7分鍾3球打垮羅馬2025-08-02 08:50
重返意甲 !官方 :26歲皮亞特克租借加盟佛羅倫薩2025-08-02 08:42
泰山VS海港首發:莫伊塞斯費萊尼領銜 奧斯卡出戰2025-08-02 08:37
This German startup wants to be your bank (without being a bank)2025-08-02 07:46
曼聯前瞻 :傑拉德率隊戰死敵 C羅首發位置或不保2025-08-02 07:27
專家:鄭龍未進名單是怎麽進場地的? 裁判不應向他出示紅牌2025-08-02 07:04
海港兩度進足協杯決賽全都失利 國門失誤葬送冠軍2025-08-02 06:59
Snapchat is about to explode in popularity, report says2025-08-02 06:49
曝大連人球員教練賽後毆打裁判 媒體人:情節惡劣(gif)2025-08-02 06:48
This app is giving streaming TV news a second try2025-08-02 09:18
哈維:球隊犯了很多低級錯誤 應該進行自我批評2025-08-02 09:04
韓國女足公布女足亞洲杯名單 池笑然李玟娥領銜2025-08-02 08:56
曝大連人球員教練賽後毆打裁判 媒體人:情節惡劣(gif)2025-08-02 08:54
The U.S. will no longer have the final say on internet domain names2025-08-02 08:10
多倫多FC官宣簽下因西涅 球員在賽季結束後正式加盟2025-08-02 07:37
名記 :博格巴若合同到期離開曼聯 大概率加盟巴黎2025-08-02 07:29
曝12強賽後國足將進入新循環 戴偉浚符合未來建隊標準2025-08-02 07:26
Dog elected for third term as mayor of Minnesota town2025-08-02 06:46
國足集訓大框架不動個別位置調整 新麵孔為中場變陣打基礎2025-08-02 06:46