时间:2026-01-09 02:29:14 来源:网络整理编辑:綜合
Apple is dabbling in AI image-editing with an open-source multimodal AI model.Earlier this week, res
Apple is dabbling in AI image-editing with an open-source multimodal AI model.
Earlier this week, researchers from Apple and the University of California, Santa Barbara released MLLM-Guided Image Editing, or "MGIE;" a multimodal AI model that can edit images like Photoshop, based on simple text commands.
On the AI development front, Apple has been characteristically cautious about its plans. It was also one of the few companies that didn't announce any big AI plans in the wake of last year's ChatGPT hype. However, Apple reportedly has an in-house version of a ChatGPT-esque chatbot dubbed "Apple GPT" and Tim Cook said Apple will be making some major AI announcements later this year.
SEE ALSO:Tim Cook says big Apple AI announcement is coming later this yearWhether this announcement includes an AI image editing tool remains to be seen, but based on this model, Apple is definitely doing some research and development.
While there are already AI image editing tools out there, "human instructions are sometimes too brief for current methods to capture and follow," said the research paper. This often leads to lackluster or failed results. MGIE is a different approach that uses MLLMs, or multimodal large language models, to understand the text prompts or "expressive instruction," as well as image training data. Effectively, learning from MLLMs helps MGIE understand natural language commands without the need for heavy description.
In examples from the research, MGIE can take an input image of a pepperoni pizza and using the prompt, "make this more healthy" infer that "this" is referring to the pepperoni pizza and "more healthy" can be interpreted as adding vegetables. Thus, the output image is a pepperoni pizza with some green vegetables scattered on top.
In another example comparing MGIE to other models, the input image is a forested shoreline and a tranquil body of water. With the prompt "add lightning and make the water reflect the lightning," other models omit the lightning reflection, but MGIE successfully captures it.
MGIE is available as an open-source model on GitHub and as a demo version hosted on Hugging Face.
TopicsAppleArtificial Intelligence
Carlos Beltran made a very interesting hair choice2026-01-09 02:18
曼聯球迷民調 :55%不希望C羅留隊 40%挺B費當隊長2026-01-09 01:57
國足集訓名單 :廣州隊10人巴西歸化4人在列 武磊缺席2026-01-09 01:34
電視劇都不敢這麽演!新西蘭女足隊員上演烏龍戴帽2026-01-09 01:23
Xiaomi accused of copying again, this time by Jawbone2026-01-09 00:55
瓜帥:利物浦是討厭鬼 渣叔 :要盡可能讓他更討厭2026-01-09 00:53
斯坦福橋16場不敗!切爾西占先機 一數據冠絕英倫2026-01-09 00:45
張康陽拒9億歐出售國米 拆家 ?7000萬歐標價勞塔羅2026-01-09 00:41
More than half of women in advertising have faced sexual harassment, report says2026-01-09 00:17
曝瑞超金靴阿德本羅接近加盟北京國安 年薪280萬歐2026-01-09 00:07
This weird squid looks like it has googly eyes, guys2026-01-09 02:23
粵媒:連續15輪西甲打替補 武磊的留守選擇令人難解2026-01-09 02:14
媒體人:鄭智4月底前無法返廣州隊 誰帶隊備戰?還能留隊嗎?2026-01-09 02:03
水慶霞再次澄清“沒房住” 坦言任何球隊都不能永遠勝利2026-01-09 01:39
Pole vaulter claims his penis is not to blame2026-01-09 01:34
曼聯球迷民調 :55%不希望C羅留隊 40%挺B費當隊長2026-01-09 01:14
巴薩名宿:梅西該回諾坎普退役 巴黎隻是群雇傭兵2026-01-09 00:26
穆帥吃紅牌活該?在主裁判雷區跳舞 尤文無辜躺槍2026-01-09 00:22
Carlos Beltran made a very interesting hair choice2026-01-09 00:01
海港巴西雙槍洛佩斯歸隊心切 保利尼奧遭飛來橫禍2026-01-08 23:47