📝 文章摘要
以下是文章摘要: Ideogram 4 作为一款拥有 93 亿参数的开源 AI 画图模型,正式发布并支持本地部署。该模型在文字渲染和海报设计领域表现卓越,被公认为能比肩 Midjourney 等闭源模型的开源天花板。基准测试显示,其在布局控制上甚至超越了所有闭源模型,专业设计师盲测胜率高达 47.9%。文章提供了详细的本地部署教程,包括模型列表、ComfyUI 配置及工作流使用。通过结构化 JSON Prompt,用户可精确控制版式布局与颜色搭配,实现设计稿级的创作,彻底摆脱云端限制。
<a href='https://520zyw.com/tag/ideogram-4'>Ideogram 4</a>, <a href='https://520zyw.com/tag/%e5%bc%80%e6%ba%90'>开源</a>, <a href='https://520zyw.com/tag/93%e4%ba%bf%e5%8f%82%e6%95%b0'>93亿参数</a>, <a href='https://520zyw.com/tag/%e6%96%87%e5%ad%97%e6%b8%b2%e6%9f%93'>文字渲染</a>, <a href='https://520zyw.com/tag/%e6%9c%ac%e5%9c%b0%e9%83%a8%e7%bd%b2'>本地部署</a>, <a href='https://520zyw.com/tag/comfyui'>ComfyUI</a>, <a href='https://520zyw.com/tag/lora%e5%be%ae%e8%b0%83'>LoRA微调</a>, <a href='https://520zyw.com/tag/%e6%b5%b7%e6%8a%a5%e8%ae%be%e8%ae%a1'>海报设计</a>, <a href='https://520zyw.com/tag/%e5%95%86%e4%b8%9a%e8%a7%86%e8%a7%89'>商业视觉</a>, <a href='https://520zyw.com/tag/%e5%9f%ba%e5%87%86%e6%b5%8b%e8%af%95'>基准测试</a>, <a href='https://520zyw.com/tag/%e5%b8%83%e5%b1%80%e6%8e%a7%e5%88%b6'>布局控制</a>, <a href='https://520zyw.com/tag/%e8%ae%be%e8%ae%a1%e5%b8%88%e7%9b%b2%e6%b5%8b'>设计师盲测</a>

Ideogram 4 开源了!93 亿参数,文字渲染天花板级,真能打平 Midjourney?
AI 画图圈最近炸了个大新闻 ——Ideogram 官方直接把 Ideogram 4 的权重放出来了,93 亿参数版本,支持本地部署、LoRA 微调,还能接 ComfyUI 工作流。
关注 AI 画图的应该都知道 Ideogram,它家的文字渲染能力一直是公认的强。这次开源意味着什么?你终于不用受云端的限制了,自己电脑上就能跑出接近商业级的效果。关键是 —— 这可能是目前唯一一个真的能跟 GPT-Image、Midjourney 掰掰手腕的开源模型。

20260626072903 103447 scaled

跟之前的 SD、FLUX 这些比,Ideogram 4 不光是画质上去了,更重要的是它在文字生成、海报设计、商业视觉这块做了深度优化。还有个挺有意思的东西:官方搞了个结构化 JSON Prompt,你可以精确描述图片内容、版式布局、颜色搭配、光照风格,甚至每个元素的位置 ——AI 不再是 "随便生成一张图",而是真的像按设计稿来做创作。后面我会详细聊 Ideogram 4 的主要特性、硬件要求、怎么本地部署、ComfyUI 怎么用,还有实际生成效果到底怎么样。

20260626072830 264823

先看跑分吧。Design Arena 那个第三方图像 Elo 排行榜上,Ideogram 4 在所有已知开源 AI 画图模型里,是绝对领先的,甩开其他开源模型一大截。
再看基准测试。标准开源基准测了四项:布局控制(7Bench)、空间推理和对象保真度(SpatialGenEval)、文本渲染(X-Omni OCR)、提示对齐(Prism)。Ideogram 4 在这几项上都大幅缩小了跟顶级闭源模型的差距。尤其是布局控制(7Bench),它比所有闭源模型都强。

20260626073120 390207 scaled20260626073300 486476

还有个更有说服力的 ——ContraLabs 找了旗下十位顶尖专业设计师做了盲测,比字体设计。结果 Ideogram 4 胜率一骑绝尘,四个模型里被选为最佳的概率高达 47.9%,比第二名 Gemini 3.1 Flash Image Preview(30.0%)高了一大截,FLUX.2 [max] 才 15.5%,Grok Imagine 1.0 只有 15.0%。

20260626073356 903510

说了这么多,它到底有没有这么神?普通电脑能不能跑?别急,接下来我就本地部署一下,完整测一遍给大家看。

本地部署教程

1、 Ideogram 4 开源模型列表:(下载链接详见下载栏)

【ideogram4_fp8_scaled】【ideogram4_unconditional_fp8_scaled】【qwen3vl_8b_fp8_scaled】【gemma4_e4b_it_fp8_scaled】【flux2-vae】

下载好模型以后,将模型文件放到如下对应的模型存储位置

📂 ComfyUI/
├── 📂 models/
│ ├── 📂 diffusion_models/
│ │ ├── ideogram4_fp8_scaled.safetensors
│ │ └── ideogram4_unconditional_fp8_scaled.safetensors
│ ├── 📂 text_encoders/
│ │ ├── qwen3vl_8b_fp8_scaled.safetensors
│ │ └── gemma4_e4b_it_fp8_scaled.safetensors
│ └── 📂 vae/
│ └── flux2-vae.safetensors

XN7E2{6O@FSU0KI3NL}2P1L

2、安装新版 ComfyUI 客户端

目前只有最新版的客户端才支持载入对应的生图工作流,所以如果你之前安装过旧版的 ComfyUI,建议升级或覆盖安装下。
20260626074504 910506 scaled

3、下载工作流

下载工作流以后,直接将其拖入到 ComfyUI 即可使用!
20260626075017 601150 scaled

魔法提示:

A high-resolution portrait photograph of a stunning mixed Asian woman with distinctive K-pop inspired styling and natural beauty. She has flowing long black hair that cascades over her shoulders, large expressive dark eyes with subtle makeup, and naturally full lips that create a captivating smile. Her slim, athletic figure is elegantly posed in a confident yet playful stance, wearing delicate sheer lingerie in soft neutral tones. The scene is illuminated by warm, intimate bedroom lighting that creates a golden glow across her detailed, flawless skin, with soft shadows adding depth and dimension to the portrait.

20260626075550 540896
还有废墟中的汽车
20260626075651 225476


Pose: Leaning lightly against tree, looking upward
Location: Dense forest with sun rays
Makeup: Fresh glow + soft pink lipstick
Hairstyle: Braided or half-tied
Clothes: Indo-western outfit with deep neckline
Text Tattoo: Minimal on collarbone
Expression: Peaceful, dreamy

20260626095733 177232
功夫熊猫


{
  "high_level_description": "A stylized DreamWorks-style 3D character portrait of a chubby panda kung fu master in a wide horse-stance pose on a misty mountain training ground at golden hour, rendered with exaggerated cartoon proportions and cinematic warm key light against cool blue rim light.",
  "compositional_deconstruction": {
    "background": "Misty mountain peaks layered into the deep background with soft atmospheric haze, fading from dusty rose sky at the top through pale lavender to muted teal-blue silhouettes of distant ridges. Warm amber golden-hour glow spills across the upper-left of the scene while cool blue rim light separates the foreground silhouette from the misty backdrop. Packed-earth ground in warm tan-brown tones extends across the lower portion, scattered with fallen leaves in ochre and rust. Shallow depth of field falls off gently into the mountains.",
    "elements": [
      {
        "type": "obj",
        "desc": "Weathered wooden training post planted upright in the dirt just behind the panda's right shoulder, slightly out of focus. Rough grey-brown bark texture, tapered top, base disappearing into the packed earth."
      },
      {
        "type": "obj",
        "desc": "Chubby giant panda kung fu master, the unambiguous hero subject filling roughly 75% of the frame height from head near upper-third to feet near lower edge, centered horizontally, facing the camera in front view. Stylized CGI in DreamWorks register — large round head, chunky limbs, simplified plastic-cartoon fur clumps in rich black and creamy white. Oversized expressive dark eyes with bright specular catchlights looking directly at the camera, friendly closed-mouth smile, small rounded black ears. Standing in a wide horse-stance martial arts pose, weight settled evenly on both feet, both paws raised in open-palm guard position at chest height. Wearing loose-fitting brown kung fu shorts gathered at the waist with a knotted cloth belt in faded ochre, fabric folds catching the warm amber key light. Small beige cloth wrist wraps tied around each paw above the wrist. Strong clean specular highlights on the black nose and eyes, no micro-skin texture."
      },
      {
        "type": "obj",
        "desc": "Scattered bamboo stalks in teal-green and fallen leaves in ochre and rust tones strewn across the packed-earth ground in the foreground, partially framing the panda's feet. A few broken bamboo segments lie diagonally, leaves curled at the edges."
      }
    ]
  }
}

20260626095831 078567

本文最后更新于2026年6月29日,若涉及的内容可能已经失效,直接留言反馈补链即可,我们会处理,谢谢
声明:本站所有内容均由互联网收集整理、网友上传,并且以计算机技术研究交流为目的,仅供大家参考、学习,请勿用于任何商业目的与商业用途,如需商用请支持正版!如亲下载后改变其用途与使用方式,与本站无任何关系,本站已经进行告知义务!我们只做安全认证测试如果资源侵犯了您的版权利益,请联系站长邮箱:17606723350@163.com