Viral AI company DeepSeek releases new image model family

2 days ago 23
ARTICLE AD

DeepSeek, the viral AI company, has released a new set of multimodal AI models that it claims can outperform OpenAI’s DALL-E 3.

The models, which are available for download from the AI dev platform Hugging Face, are a part of a new model family that DeepSeek is calling Janus Pro. They range in size from 1 billion parameters to 7 billion parameters. Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.

DeepSeek imageImage outputs from DeepSeek’s Janus models. Image Credits:DeepSeek

Janus Pro, which DeepSeek describes as a “novel autoregressive framework,” can both analyze and create new images. According to the company, on two AI evaluation benchmarks, GenEval and DPG-Bench, the largest Janus Pro model, Janus Pro 7B, beats DALL-E 3 as well as models such as PixArt-alpha, Emu3-Gen, and Stability AI‘s Stable Diffusion XL.

Some of those models are on the older side, granted. But Janus Pro 7B’s performance is impressive, considering the model’s relatively small size.

“Janus Pro surpasses previous unified model and matches or exceeds the performance of task-specific models,” DeepSeek writes in a post on Hugging Face. “The simplicity, high flexibility, and effectiveness of Janus Pro make it a strong candidate for next-generation unified multimodal models.”

DeepSeek imageDeepSeek’s new Janus models compared with the competition.Image Credits:DeepSeek

DeepSeek, a Chinese AI lab funded largely by the quantitative trading firm High-Flyer Capital Management, broke into the mainstream consciousness this week after its chatbot app rose to the top of the Apple App Store charts. DeepSeek’s language models, which were trained using compute-efficient techniques, have led many Wall Street analystsand technologists — to question whether the U.S. can maintain its lead in the AI race, and whether the demand for AI chips will sustain.

Kyle Wiggers is a senior reporter at TechCrunch with a special interest in artificial intelligence. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Brooklyn with his partner, a piano educator, and dabbles in piano himself. occasionally — if mostly unsuccessfully.

Subscribe for the industry’s biggest tech news

Related

Read Entire Article