New AI Tools & Updates: Imagen3, SyncTweedies, DressRecon and More

Imagen3 Released

Google has officially launched Imagen3, their image-generating AI. It’s now available to all Gemini users around the world on Gemini Advanced. Imagen3 produces very high-quality images and is more accurate than other image-generating AIs.

Veo for Video Generation

In addition, Google has introduced “Veo,” an AI that generates videos.

Image generation with Imagen 3 is now available to all Gemini users around the world.

Imagen 3 is our highest quality image generation model yet and brings an even higher degree of photorealism, better instruction following, and fewer distracting artifacts than ever before. pic.twitter.com/E8CrcyFcz5
— Google Gemini App (@GeminiApp) October 9, 2024

SyncTweedies Framework

SyncTweedies is a new system that helps create different types of images using pre-trained models. It works by syncing several image diffusion processes to create fuzzy images, panoramic views, textures on 3D shapes, and more.

https://synctweedies.github.io/

https://synctweedies.github.io/static//synctweedies.pdf Visit synctweedies.github.io
For more info, check out SyncTweedies here:

https://github.com/KAIST-Visual-AI-Group/SyncTweedies

DressRecon Technology

DressRecon is a tool that builds realistic 3D human body models from regular videos. It focuses on how people interact with loose clothing and objects. By combining general body shape data with specific video details, it achieves high-quality 3D reconstructions.

https://jefftan969.github.io/dressrecon/

Learn more about DressRecon here:

https://github.com/jefftan969/dressrecon

Pyramidal Flow Matching

Pyramidal Flow Matching is a fast video generation model that uses open-source datasets for training. It helps create videos more efficiently.

https://pyramid-flow.github.io/

You can find the Pyramidal Flow Matching code here:

https://github.com/jy0205/Pyramid-Flow

https://huggingface.co/rain1011/pyramid-flow-sd3

Gmail Update

Gmail has added a new feature! You can now reply to and summarize emails right on the same screen, thanks to Gemini’s chat feature.

これは超画期的。Gmailにてそのままの画面でメールの返信、要約が可能に。これは全員使うべき機能。

メール右側にGeminiのチャット画面が搭載。メールを開いたままで「返信を作って」と入力すると文面を読み込んで返信を作成、そのまま下書きに挿入も。… pic.twitter.com/kccLUZKHpO
— チャエン | 重要AIニュースを毎日発信⚡️ (@masahirochaen) October 9, 2024

LLavaImageTagger

LLavaImageTagger is a helpful tool that automatically organizes your images. If you have many images without names, it analyzes them, generates keywords, and adds them to the metadata.

Check it out here:

https://github.com/jabberjabberjabber/LLavaImageTagger

Introducing ARIA

ARIA is a new AI model that can handle different types of information, like images, text, and code. It excels in understanding videos and documents.

Find out more about ARIA here:

https://huggingface.co/rhymes-ai/Aria

FluxBooru Updates

FluxBooru v0.1 focuses on generating “non-aesthetic” content. It’s been improved to create more interesting images.

See it here: FluxBooru Hugging Face.

https://huggingface.co/spaces/bghira/FluxBooru-CFG3.5

https://huggingface.co/terminusresearch/flux-booru-CFG3.5

Flux Fusion Update

The new version of Flux Fusion Merge has been released, making it faster to generate images.

https://civitai.com/models/630820

Llama 3.1 in Medicine

Llama 3.1 is being trained with synthetic data from doctors. This makes it better at tasks in the medical field, like answering questions and summarizing medical research.

Training Llama 3.1 on clinician-created synthetic data, using prompt engineering techniques and RAG; Neuromnia developed Nia: a human-centric AI co-pilot to support work on some of the most pressing challenges for autism care ➡️ https://t.co/mXqooP0dsV pic.twitter.com/AP1gO6lCt3
— AI at Meta (@AIatMeta) October 10, 2024

Kaiber Update

Kaiber, the AI video generation service used in a Linkin Park music video, is getting an update. Keep an eye out for the preview!

https://kaiber.ai/comingsoon

Sign up for updates here: Kaiber.

https://kaiber.ai/

Bolt Update

The AI tool Bolt, which helps create web applications in Japanese, has been updated. You can now purchase tokens to use the service.

Token Reloading has landed on prod! 🫡#BoltRelease notes:

– $10 per 10 million tokens
– Buy in increments of 10
– Available only on personal plans right now
– Team plan token reloading landing 🔜, until then we have increased team user limits to 30m tokens/day
– Higher… pic.twitter.com/npVmESk3Bb
— StackBlitz (@stackblitz) October 9, 2024

https://bolt.new/

Blinkshot Consistency Mode

Blinkshot, which generates images in real-time, now has a consistency mode. This keeps the background and subject the same while generating images.

Check it out: Blinkshot.

Shipped "consistency mode" to Blinkshot!

This will generate consistent images, keeping the background or main subject mostly the same pic.twitter.com/3G0y7fXXNr
— Hassan (@nutlope) October 8, 2024

https://www.blinkshot.io/

Fotographer AI Demo Video

Here’s a demo video of “Fotographer AI,” an automatic product image generation service that recreates scenes from Tokyo.

Learn more about Fotographer.ai: Fotographer.

FAI: video creation

Today, I created an amazing video using FAI! 🌸✨
Straight from Tokyo, Japan, it captures the beauty and realism that AI can bring to life. The sense of beauty is truly breathtaking! #AI #Tokyo #Creativity pic.twitter.com/nt9DF0yCTF
— Fotographer AI (@FotographerAI) October 10, 2024

https://fotographer.ai/en/

LM Studio 0.3.4 Link

You can find the link to “LM Studio 0.3.4,” a tool that runs large language models quickly on Apple Silicon Macs.

Check it out here: LM Studio.

https://lmstudio.ai/

Generative AI Exhibition

The “Generative AI Anything Exhibition” is looking for exhibitors. You can apply in different categories like Audio, Image, Language, and more.

Find more information here: Exhibition Form.

🎉【第二回生成AIなんでも展示会】展示者を募集中！

📢 募集枠：
音声：2名
画像：2名
言語：2名
その他：1〜2枠
✨ 前回と違う展示物を優先的に展示したいです。AIを利用したゲームやロボット、XRも大歓迎！

⚠️ 禁止事項：
イベント内での営利活動や求人活動
会社名での出展…
— saldra(サルドラ) (@sald_ra) October 8, 2024

https://docs.google.com/forms/d/e/1FAIpQLScdUKqCq1sDbelwYmUEp0VDwamksIFEhjPE-7qxCjAGr-SSNQ/viewform

Meta’s AI Expansion

Meta has announced that its conversational AI is now available in 21 new markets.

That’s all for today! It seems like Google is really stepping up! Keep in mind, you need to sign up for Gemini Advanced, which costs 2,900 yen a month, but you can try it for free for the first month. It might be worth checking out!