Generative Model Encoder/Decoder

A Latent Multi-Scale Residual Transformer Approach for Cross-Modal Medical Image Synthesis

Abstract: Cross-modal generation has emerged as a crucial method for addressing the challenge of filling in missing modalities in medical imaging. Existing approaches predominantly utilize ...

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

GLM-Image explained: Huawei-powered AI that seriously challenges Nvidia, here’s how

For the past few years, a single axiom has ruled the generative AI industry: if you want to build a state-of-the-art model, ...

GitHub

GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models

1 Centre for Digital Music, Queen Mary University of London, U.K. 2 Music & Audio Machine Learning Lab, Universal Music Group, London, U.K. Multimodal contrastive models have achieved strong ...

IEEE

High-Capacity Image Steganography Via Latent Diffusion Models

Abstract: Generative steganography has recently attracted considerable attention due to its superior security properties. However, most existing approaches suffer from limited hiding capacity. To ...

Hosted on MSN

Meta just achieved mind-reading using AI

This video covers two breakthroughs that push “mind reading” from fiction into early reality: UT Austin decoding language from brain activity, and Meta decoding visual perception from brain waves. At ...

InfoQ

Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results