Radiance Field methods have recently revolutionized novel-view synthesis of scenes captured with multiple photos or videos. However, achieving high visual quality still requires neural networks that are costly to train and render, while recent faster methods i...

Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models

2023 / ACM Transactions on Graphics / 336 citations

No code

Hila Chefer, Yuval Alaluf, Yael Vinker, Lior Wolf, Daniel Cohen‐Or

Recent text-to-image generative models have demonstrated an unparalleled ability to generate diverse and creative imagery guided by a target text prompt. While revolutionary, current state-of-the-art diffusion models may still fail in generating images that fu...

Blended Latent Diffusion

2023 / ACM Transactions on Graphics / 276 citations

No code

Omri Avrahami, Ohad Fried, Dani Lischinski

The tremendous progress in neural image generation, coupled with the emergence of seemingly omnipotent vision-language models has finally enabled text-based interfaces for creating and editing images. Handling generic images requires a diverse underlying gener...

Low-Light Image Enhancement with Wavelet-Based Diffusion Models

2023 / ACM Transactions on Graphics / 194 citations

No code

Hai Jiang, Ao Luo, Haoqiang Fan, Songchen Han, Shuaicheng Liu

Diffusion models have achieved promising results in image restoration tasks, yet suffer from time-consuming, excessive computational resource consumption, and unstable restoration. To address these issues, we propose a robust and efficient Diffusion-based Low-...

MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes

2023 / ACM Transactions on Graphics / 173 citations

No code

Christian Reiser, Rick Szeliski, Dor Verbin, Pratul P. Srinivasan, Ben Mildenhall, and 3 more

Neural radiance fields enable state-of-the-art photorealistic view synthesis. However, existing radiance field representations are either too compute-intensive for real-time rendering or require too much memory to scale to large scenes. We present a Memory-Eff...

Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

2023 / ACM Transactions on Graphics / 156 citations

No code

Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter

Diffusion models have experienced a surge of interest as highly expressive yet efficiently trainable probabilistic models. We show that these models are an excellent fit for synthesising human motion that co-occurs with audio, e.g., dancing and co-speech gesti...

Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

2023 / ACM Transactions on Graphics / 139 citations

No code

Rinon Gal, Moab Arar, Yuval Atzmon, Amit H. Bermano, Gal Chechik, and 1 more

Text-to-image personalization aims to teach a pre-trained diffusion model to reason about novel, user provided concepts, embedding them into new scenes guided by natural language prompts. However, current personalization approaches struggle with lengthy traini...

GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents

2023 / ACM Transactions on Graphics / 126 citations

No code

Tenglong Ao, Zeyi Zhang, Libin Liu

The automatic generation of stylized co-speech gestures has recently received increasing attention. Previous systems typically allow style control via predefined text labels or example motion clips, which are often not flexible enough to convey user intent acc...

HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion

2023 / ACM Transactions on Graphics / 121 citations

No code

Mustafa Işık, Martin Rünz, Markos Georgopoulos, Taras Khakhulin, J. Starck, and 2 more

Representing human performance at high-fidelity is an essential building block in diverse applications, such as film production, computer games or videoconferencing. To close the gap to production-level quality, we introduce HumanRF 1 , a 4D dynamic neural sce...

3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models

2023 / ACM Transactions on Graphics / 114 citations

No code

Biao Zhang, Jiapeng Tang, Matthias Nießner, Peter Wonka

We introduce 3DShape2VecSet, a novel shape representation for neural fields designed for generative diffusion models. Our shape representation can encode 3D shapes given as surface models or point clouds, and represents them as neural fields. The concept of ne...

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation

2023 / ACM Transactions on Graphics / 103 citations

No code

X. Zheng, Hao Pan, Peng‐Shuai Wang, Xin Tong, Yang Liu, and 1 more

Although the recent rapid evolution of 3D generative neural networks greatly improves 3D shape generation, it is still not convenient for ordinary users to create 3D shapes and control the local geometry of generated shapes. To address these challenges, we pro...

NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

2023 / ACM Transactions on Graphics / 100 citations

No code

Yuan Liu, Peng Wang, Cheng Lin, Xiaoxiao Long, J. Wang, and 3 more

We present a neural rendering-based method called NeRO for reconstructing the geometry and the BRDF of reflective objects from multiview images captured in an unknown environment. Multiview reconstruction of reflective objects is extremely challenging because ...

ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models

2023 / ACM Transactions on Graphics / 87 citations

No code

Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, and 4 more

Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffus...

OctFormer: Octree-based Transformers for 3D Point Clouds

2023 / ACM Transactions on Graphics / 81 citations

No code

Peng‐Shuai Wang

We propose octree-based transformers, named OctFormer, for 3D point cloud learning. OctFormer can not only serve as a general and effective backbone for 3D point cloud segmentation and object detection but also have linear complexity and is scalable for large-...

NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads

2023 / ACM Transactions on Graphics / 68 citations

No code

Tobias Kirschstein, Shenhan Qian, Simon Giebenhain, Tim Walter, Matthias Nießner

We focus on reconstructing high-fidelity radiance fields of human heads, capturing their animations over time, and synthesizing re-renderings from novel viewpoints at arbitrary time steps. To this end, we propose a new multi-view capture setup composed of 16 c...

AvatarReX: Real-time Expressive Full-body Avatars

2023 / ACM Transactions on Graphics / 67 citations

No code

Zerong Zheng, Xiaochen Zhao, Hongwen Zhang, Boning Liu, Yebin Liu

We present AvatarReX, a new method for learning NeRF-based full-body avatars from video data. The learnt avatar not only provides expressive control of the body, hands and the face together, but also supports real-time animation and rendering. To this end, we ...

Flexible Isosurface Extraction for Gradient-Based Mesh Optimization

2023 / ACM Transactions on Graphics / 65 citations

No code

Tianchang Shen, Jacob Munkberg, Jon Hasselgren, Kangxue Yin, Zian Wang, and 5 more

This work considers gradient-based mesh optimization, where we iteratively optimize for a 3D surface mesh by representing it as the isosurface of a scalar field, an increasingly common paradigm in applications including photogrammetry, generative modeling, and...

Real-Time Radiance Fields for Single-Image Portrait View Synthesis

2023 / ACM Transactions on Graphics / 64 citations

No code

Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, and 5 more

We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural ra...

A Neural Space-Time Representation for Text-to-Image Personalization

2023 / ACM Transactions on Graphics / 63 citations

No code

Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen‐Or

A key aspect of text-to-image personalization methods is the manner in which the target concept is represented within the generative process. This choice greatly affects the visual fidelity, downstream editability, and disk space needed to store the learned co...

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance

2023 / ACM Transactions on Graphics / 59 citations

No code

Longwen Zhang, Qiwei Qiu, Hongyang Lin, Qixuan Zhang, Shi Cheng, and 5 more

Emerging Metaverse applications demand accessible, accurate and easy-to-use tools for 3D digital human creations in order to depict different cultures and societies as if in the physical world. Recent large-scale vision-language advances pave the way for novic...

Page 1 of 2

Previous Next