We introduce 3DShape2VecSet, a novel shape representation for neural fields designed for generative diffusion models. Our shape representation can encode 3D shapes given as surface models or point clouds, and represents them as neural fields. The concept of ne...

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation

2023 / ACM Transactions on Graphics / 103 citations

No code

X. Zheng, Hao Pan, Peng‐Shuai Wang, Xin Tong, Yang Liu, and 1 more

Although the recent rapid evolution of 3D generative neural networks greatly improves 3D shape generation, it is still not convenient for ordinary users to create 3D shapes and control the local geometry of generated shapes. To address these challenges, we pro...

NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

2023 / ACM Transactions on Graphics / 100 citations

No code

Yuan Liu, Peng Wang, Cheng Lin, Xiaoxiao Long, J. Wang, and 3 more

We present a neural rendering-based method called NeRO for reconstructing the geometry and the BRDF of reflective objects from multiview images captured in an unknown environment. Multiview reconstruction of reflective objects is extremely challenging because ...

ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models

2023 / ACM Transactions on Graphics / 87 citations

No code

Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, and 4 more

Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffus...

OctFormer: Octree-based Transformers for 3D Point Clouds

2023 / ACM Transactions on Graphics / 81 citations

No code

Peng‐Shuai Wang

We propose octree-based transformers, named OctFormer, for 3D point cloud learning. OctFormer can not only serve as a general and effective backbone for 3D point cloud segmentation and object detection but also have linear complexity and is scalable for large-...

NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads

2023 / ACM Transactions on Graphics / 68 citations

No code

Tobias Kirschstein, Shenhan Qian, Simon Giebenhain, Tim Walter, Matthias Nießner

We focus on reconstructing high-fidelity radiance fields of human heads, capturing their animations over time, and synthesizing re-renderings from novel viewpoints at arbitrary time steps. To this end, we propose a new multi-view capture setup composed of 16 c...

AvatarReX: Real-time Expressive Full-body Avatars

2023 / ACM Transactions on Graphics / 67 citations

No code

Zerong Zheng, Xiaochen Zhao, Hongwen Zhang, Boning Liu, Yebin Liu

We present AvatarReX, a new method for learning NeRF-based full-body avatars from video data. The learnt avatar not only provides expressive control of the body, hands and the face together, but also supports real-time animation and rendering. To this end, we ...

Flexible Isosurface Extraction for Gradient-Based Mesh Optimization

2023 / ACM Transactions on Graphics / 65 citations

No code

Tianchang Shen, Jacob Munkberg, Jon Hasselgren, Kangxue Yin, Zian Wang, and 5 more

This work considers gradient-based mesh optimization, where we iteratively optimize for a 3D surface mesh by representing it as the isosurface of a scalar field, an increasingly common paradigm in applications including photogrammetry, generative modeling, and...

Real-Time Radiance Fields for Single-Image Portrait View Synthesis

2023 / ACM Transactions on Graphics / 64 citations

No code

Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, and 5 more

We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural ra...

A Neural Space-Time Representation for Text-to-Image Personalization

2023 / ACM Transactions on Graphics / 63 citations

No code

Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen‐Or

A key aspect of text-to-image personalization methods is the manner in which the target concept is represented within the generative process. This choice greatly affects the visual fidelity, downstream editability, and disk space needed to store the learned co...

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance

2023 / ACM Transactions on Graphics / 59 citations

No code

Longwen Zhang, Qiwei Qiu, Hongyang Lin, Qixuan Zhang, Shi Cheng, and 5 more

Emerging Metaverse applications demand accessible, accurate and easy-to-use tools for 3D digital human creations in order to depict different cultures and societies as if in the physical world. Recent large-scale vision-language advances pave the way for novic...

From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans

2023 / ACM Transactions on Graphics / 57 citations

No code

Marilyn Keller, Keenon Werling, Soyong Shin, Scott L. Delp, Sergi Pujades, and 2 more

Great progress has been made in estimating 3D human pose and shape from images and video by training neural networks to directly regress the parameters of parametric human models like SMPL. However, existing body models have simplified kinematic structures tha...

EgoLocate: Real-time Motion Capture, Localization, and Mapping with Sparse Body-mounted Sensors

2023 / ACM Transactions on Graphics / 56 citations

No code

Xinyu Yi, Yuxiao Zhou, Marc Habermann, Vladislav Golyanik, Shaohua Pan, and 2 more

Human and environment sensing are two important topics in Computer Vision and Graphics. Human motion is often captured by inertial sensors, while the environment is mostly reconstructed using cameras. We integrate the two techniques together in EgoLocate, a sy...

Word-As-Image for Semantic Typography

2023 / ACM Transactions on Graphics / 56 citations

No code

Shir Iluz, Yael Vinker, Amir Hertz, Daniel Berio, Daniel Cohen‐Or, and 1 more

A word-as-image is a semantic typography technique where a word illustration presents a visualization of the meaning of the word, while also preserving its readability. We present a method to create word-as-image illustrations automatically. This task is highl...

Globally Consistent Normal Orientation for Point Clouds by Regularizing the Winding-Number Field

2023 / ACM Transactions on Graphics / 52 citations

No code

Rui Xu, Zhiyang Dou, Ningna Wang, Shiqing Xin, Shuangmin Chen, and 4 more

Estimating normals with globally consistent orientations for a raw point cloud has many downstream geometry processing applications. Despite tremendous efforts in the past decades, it remains challenging to deal with an unoriented point cloud with various impe...

UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image

2023 / ACM Transactions on Graphics / 51 citations

No code

Dani Valevski, Matan Kalman, Eyal Molad, Eyal Segalis, Yossi Matias, and 1 more

Text-driven image generation methods have shown impressive results recently, allowing casual users to generate high quality images by providing textual descriptions. However, similar capabilities for editing existing images are still out of reach. Text-driven ...

Textured Mesh Quality Assessment: Large-scale Dataset and Deep Learning-based Quality Metric

2023 / ACM Transactions on Graphics / 49 citations

No code

Yana Nehmé, Johanna Delanoy, Florent Dupont, Jean‐Philippe Farrugia, Patrick Le Callet, and 1 more

Over the past decade, three-dimensional (3D) graphics have become highly detailed to mimic the real world, exploding their size and complexity. Certain applications and device constraints necessitate their simplification and/or lossy compression, which can deg...

Page 2 of 7

Previous Next