Login | Register

clip-mesh: generating textured meshes from text using pretrained image-text models

Title:

clip-mesh: generating textured meshes from text using pretrained image-text models

mohammad khalid, nasir (2023) clip-mesh: generating textured meshes from text using pretrained image-text models. Masters thesis, Concordia University.

[thumbnail of MohammadKhalid_MCompSc_S_2023.pdf]
Preview
Text (application/pdf)
MohammadKhalid_MCompSc_S_2023.pdf - Accepted Version
Available under License Spectrum Terms of Access.
8MB

Abstract

The following thesis introduces a novel technique for generating textured mesh models without any 3D supervision based solely on a text prompt. This is done by deforming the control shape of a limit subdivided surface along with its texture and normal map to match an input text prompt. The generated mesh asset can be easily integrated into games or modeling applications that rely on widespread rasterization based rendering techniques. The approach relies on a pre-trained Contrastive Language-Image Pre-Training (CLIP) model to compare the input text prompt with differentiably rendered images of our initialized 3D model. Unlike previous works that focused on stylization or required training of generative models, it performs optimization on mesh parameters directly to generate shape, texture, or both. To ensure that the optimization produces plausible meshes and textures, this work introduces several techniques including image augmentations, camera tuning and use of a pre-trained prior that generates CLIP image embeddings given a text embedding. Overall, this method offers a promising solution for zero-shot generation of 3D models, demonstrating the potential of CLIP-based techniques for the field of computer graphics

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:mohammad khalid, nasir
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science
Date:5 April 2023
Thesis Supervisor(s):Popa, Tiberiu and Belilovsky, Eugene
ID Code:992098
Deposited By: Nasir Khalid
Deposited On:21 Jun 2023 14:42
Last Modified:21 Jun 2023 14:42
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top