Sat, Oct 19, 4:38 PM CDT

Stability AI Introduces TripoSR: Fast 3D Object Generation from Single Images

Mar 10, 2024 at 03:09 pm by PJeditor



  • Stability AI is releasing TripoSR in partnership with Tripo AI, generating high-quality 3D models from a single image in under a second.

  • TripoSR runs under low inference budgets (even without a GPU), which is both accessible and practical for a wide range of users and applications.

  • The model weights and source code are available for download here under the MIT license, allowing commercialized, personal, and research use. 

We have partnered with Tripo AI to develop TripoSR, a fast 3D object reconstruction model inspired by the recent work of LRM: Large Reconstruction Model For Single Image to 3D. This new image-to-3D model is designed to cater to the growing demands of entertainment, gaming, industrial design, and architecture professionals, with responsive outputs for visualizing detailed 3D objects.

 

Sample input images and the corresponding 3D models were reconstructed using the TripoSR model.

Performance

TripoSR can create detailed 3D models in a fraction of the time of other models. When tested on an Nvidia A100, it generates draft-quality 3D outputs (textured meshes) in around 0.5 seconds, outperforming other open image-to-3D models such as OpenLRM. In addition to speed, our model is fully accessible to users with or without GPUs. 

The plot shows the 3D performance with F-Score (higher the better) vs. inference time (lower the better).

 

Comparing TripoSR 3D reconstructions with those from OpenLRM.

Technical Details

Our training data preparation incorporates diverse data rendering techniques that more closely replicate the distribution of images found in the real world, significantly improving the model's ability to generalize. We carefully curated a CC-BY, a higher-quality subset of the Objaverse dataset, for the training data. On the model side, we also introduced several technical improvements over the base LRM model, including channel number optimization, mask supervision, and a more efficient crop rendering strategy. You can read the technical report for more details. 

We invite developers, designers, and creators to explore its capabilities, contribute to its evolution, and discover its potential to transform their work and industries. 

The code for the TripoSR model is now available on Tripo AI’s GitHub, and the model weights are available on Hugging Face. Please refer to our technical report for more details on the TripoSR model.

 
Sections: Industry News



Privacy Notice

This site uses cookies to deliver the best experience. Our own cookies make user accounts and other features possible. Third-party cookies are used to display relevant ads and to analyze how Renderosity is used. By using our site, you acknowledge that you have read and understood our Terms of Service, including our Cookie Policy and our Privacy Policy.