A Unified, High-Fidelity, and Multi-Format 3D Asset Generation Framework powered by Trellis
Trellis represents a monumental leap in the field of 3D content creation. Developed by researchers at Microsoft Research (MSRA) and HKU, the Trellis model introduces a revolutionary approach to generating high-quality 3D assets from single images or textual prompts. Unlike previous generation methods that struggle with topology or speed, Trellis utilizes a Structured Latent Diffusion (SLAT) architecture to ensure that every Trellis-generated asset maintains professional-grade geometry and texture. As a cornerstone of the Story321 model library, Trellis empowers creators to bypass traditional modeling bottlenecks. Whether you are building a Trellis-based game environment or a Trellis-driven VR experience, the Trellis framework provides the precision and versatility required for modern digital production. The Trellis ecosystem is designed to be the definitive solution for high-fidelity 3D synthesis in the AI era.

Trellis is a state-of-the-art 3D generative model that leverages large-scale pre-training on a massive dataset of over 500,000 3D objects. At its heart, Trellis is built upon a Rectified Flow Transformer architecture, scaling up to 2 billion and even 4 billion parameters in its Trellis.2 iteration. The core innovation of Trellis lies in its ability to map visual features into a structured latent space (SLAT), which Trellis then uses to decode into multiple 3D representations simultaneously. Trellis does not just output a simple mesh; it can generate 3D Gaussian Splatting (3DGS), Radiance Fields, and high-resolution Meshes, all from the same Trellis latent code. This unified approach makes Trellis uniquely capable of handling complex topologies and non-manifold geometries that other models often fail to represent. By integrating advanced O-Voxel representations, Trellis ensures that even the most intricate details of a Trellis model are preserved with surgical accuracy.
Trellis offers unified generation of 3D Gaussians, Meshes, and Radiance Fields.
Trellis employs a 2B-4B parameter Rectified Flow Transformer for superior reasoning.
Trellis achieves lightning-fast feed-forward inference without per-instance optimization.
Inside the Trellis Engine: Innovation at Scale
The Trellis SLAT architecture is the secret to its success. Trellis organizes 3D data into a structured grid that balances global structure with local detail, allowing Trellis to maintain consistency across different views.
A single Trellis inference run can be decoded into different formats. This means Trellis users can switch between Mesh and 3DGS without re-running the heavy Trellis diffusion process, saving massive computational resources.
Trellis is trained on a diverse corpus of 3D data, enabling Trellis to understand a wide array of categories, from organic characters to hard-surface mechanical parts. Trellis's knowledge base is truly world-class.
Trellis allows for latent-space manipulations, meaning you can perform Trellis-based local edits or generate variations of a Trellis model by simply adjusting the input parameters or the Trellis latent seed.
From Image to Trellis 3D: A Step-by-Step Breakdown
Trellis starts by taking an input image and passing it through a visual encoder (like DINOv2). Trellis then projects these features into a 3D sparse grid, initializing the Trellis structured latent space.
The Trellis Rectified Flow Transformer performs denoising within the latent space. Through a series of Trellis inference steps, the model refines the 3D structure, ensuring the Trellis output aligns perfectly with the input prompt.
Once the Trellis latent is finalized, specialized Trellis decoders transform the data into a usable 3D format. Whether it is a Trellis Mesh or a Trellis Gaussian Splat, the final output is ready for immediate deployment.
Empowering Industries with Trellis 3D Technology
Game studios use Trellis to automate the creation of background assets and props. Trellis significantly reduces the cost per asset, allowing Trellis-powered pipelines to scale content production exponentially.
Retailers use Trellis to turn product photos into interactive 3D models. Trellis provides customers with a 360-degree view of products, enhancing the Trellis-driven shopping experience and reducing return rates.
Trellis is the perfect tool for building virtual worlds. By using Trellis to generate environments and avatars, creators can populate Trellis-based digital spaces with diverse and high-quality 3D content in minutes.
Deepening Your Understanding of the Trellis Ecosystem
Trellis is a large-scale model. While basic Trellis inference can run on a 16GB VRAM GPU (like an RTX 3090), the full Trellis-X-Large or Trellis.2 models perform best on 24GB+ VRAM hardware to handle the deep Trellis latent processing.
Trellis stands out due to its SLAT architecture and multi-format output. Unlike models that focus only on meshes, Trellis provides a more flexible Trellis framework that includes 3D Gaussians and Radiance Fields with higher topological accuracy.
Yes, Trellis supports text-to-3D through its multi-modal Trellis embedding system. You can describe an object, and Trellis will synthesize a corresponding Trellis 3D asset that follows your textual instructions precisely.
Microsoft has released the Trellis research and code for academic purposes. The Trellis community is actively building plugins to bring Trellis capabilities into software like Blender and Unreal Engine.
Join the 3D revolution with the Trellis model on Story321. Start generating high-fidelity Trellis assets and transform your creative vision into a Trellis 3D reality with just one click.