StoryGold Logo

STORYGOLD

foundation model for parametric CAD

Diamond

We're building a 3D reasoning model

01 Vibecoding for CAD design

StoryGold is training a foundation model that understands and generates the 3D world through parametric CAD. Unlike existing AI models that produce static meshes or images, our model generates fully editable, production-ready CAD files.

About illustration

02 Training on terabytes of data

We're building the world's largest parametric CAD dataset. Focusing on parametric CAD enables our model to preserve engineering intent (dimensions, constraints, relationships) that are essential for downstream editing, simulation, and manufacturing.

About illustration

03 Building general intelligence

True general intelligence must understand the physical world. In Jennifer's previous role at Waymo, she built large-scale deep learning models to navigate the real world. It's clear to us that 3D reasoning is required to truly build general intelligence.

About illustration

We've got lots of experience building foundation models at Waymo and Google

LoRA Fine-Tuning for Vision-Language Models

LoRA is touted as being as effective as full-fine tuning, but is it true? LoRA fine-tuning performs significantly worse than full fine-tuning for vision-language models learning spatial reasoning on CAD data, suggesting LoRA's low-rank assumption fails when teaching models genuinely new capabilities far from their pre-training distribution. This potentially means that effective CAD model training requires full fine-tuning with 100,000+ examples and substantial GPU resources, rather than the more efficient LoRA approach that works well for tasks closer to the model's existing knowledge.

Blog post 1 illustration
Blog post 2 illustration

How good is GPT-5 at 3D?

GPT-5 can write syntactically perfect code with the right prompting techniques, but when it comes to actually understanding 3D space and geometry, even OpenAI's most advanced model falls dramatically short of specialized fine-tuned systems. This reveals a critical insight: you can't teach a model to truly "see" and reason about 3D space; some capabilities require fundamentally different training approaches that go far beyond clever prompting. In this post, we present an empirical analysis on GPT-5's performance on parametric CAD generation.

We believe that building AGI requires intelligence that can understand the 3D world