Mohamed Elrefaie
NeurIPS 2024 · Datasets & BenchmarksMIT Research Spotlight

DrivAerNet++: A Large-Scale Multimodal Car Dataset with CFD Simulations and Deep Learning Benchmarks

Mohamed Elrefaie, Florin Morar, Angela Dai, Faez Ahmed

Massachusetts Institute of Technology · Technical University of Munich

0
Car designs with high-fidelity CFD
0TB
Of multimodal data
0M
CPU-hours of simulation
0
Per-part semantic labels
0
Data modalities
The dataset

The largest public dataset for aerodynamic car design

DrivAerNet++ comprises 8,150 diverse car designs simulated with high-fidelity computational fluid dynamics, spanning fastback, notchback, and estateback configurations — with underbody and wheel variants representing both combustion and electric vehicles. Generated on MIT SuperCloud across 60 nodes and 2,880 CPU cores, it is the largest and most comprehensive resource of its kind for data-driven aerodynamic design.

High-fidelity CFD: streamlines and surface pressure across diverse DrivAerNet++ geometries.
High-fidelity CFD: streamlines and surface pressure across diverse DrivAerNet++ geometries.
Design space

Every conventional car shape, parametrized

Each 3D geometry is fully described by 26 design parameters chosen for their aerodynamic impact, with ranges constrained to manufacturable, aesthetically plausible designs. This breadth lets models generalize, supports exploration of unconventional designs, and exposes how geometric features drive aerodynamic performance.

Morphing through the 26-parameter design space.
Morphing through the 26-parameter design space.
Shape variation across fastback, notchback, and estateback configurations.
Shape variation across fastback, notchback, and estateback configurations.
Multimodality

One car, ten representations

Every design ships as parametric data, full 3D volumetric fields (pressure, velocity, turbulence), surface fields (Cp, wall shear stress), streamlines, point clouds, meshes, aerodynamic coefficients, part annotations, photorealistic renderings, and sketches.

Volumetric & surface fields

Full 3D CFD fields plus pressure coefficient and wall-shear-stress distributions on every surface.

Part-level annotations

Detailed semantic labels for 29 car components — wheels, mirrors, doors — enabling segmentation, classification, and automated meshing.

29 class labels

Sketch-to-design

Hand-drawn-style sketches (Canny & CLIPasso) and photorealistic renderings bridge conceptual creativity and computational design.

Modalities: parametric models, fields, point clouds, meshes, renderings, and sketches.
Modalities: parametric models, fields, point clouds, meshes, renderings, and sketches.
Per-part semantic annotations across 29 component classes.
Per-part semantic annotations across 29 component classes.
Benchmarks

A community benchmark, not just a dataset

DrivAerNet++ ships with train/val/test splits and deep-learning baselines for surrogate modeling of drag, surface fields, and full volumetric flow. It powers the CarBench leaderboard and is integrated into NVIDIA PhysicsNeMo (FIGConvUNet, AeroGraphNet) and Baidu PaddleScience — and served as the basis of an IJCAI 2024 competition.

DrivAerNet++ vs. existing aerodynamics datasets: largest in scale, broadest in modality coverage.
DrivAerNet++ vs. existing aerodynamics datasets: largest in scale, broadest in modality coverage.
Reference

Citation

@inproceedings{NEURIPS2024_013cf29a, author = {Elrefaie, Mohamed and Morar, Florin and Dai, Angela and Ahmed, Faez}, booktitle = {Advances in Neural Information Processing Systems}, title = {DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks}, volume = {37}, pages = {499--536}, year = {2024} }
Keep exploring

Related work