We build open tools for efficient AI deployment. Our research focuses on quantization methods that preserve model quality while dramatically reducing hardware requirements, making it possible to run 400B+ parameter models on a single machine.
baa.ai · SWAN Paper · GitHub