[WIP] Enable training SAEs on vision models #37

luciaquirke · 2024-12-03T10:06:17Z

Update input flattening to support hidden tensors with more batch dimensions
Enable specifying dummy inputs. The default transformers dummy input has shape [3, 5] and uses the 'input_ids' key, which is not what a vision model expects
Improve logging when hookpoints aren't specified correctly
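To illustrate the flattening change, here is a minimal sketch of the shape arithmetic, not the code in this PR. It assumes the last dimension is the single feature dimension and that every leading dimension (batch, patches, and so on) is collapsed into one sample axis. The helper name `flattened_shape` is hypothetical.

```python
from math import prod


def flattened_shape(shape):
    """Collapse all leading dimensions into one, keeping the last as features.

    Assumption: the hidden tensor has exactly one feature dimension (the last).
    """
    if len(shape) < 2:
        raise ValueError(f"expected at least 2 dimensions, got shape {tuple(shape)}")
    *sample_dims, feature_dim = shape
    return (prod(sample_dims), feature_dim)


# A ViT-style hidden state [batch, patches, hidden] becomes [batch * patches, hidden].
vit_shape = flattened_shape((8, 197, 768))

# An extra leading dimension is folded into the sample axis in the same way.
extra_dim_shape = flattened_shape((2, 4, 16, 32))
```

With a real tensor, the result could be passed straight to `reshape`, for example `hidden.reshape(*flattened_shape(hidden.shape))`.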

TODO:

Enable training from command line (replace tokenizer with image processor)
Decide whether to support image data in MemmapDataset
Check if we can detect vision models and produce correct dummy input (key value, image shape) in the init

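Maybe something like:

from transformers import AutoProcessor
from transformers.image_processing_utils import BaseImageProcessor
processor = AutoProcessor.from_pretrained(args.model, token=args.hf_token)
target_column = "pixel_values" if isinstance(processor, BaseImageProcessor) else "input_ids"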

processor.size.shortest_edge exists, but the processor does not expose a channel count
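One possible way to turn the detected column into a dummy input shape is sketched below. This is only an illustration under stated assumptions: the function name `dummy_input_shapes` is hypothetical, images are assumed square with side `shortest_edge`, and `num_channels` defaults to 3 because the processor does not report a channel count. The text branch reuses the [3, 5] 'input_ids' default mentioned in the description.

```python
def dummy_input_shapes(target_column, shortest_edge=224, num_channels=3, batch_size=3):
    """Return a mapping from input key to dummy tensor shape.

    Assumptions (not taken from the PR): square images, 3 channels by default,
    and a batch size of 3 to mirror the default text dummy input.
    """
    if target_column == "input_ids":
        # Default transformers dummy input: shape [3, 5] under the 'input_ids' key.
        return {"input_ids": (3, 5)}
    if target_column == "pixel_values":
        # Vision models expect [batch, channels, height, width].
        return {"pixel_values": (batch_size, num_channels, shortest_edge, shortest_edge)}
    raise ValueError(f"unsupported target column: {target_column!r}")


text_shapes = dummy_input_shapes("input_ids")
image_shapes = dummy_input_shapes("pixel_values", shortest_edge=32)
```

Making the channel count an explicit argument keeps the guess visible to the caller, which matters for grayscale or multispectral models.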

luciaquirke added 2 commits December 3, 2024 09:39

enable finetuning SAEs

a908ca2

add tensor dimension information to train vision models

d250458

luciaquirke changed the title ~~[WIP] Enable training SAEs on vision models~~ Enable training SAEs on vision models Dec 3, 2024

luciaquirke force-pushed the lucia/vision branch from 7431a52 to dab69e7 Compare December 3, 2024 10:34

Rename instance dims -> sample dims

a78566e

luciaquirke force-pushed the lucia/vision branch 4 times, most recently from 685fcad to c700c96 Compare December 4, 2024 00:20

luciaquirke requested a review from norabelrose December 4, 2024 00:21

luciaquirke changed the title ~~Enable training SAEs on vision models~~ [WIP] Enable training SAEs on vision models Dec 4, 2024

luciaquirke force-pushed the lucia/vision branch from c700c96 to b3a0e55 Compare December 11, 2024 00:26

Remove support for multiple feature dimensions

22102da

luciaquirke force-pushed the lucia/vision branch from b3a0e55 to 22102da Compare December 11, 2024 00:27

Support vision models from the command line

5c72292
