Architecture
General SSL algorithms need to work on arbitrary kinds of data including discrete, continuous, or multimodal inputs.
The DABS baselines use a transformer that operates on patch/token embeddings, but we encourage other approaches that are generally-applicable (e.g. Perceivers).