Main API of this framework

Base Class

  1. Base Trainer

  2. Base Dataset

Mixin Class

  1. LMMsDataMixin

Processor Class

  1. AeroDataProcessor

  2. LLaVADataProcessor

  3. … (lots of processor)

Collator

  1. Vision Collator (Most of the collator we want to use)

Dataset

  1. Vision Audio

  2. Vision

Proto

  1. Data Proto

  2. LMMs Proto